Three skills principles in machine learning basics of machine learning

Source: Internet
Author: User

The Ames Razor principle (Occam ' s Razor)

One sentence is said, "an explanation of the data should is mad as simple as possible,but no simpler".
The meaning of machine learning is that the simplest explanation of the data is the best explanation (the simplest model, fits the data is also and the most plausible).


For example, the picture above, the right is not better than the left to explain? That's obviously not the case.

Do not add entities if it is not necessary
The law of the Valeo Razor, the simple and effective principle, says, do not waste more things to do, with less things, the same can do things.

So, we tend to choose simple, less parametric assumptions than complex assumptions, and we want to choose simpler models so that the number of valid assumptions is not many.
Another explanation is that, assuming there is a simple hypothesis H, if it can distinguish a set of data well, then it is true that this set of data has some regularity.

Sampling deviation (sampling Bias)

If the data is sampled in a biased way,learning would produce a similarily biased outcome.
This sentence tells us that if the sampled data is biased, then the effect of learning is also biased, this situation is called sampling deviation.
In reality, we need training data and test data from the same distribution.
To avoid this problem, what we can do is to understand the test environment, so that the training environment or training data and test environment as close as possible.

Data snooping (snooping)

Any process you use data is indirectly prying into the data, so when you make a decision, you know that the data may have been contaminated by the complexity of the model in your mind.
The effective way to avoid this situation is to:

  • Don't look at the data until you make a decision
  • Always have doubts.

Reprint please indicate the author Jason Ding and its provenance
GitHub home page (http://jasonding1354.github.io/)
CSDN Blog (http://blog.csdn.net/jasonding1354)
Jane Book homepage (http://www.jianshu.com/users/2bd9b48f6ea8/latest_articles)

Three skills principles in machine learning basics of machine learning

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.