Machine Learning Knowledge Framework

Source: Internet
Author: User

Today began to study Stanford University CS229 course, do not want to completely copy the handout, hope to add their own understanding, beginners, inevitably error, welcome correct.

The first lesson introduces the knowledge framework for machine learning, CS229, as long as the inductive reasoning approach in machine learning is described, which induces general conclusions from enough cases and, in turn, helps people solve problems related to specific cases, especially those that have not been seen before. Inductive learning is the most important means of machine learning at present.

According to the characteristics of input data, inductive learning can be divided into four categories: supervised learning, unsupervised learning, semi-supervised learning and intensive learning.
supervised learning : "Input data" and "expected output data" are known, known as "callout" data. The task is to find the input and output correspondence (function), the "actual output data" and "Expected output data" error as a criterion to evaluate the current system performance.
Example: Provide training data on "housing information" and corresponding "house price" to predict the price of a house other than the training data
Given the "tumor volume" and the corresponding "Whether it is malignant" training data, predicting whether a tumor outside the training data is malignant

Unsupervised learning : Only input data, no expected output data, called "unlabeled data". The task is to find the distribution pattern of the input data or the correlation between the different parts.
Example: Classification of images, recognition of objects in images

semi-supervised learning : Supervised learning data need to be labeled, unsupervised learning data do not need to have labels, data volume is large, the data is a heavy burden of work, so some scholars put forward semi-supervised learning. The data used is divided into two parts: a small amount of callout data and a large amount of unlabeled data. The learning process is to obtain a preliminary model by learning the labeling data, using the model to determine the expected output of unlabeled data, transforming it into labeled data, and then learning through this part of the transformed data. (Trial-and-error learning may lead to incorrect labeling, resulting in decreased learning results)

Reinforcement Learning : According to the input data output, the rewards and punishments, rewards and punishments feedback to the model, the model to make adjustments. Reinforcement learning emphasizes cumulative benefits and focuses on the results of a series of decisions, rather than on the results of each isolated point. Reinforcement learning comes from the discipline of automatic control, the earliest mainly used in robotics.
Examples: robotics, driverless cars, etc.

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.