Machine learning "1" (Python Machines Learning reading notes)

Source: Internet
Author: User

is still published as a reading note, not involving too many code and tools, as an understanding of the article to introduce machine learning.

The article is divided into two parts, machine learning Overview and Scikit-learn Brief Introduction, the two parts of close relationship, combined writing, so that the overall length, divided into 1, 22.

First, it's about machine learning. Key points are as follows:

1.1 Machine learning three main ways

1.2 Four processes

1.3 Python-related implementations (installation packages)

"1.1"

There are three main types of machine learning methods: Supervised learning (supervised learning), unsupervised learning (unsupervised learning), and reinforcement learning (enhanced learning)

Supervised learning (supervised learning):

The main purpose of supervised learning is to predict the future and classify it from existing data. First look at the basic definition:

The process of adjusting the classifier's parameters to meet the required performance, also known as supervised training or teacher learning, is adjusted using a set of known classes of samples.     Supervised learning is a machine learning task that infers a function from the tagged training data. In layman's interpretation, supervised learning is the analysis of a group (or groups of) known data, the optimal model of the condition, and the analysis of the data of the unknown result with this model, and the prediction and classification. This is widely used in our usual simple modeling and has a high degree of confidence. Thus, one of its major features is " there are training samples "The two most commonly used algorithms for          supervised learning are: Statistical classification classification and regression analysis regression      which algorithm to use, you need to choose according to the characteristics of the data. If the data is discrete, non-linear, statistically classified, and the data is linear, then regression analysis is chosen. The following two images illustrate the difference between the two:                           Figure 1   Statistical classification                                                                                                  Figure 2 Regression analysis          Reinforcement Learning (Enhanced learning)   &NBsp The purpose of reinforcement learning is to make the optimal decision in the corresponding environment. In essence, reinforcement learning has a result that we call a "return signal", which can be described as a "return function", which is similar to supervised learning. Therefore, this concept is first described in advance.     Unlike supervised learning, the return function of enhanced learning is not just a function value or a classification label, it is used to measure the merits of a step (or behavior), and the results are good. Therefore, the enhancement study is in the interaction, seeks the decision-making best. It is closely related to the environment.             The most typical example is the chess program. The program evaluates each step and gets the corresponding return function, and we only need to find a path with the largest return value (the sum of the returns per step), which is considered the best path.      unsupervised learning non-supervised learning     first-glance features: Dealing with unlabeled data or data of Unknown structure is mentioned earlier, the most important difference between unsupervised and supervised is whether the sample can be trained.     Two algorithms are: clustering and dimensionality reduction, from a very good understanding of the literal, clustering is the similarity of high objects converge into the same class. The data is not trained and analyzed directly. dimensionality reduction is to reduce the data dimension. The following two pictures, can be very good show:                                            3  Clustering                                          Figure 4 dimensionality reduction   "1.2" machine learning process         is mainly divided into four parts:  (1) preprocessing–getting data into shape      preprocessing is a critical step in machine learning and is one of the most fundamental steps in any data analysis. Real-life data is always not exactly in our format, and we cannot avoid the need to manipulate the data so that it becomes the format we need. Specific ways not detailed, you can refer to the previous articles, some mention, can also be used in practice, System learning   (2) Training and selecting a predictive model     In a nutshell, choose a suitable model.   (3) Evaluating models      Popular survey, error analysis and inspection   (4) predicting unseen data instances     If the model is ideal, you can apply     "1.3" Installing Python packages    numpy 1.9.1
scipy 0.14.0
Scikit-learn 0.15.2
Matplotlib 1.4.0
Pandas 0.15.2

Machine learning "1" (Python Machines Learning reading notes)

Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.