Summarizing each algorithm and its application scenarios in a nutshell

Last Update:2017-04-06 Source: Internet

Author: User


EM, expectation-maximization, is a maximum likelihood estimation method for the parameters of probabilistic models that contain latent (hidden) variables. It is mainly used for data clustering in machine learning and computer vision.
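As a rough illustration of the idea, the sketch below runs EM on a one-dimensional mixture of two Gaussians, where the latent variable is which component generated each point. The function names, the initialization, and the synthetic data are assumptions made for this example and do not come from the original article.

```python
import math
import random


def gaussian_pdf(x, mean, var):
    """Density of a 1-D normal distribution with the given mean and variance."""
    return math.exp(-(x - mean) ** 2 / (2 * var)) / math.sqrt(2 * math.pi * var)


def em_two_gaussians(data, iterations=50):
    """Fit a two-component 1-D Gaussian mixture with plain EM (illustrative only)."""
    n = len(data)
    overall_mean = sum(data) / n
    overall_var = sum((x - overall_mean) ** 2 for x in data) / n
    # Crude initialization (an assumption of this sketch): extreme points as means.
    means = [min(data), max(data)]
    variances = [overall_var, overall_var]
    weights = [0.5, 0.5]

    for _ in range(iterations):
        # E-step: posterior probability that each point came from each component.
        resp = []
        for x in data:
            p = [weights[k] * gaussian_pdf(x, means[k], variances[k]) for k in range(2)]
            total = sum(p)
            resp.append([pk / total for pk in p])

        # M-step: re-estimate the parameters from the weighted points.
        for k in range(2):
            nk = sum(r[k] for r in resp)
            weights[k] = nk / n
            means[k] = sum(r[k] * x for r, x in zip(resp, data)) / nk
            var_k = sum(r[k] * (x - means[k]) ** 2 for r, x in zip(resp, data)) / nk
            variances[k] = max(var_k, 1e-6)  # guard against a collapsing component

    return weights, means, variances


# Synthetic data: two clusters centred near 0 and 5.
random.seed(0)
data = [random.gauss(0.0, 1.0) for _ in range(200)]
data += [random.gauss(5.0, 1.0) for _ in range(200)]
weights, means, variances = em_two_gaussians(data)
```

On data like this the two estimated means should settle close to the two cluster centres, which is how EM ends up being used for clustering.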

LR, logistic regression, is a linear model: it fits a linear combination of the features to the samples and then passes the result through the logistic (sigmoid) function, which maps it into the interval (0, 1). Despite the name it is generally used for classification, mainly in click-through rate (CTR) estimation, recommender systems, etc.;
SVM, support vector machine, classifies samples by finding a separating hyperplane in the sample space; it can also be used for regression. It is mainly used in text classification, image recognition and other fields;
NN, neural network, fits the data with a non-linear model, and is mainly used in image processing;
NB, naive Bayes, estimates the joint probability distribution of the samples and then uses Bayes' formula to compute the posterior probability of each class for a sample, which gives the classification. It is mainly used for text classification;
DT, decision tree, builds a tree and splits the samples at each node according to some rule (usually one based on information entropy). In essence it partitions the sample space into blocks. It is mainly used for classification and can also do regression, but more often it serves as a weak classifier inside model ensembles;
RF, random forest, is a forest made up of a number of decision trees. Each tree's training set is sampled from the overall sample, and the features considered when splitting each node are also sampled, so each tree ends up with its own area of expertise and the forest as a whole generalizes better;
GBDT, gradient-boosting decision trees, is also made up of many trees. Unlike RF, each tree is trained on the residual left by the previous trees, which embodies the idea of the gradient, and the final result combines the outputs of all the trees. It is mainly used in recommendation, relevance ranking, etc.;
KNN, k-nearest neighbors, is probably the simplest ML method: for a sample with an unknown label, look at its K nearest samples (under some distance measure, such as Mahalanobis distance or Euclidean distance) and assign it the label that occurs most often among them.
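The KNN rule in the last item is short enough to sketch directly. The version below is a minimal illustration using Euclidean distance and a majority vote; the function name `knn_predict` and the toy training points are made up for this example.

```python
import math
from collections import Counter


def knn_predict(train, query, k=3):
    """Return the most common label among the k training points nearest to query.

    train is a list of (features, label) pairs; features are equal-length tuples.
    """
    # Sort the training points by Euclidean distance to the query, keep the k closest.
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    # Majority vote over the labels of those neighbours.
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Toy data (hypothetical): two well-separated groups.
train = [
    ((1.0, 1.0), "a"), ((1.2, 0.8), "a"), ((0.9, 1.1), "a"),
    ((5.0, 5.0), "b"), ((5.2, 4.9), "b"), ((4.8, 5.1), "b"),
]
label_near_a = knn_predict(train, (1.1, 1.0))
label_near_b = knn_predict(train, (5.1, 5.0))
```

There is no training step: all the work happens at prediction time, which is why the method is simple but can become slow on large training sets.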

The naive Bayes method is a classification method based on Bayes' theorem and the assumption that the features are conditionally independent given the class. For a given training data set, the joint probability distribution of the input and output is first learned under this conditional independence assumption. Then, based on this model, for a given input x, the output y with the largest posterior probability is obtained through Bayes' theorem. In other words, for an item to be classified, compute the probability of each category given the item's features; the item is assigned to the category with the largest probability.
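The procedure above can be sketched for text classification as follows. This is a minimal bag-of-words version that assumes add-one (Laplace) smoothing and log-probabilities, neither of which is stated in the article; the function names and the tiny spam/ham corpus are hypothetical.

```python
import math
from collections import Counter, defaultdict


def train_nb(docs):
    """Count class frequencies and per-class word frequencies.

    docs is a list of (list_of_words, label) pairs.
    """
    class_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for words, label in docs:
        class_counts[label] += 1
        word_counts[label].update(words)
        vocab.update(words)
    return class_counts, word_counts, vocab


def predict_nb(model, words):
    """Return the label with the largest (log) posterior for the given words."""
    class_counts, word_counts, vocab = model
    total_docs = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label, count in class_counts.items():
        # log P(y): the class prior.
        score = math.log(count / total_docs)
        total_words = sum(word_counts[label].values())
        for w in words:
            if w not in vocab:
                continue  # ignore words never seen in training
            # log P(w | y) with add-one smoothing; the per-word terms simply add up
            # because the features are assumed conditionally independent.
            score += math.log((word_counts[label][w] + 1) / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical toy corpus.
docs = [
    ("cheap pills buy now".split(), "spam"),
    ("buy cheap watches".split(), "spam"),
    ("meeting agenda for monday".split(), "ham"),
    ("lunch meeting on monday".split(), "ham"),
]
model = train_nb(docs)
spam_guess = predict_nb(model, "buy cheap pills".split())
ham_guess = predict_nb(model, "monday meeting".split())
```

The shared denominator P(x) is dropped because it is the same for every class, so comparing the numerators is enough to pick the class with the largest posterior.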


This article is an English version of an article which is originally in the Chinese language on aliyun.com and is provided for information purposes only. This website makes no representation or warranty of any kind, either expressed or implied, as to the accuracy, completeness ownership or reliability of the article or any translations thereof. If you have any concerns or complaints relating to the article, please send an email, providing a detailed description of the concern or complaint, to info-contact@alibabacloud.com. A staff member will contact you within 5 working days. Once verified, infringing content will be removed immediately.

