KNN Supplement:
1. How large should K be set?
If K is too small, the classification result is susceptible to noise points; if K is too large, the neighborhood may contain too many points from other classes.
(Distance weighting can reduce the sensitivity to the choice of K.)
The value of K is usually determined by cross-validation (with k=1 as the baseline).
Rule of thumb: K is generally set below the square root of the number of training samples.
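Choosing K by cross-validation can be sketched as follows. This is a minimal pure-Python illustration; the helper names `knn_predict` and `cv_accuracy` are my own, not from the original text.

```python
import math
from collections import Counter

def knn_predict(train_X, train_y, x, k):
    # majority vote among the k training points nearest to x (Euclidean)
    dists = sorted((math.dist(x, xi), yi) for xi, yi in zip(train_X, train_y))
    votes = Counter(label for _, label in dists[:k])
    return votes.most_common(1)[0][0]

def cv_accuracy(X, y, k, folds=5):
    # simple k-fold cross-validation accuracy for a given K:
    # fold f holds out every index i with i % folds == f
    n = len(X)
    correct = 0
    for f in range(folds):
        train = [i for i in range(n) if i % folds != f]
        test = [i for i in range(n) if i % folds == f]
        tX = [X[j] for j in train]
        ty = [y[j] for j in train]
        for i in test:
            if knn_predict(tX, ty, X[i], k) == y[i]:
                correct += 1
    return correct / n
```

One would then try a range of K values (say, 1 up to the square root of the sample count) and keep the K with the best cross-validated accuracy.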
2. How to determine the most appropriate category?
Weighted voting is usually more appropriate than a simple majority vote; how to weight the votes needs to be explored based on the specific business and data characteristics.
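A minimal sketch of one common weighting scheme, inverse-distance voting; the 1/(d + eps) formula here is an illustrative assumption, not a prescription from the text:

```python
import math
from collections import defaultdict

def weighted_knn_predict(train_X, train_y, x, k, eps=1e-9):
    # each of the k nearest neighbors votes with weight 1/(distance + eps),
    # so closer neighbors count for more than distant ones
    dists = sorted((math.dist(x, xi), yi) for xi, yi in zip(train_X, train_y))
    scores = defaultdict(float)
    for d, label in dists[:k]:
        scores[label] += 1.0 / (d + eps)
    return max(scores, key=scores.get)
```

With this scheme, a single very close neighbor can outvote two distant ones, even though a plain majority vote would go the other way.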
3. How to choose the right distance metric?
The impact of high dimensionality on distance measures: it is well known that as the number of variables grows, Euclidean distance becomes less and less discriminating.
The effect of variable range on distance: variables with larger ranges tend to dominate the distance calculation, so the variables should be normalized first.
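As an illustration of the normalization step, min-max scaling rescales every variable to [0, 1] before distances are computed (the helper name is my own):

```python
def min_max_normalize(X):
    # rescale each column to [0, 1] so that no variable dominates the
    # distance calculation purely because of its measurement range
    cols = list(zip(*X))
    mins = [min(c) for c in cols]
    maxs = [max(c) for c in cols]
    return [
        tuple((v - lo) / (hi - lo) if hi > lo else 0.0
              for v, lo, hi in zip(row, mins, maxs))
        for row in X
    ]
```

Without this step, a feature measured in the thousands (e.g. income) would swamp a feature measured in single digits (e.g. years of tenure) in the Euclidean distance.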
4. Should training samples be treated equally?
In the training set, some samples may be more trustworthy than others; in other words, this is a question of sample data quality.
Different weights can be applied to different samples, increasing the weight of reliable samples and reducing the impact of unreliable ones.
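The idea of per-sample weights can be sketched by letting each neighbor vote with its own reliability weight. The `sample_w` array and the helper name are assumptions of this sketch, not from the original text:

```python
import math
from collections import defaultdict

def sample_weighted_knn(train_X, train_y, sample_w, x, k):
    # sample_w[i] is a per-sample reliability weight (hypothetical input);
    # each of the k nearest neighbors votes with its own weight
    dists = sorted(
        (math.dist(x, xi), yi, wi)
        for xi, yi, wi in zip(train_X, train_y, sample_w)
    )
    scores = defaultdict(float)
    for _, label, w in dists[:k]:
        scores[label] += w
    return max(scores, key=scores.get)
```

How the reliability weights themselves are obtained (manual labeling, source quality, agreement with neighbors, etc.) is the business-specific part.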
5. Performance problems?
KNN is a lazy algorithm: it does not study hard during training, and only crams at test time, temporarily searching for the K nearest neighbors when a test sample has to be classified.
The consequence of laziness: building the model is simple, but the system overhead of classifying a test sample is large, because all training samples must be scanned and their distances computed.
There are a number of ways to improve computational efficiency, such as compressing the training samples.
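One classic way to compress the training set is Hart's condensed nearest neighbor rule, sketched below in simplified form (one option among several; the helper name is my own):

```python
import math

def condense(train_X, train_y):
    # Hart's condensed nearest neighbor (simplified): grow a kept subset,
    # adding only samples that a 1-NN classifier built on the kept subset
    # would currently misclassify; repeat until no more additions occur
    kept_X, kept_y = [train_X[0]], [train_y[0]]
    changed = True
    while changed:
        changed = False
        for x, y in zip(train_X, train_y):
            # classify x with 1-NN on the current kept subset
            _, pred = min(
                (math.dist(x, xk), yk) for xk, yk in zip(kept_X, kept_y)
            )
            if pred != y:
                kept_X.append(x)
                kept_y.append(y)
                changed = True
    return kept_X, kept_y
```

For well-separated classes this can shrink the scan set dramatically while preserving 1-NN decisions; spatial indexes such as KD-trees are another standard speed-up.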