Summary of Clustering algorithm

Source: Internet
Author: User

Read some papers, a summary, share to everyone!

Clustering algorithm can be divided into the following categories: 1 The main idea of partitioning method is: Given the number of partitions to be constructed K, randomly select K objects in the database, each object represents the average or center of a class, and divides it into the nearest class based on the distance from the remaining object to the center of the class.    Then recalculate the center of each class, repeating the process until all objects are no longer allocated. Typical partitioning methods include: K-means, K-Medoids, CLARA, Clarans, FCM, etc. 2) The main idea of hierarchical approach based on hierarchical method is: to decompose the set of a given data object in a hierarchical way.    According to the formation of hierarchical decomposition, the hierarchical method is divided into condensed method (bottom-up) and splitting method (top-down). The algorithm based on hierarchical clustering mainly includes: Cure algorithm, BIRCH, rock algorithm, Chameleon and so on. 3) The main idea of density-based methods is to continue clustering as long as the density of the adjacent area exceeds a given threshold beforehand.    This way, you can find classes of arbitrary shapes and filter out "noise" points. Density-based clustering algorithms mainly include: DBSCAN, OPTICS, Denclue, and so on. 4) The main idea of grid-based method is to quantify the object space into a finite number of cells, and to form a grid structure to perform all cluster operations on the grid structure. Grid-based methods are: STING, Clque, Wavecluster. 5) The main idea of model-based approach is to assume a model for each class and find the optimal fit of the data to a given model.     Model-based algorithm localization clustering is achieved by constructing a density function that reflects the spatial distribution of data points, and it also automatically determines the number of clusters by standard statistics, and considers noise data or outliers to generate robust clustering methods. The algorithm based on the model method mainly includes: Cobweb, Classit and so on.

Summary of Clustering algorithm

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.