Research and implementation of clustering and convex package algorithm under MapReduce framework

Last Update:2015-03-17 Source: Internet

Author: User

Developer on Alibaba Coud: Build your first app with APIs, SDKs, and tutorials on the Alibaba Cloud. Read more ＞

Chengdu University of Technology Zhaoju

First of all, this paper studies the generation and value growth of large data, explains the necessity of improving the execution efficiency of data mining algorithm, and gives a general introduction to the technology and tools that support large-data processing nowadays. Then the paper studies the running mechanism of Hadoop file system, the stored procedure and the programming model of MapReduce framework, and the operation principle. Secondly, the data are distributed and processed on a certain scale Hadoop cluster to evaluate the performance of the whole cluster to see if it is suitable for the standard data Mining task. In the MapReduce framework, the search and sequencing tasks are performed to analyze the effects of different system configurations. At the same time, the K clustering algorithm is provided for iterative implementation in MapReduce framework. Finally, the traditional computer graphics convex package algorithm is implemented in parallel with the MapReduce frame, and the experimental data is simulated with the K algorithm, which shows that the convex packet algorithm can be applied to the research of the data mining algorithm in the MapReduce framework, The result of data mining algorithm is introduced in data compression.

Research and implementation of clustering and convex package algorithm under MapReduce framework

This article is an English version of an article which is originally in the Chinese language on aliyun.com and is provided for information purposes only. This website makes no representation or warranty of any kind, either expressed or implied, as to the accuracy, completeness ownership or reliability of the article or any translations thereof. If you have any concerns or complaints relating to the article, please send an email, providing a detailed description of the concern or complaint, to info-contact@alibabacloud.com. A staff member will contact you within 5 working days. Once verified, infringing content will be removed immediately.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

Get Started for Free

Sales Support

1 on 1 presale consultation

Chat Contact Sales
After-Sales Support

24/7 Technical Support 6 Free Tickets per Quarter Faster Response

Open a Ticket
Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.

Learn More

Research and implementation of clustering and convex package algorithm under MapReduce framework

Contact Us

Recommend Topic

A Free Trial That Lets You Build Big!

Sales Support

After-Sales Support