Improvement of the Apriori Algorithm under the MapReduce Framework
Keywords: cloud computing, MapReduce, Apriori algorithm
Wang Wang Junhong Yu Jiao Gedommei
Mining massive data sets with the traditional Apriori algorithm wastes a great deal of storage space and communication resources, which makes the algorithm inefficient. The paper therefore proposes an improved Apriori algorithm under the MapReduce framework. First, the transaction database is divided by horizontal partitioning into n separate data blocks, which are then sent to M worker nodes with dynamic load balancing. Each node scans its own data blocks and produces local candidate frequent itemsets. The support of each candidate itemset is then calculated and compared with the minimum support threshold to determine the final frequent itemsets. The improved algorithm reduces the data flow between nodes and needs only two scans of the transaction database to mine all frequent itemsets, which saves scanning time and storage space and improves mining efficiency.
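The two-scan flow described in the abstract can be sketched in a single Python process. This is only an illustration under stated assumptions, not the authors' implementation: the function names (`local_frequent_itemsets`, `count_candidates`, `mine`), the sample transactions, and the use of a relative minimum support ratio are all hypothetical, the "nodes" are simulated by a plain loop over blocks, and dynamic load balancing is left out. The first scan runs ordinary Apriori inside each block to collect local candidates; the second scan counts those candidates over every block and keeps the ones that meet the global minimum support.

```python
from collections import Counter
from itertools import combinations
from math import ceil


def local_frequent_itemsets(block, min_support_ratio):
    """Scan 1 (map side): plain Apriori on one data block."""
    threshold = ceil(min_support_ratio * len(block))
    # Count single items; each transaction is a set, so no double counting.
    counts = Counter(frozenset([item]) for t in block for item in t)
    current = {s for s, c in counts.items() if c >= threshold}
    frequent = set(current)
    k = 2
    while current:
        # Join step: unions of frequent (k-1)-itemsets that have exactly k items.
        candidates = {a | b for a in current for b in current if len(a | b) == k}
        # Prune step: every (k-1)-subset must itself be locally frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(s) in current for s in combinations(c, k - 1))
        }
        counts = Counter(c for t in block for c in candidates if c <= t)
        current = {s for s, c in counts.items() if c >= threshold}
        frequent |= current
        k += 1
    return frequent


def count_candidates(block, candidates):
    """Scan 2 (map side): count every global candidate inside one block."""
    return Counter(c for t in block for c in candidates if c <= t)


def mine(transactions, n_blocks, min_support_ratio):
    """Simulate the two-scan, partitioned mining on a list of transactions."""
    # Horizontal partitioning into (at most) n_blocks contiguous blocks.
    size = max(1, ceil(len(transactions) / n_blocks))
    blocks = [transactions[i:i + size] for i in range(0, len(transactions), size)]

    # Scan 1: the union of the local results is the global candidate set.
    candidates = set()
    for block in blocks:
        candidates |= local_frequent_itemsets(block, min_support_ratio)

    # Scan 2 (reduce side): sum the per-block counts of each candidate.
    totals = Counter()
    for block in blocks:
        totals.update(count_candidates(block, candidates))

    threshold = ceil(min_support_ratio * len(transactions))
    return {s: c for s, c in totals.items() if c >= threshold}


# Hypothetical sample data, for illustration only.
transactions = [
    frozenset({"bread", "milk"}),
    frozenset({"bread", "butter"}),
    frozenset({"bread", "milk", "butter"}),
    frozenset({"milk"}),
]

for itemset, count in sorted(mine(transactions, 2, 0.5).items(),
                             key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), count)
```

No frequent itemset is lost by working per block: an itemset that falls below the scaled threshold in every block cannot reach the global threshold once the counts are summed, so it is safe to count only the local candidates in the second scan. Only itemsets and their counts move between the two phases, not raw transactions, which is consistent with the reduced data flow between nodes that the abstract claims.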