Partition Algorithm for Mining Association Rules

Source: Internet
Author: User

An association rule is an expression of the form A->B, where A and B are two disjoint subsets of the full item set.

The main purpose of mining association rules is to find meaningful correlations in data. Market basket analysis, for example, analyzes customer purchasing behavior to discover relationships between different products.

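Support, confidence, and lift

Support(A->B) = |AB| / |S|

Confidence(A->B) = |AB| / |A|

Lift(A->B) = Confidence(A->B) / Support(B), where Support(B) = |B| / |S|

Here |AB| is the number of transactions that contain both A and B, |A| and |B| are the numbers of transactions that contain A and B respectively, and |S| is the total number of transactions in the data set.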
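To make these measures concrete, here is a minimal Python sketch that computes them over a small, made-up list of transactions. The function names and the sample data are assumptions chosen for illustration and do not come from the original article. The third measure named above, commonly called lift, is computed with its standard definition, Confidence(A->B) divided by the fraction of transactions that contain B.

```python
from typing import FrozenSet, List

Itemset = FrozenSet[str]

# Hypothetical sample data: each transaction is the set of items in one basket.
TRANSACTIONS: List[Itemset] = [
    frozenset({"bread", "milk"}),
    frozenset({"bread", "diapers", "beer"}),
    frozenset({"milk", "diapers", "beer"}),
    frozenset({"bread", "milk", "diapers"}),
    frozenset({"bread", "milk", "beer"}),
]


def count(transactions: List[Itemset], itemset: Itemset) -> int:
    """Number of transactions that contain every item in `itemset`."""
    return sum(1 for t in transactions if itemset <= t)


def support(transactions: List[Itemset], a: Itemset, b: Itemset) -> float:
    """Support(A->B) = |AB| / |S|."""
    return count(transactions, a | b) / len(transactions)


def confidence(transactions: List[Itemset], a: Itemset, b: Itemset) -> float:
    """Confidence(A->B) = |AB| / |A|. Assumes A occurs at least once."""
    return count(transactions, a | b) / count(transactions, a)


def lift(transactions: List[Itemset], a: Itemset, b: Itemset) -> float:
    """Lift(A->B) = Confidence(A->B) / (|B| / |S|). Assumes B occurs at least once."""
    support_b = count(transactions, b) / len(transactions)
    return confidence(transactions, a, b) / support_b


A = frozenset({"bread"})
B = frozenset({"milk"})
print(support(TRANSACTIONS, A, B))
print(confidence(TRANSACTIONS, A, B))
print(lift(TRANSACTIONS, A, B))
```

In this sample, bread and milk appear together in 3 of 5 baskets and bread appears in 4, so the rule bread->milk has support 3/5 and confidence 3/4. A lift below 1 would suggest that buying bread makes milk slightly less likely than its baseline rate.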

The partition-based algorithm first divides the database logically into disjoint partitions. It processes one partition at a time and generates all itemsets that are frequent within that partition, then merges these local frequent itemsets into a set of global candidate itemsets, and finally scans the database again to compute the actual support of each candidate. The partition size is chosen so that each partition fits in main memory, and each phase scans the database only once. Correctness rests on the fact that every globally frequent itemset must be frequent in at least one partition, so merging the local results cannot miss a frequent itemset.

The algorithm described above is highly parallelizable: each partition can be assigned to its own processor, which generates the frequent itemsets for that partition. After each pass that generates frequent itemsets, the processors communicate with one another to produce the global candidate k-itemsets. This communication is usually the main bottleneck in execution time; the time each processor needs to generate its frequent itemsets is a second bottleneck. Other methods share a hash tree among multiple processors to generate the frequent itemsets. More parallel methods for generating frequent itemsets can be found in the literature [AS96].
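The two-phase procedure can be sketched in a few lines of Python. This is a simplified, single-process illustration under stated assumptions, not a reference implementation: the function names, the sample data, and the level-wise search used inside each partition are all choices made here for illustration, and the partitions are plain list slices rather than blocks read from disk.

```python
from math import ceil
from typing import Dict, FrozenSet, List, Set

Itemset = FrozenSet[str]


def local_frequent_itemsets(partition: List[Itemset], min_support: float) -> Set[Itemset]:
    """Return every itemset whose support within this one partition is >= min_support."""
    frequent: Set[Itemset] = set()
    k = 1
    # Level 1 candidates: every single item that occurs in the partition.
    current = {frozenset([item]) for t in partition for item in t}
    while current:
        level = {
            c for c in current
            if sum(1 for t in partition if c <= t) / len(partition) >= min_support
        }
        frequent |= level
        k += 1
        # Join frequent (k-1)-itemsets into k-itemset candidates for the next level.
        current = {a | b for a in level for b in level if len(a | b) == k}
    return frequent


def partition_mine(transactions: List[Itemset], num_partitions: int,
                   min_support: float) -> Dict[Itemset, float]:
    """Two-phase partition mining; returns each globally frequent itemset with its support."""
    n = len(transactions)  # assumed to be non-zero
    size = ceil(n / num_partitions)
    partitions = [transactions[i:i + size] for i in range(0, n, size)]

    # Phase 1: one pass over the data, collecting locally frequent itemsets.
    candidates: Set[Itemset] = set()
    for part in partitions:
        candidates |= local_frequent_itemsets(part, min_support)

    # Phase 2: a second pass that counts every candidate over the whole database.
    counts = {c: 0 for c in candidates}
    for t in transactions:
        for c in candidates:
            if c <= t:
                counts[c] += 1
    return {c: cnt / n for c, cnt in counts.items() if cnt / n >= min_support}


# Hypothetical sample data, split into two partitions of 3 and 2 transactions.
TRANSACTIONS: List[Itemset] = [
    frozenset({"bread", "milk"}),
    frozenset({"bread", "diapers", "beer"}),
    frozenset({"milk", "diapers", "beer"}),
    frozenset({"bread", "milk", "diapers"}),
    frozenset({"bread", "milk", "beer"}),
]
RESULT = partition_mine(TRANSACTIONS, num_partitions=2, min_support=0.6)
for itemset, sup in sorted(RESULT.items(), key=lambda kv: sorted(kv[0])):
    print(sorted(itemset), sup)
```

With this sample data, the itemset {diapers, beer} is frequent in the first partition and therefore becomes a global candidate, but the second pass finds it in only 2 of 5 transactions and drops it. That is the role of the second phase: local results can over-generate candidates but never miss a globally frequent itemset. In a parallel setting, the loop over partitions in phase 1 is the part that would be distributed across processors.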
