Skyline calculation based on the MapReduce projection partition under the frame

Source: Internet
Author: User
Keywords Large data mapreduce skyline calculation hyperplane projection division
Tags .xin application based computing data data processing framework large data

Skyline calculation based on the MapReduce projection partition under the frame

Wang Shuyan Yang Xin Lickerich

Recent, Skyline Computing plays a more and more important role in decision application. The research on single machine processing has been more mature. Nowadays large data explosion, Skyline Computing is confronted with the problem of data processing. The MapReduce is a parallel model, which is widely used in the application processing. As we all know, mapreduce processing requires that tasks be decomposable. Sk When the yline computation executes on the MapReduce, the method of decomposing the task has the Grid division, Based on the angle of division and so on. Grid partitioning behaves well only when the data dimension is low, and the segmentation based on angle is suitable for low and high dimensional data however, a complex and time-consuming coordinate conversion process is needed before partitioning. A data set is decomposed by a kind of segmentation based on the angle division, which is suitable for low and high dimensional data, and it is simpler to transform the coordinates before the partition. Single. Based on the partition of the hyperplane projection, an algorithm for Skyline computation on MapReduce is proposed MR-HPP (MapReduce with hyperplane-projections-based partition), An effective filtering algorithm, PSF (presorting filter), is proposed in the filtering phase of the algorithm. A large number of comparison experiments based on the Hadoop platform show that the algorithm is accurate, efficient and stable.


Skyline calculation based on the MapReduce projection partition under the frame

Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.