Optimization of Sorting Algorithm Based on the MapReduce Model
Jin Yu
MapReduce has become a de facto standard parallel computing model for big data. Ideally, a MapReduce system should keep the load highly balanced across all nodes involved in a computation while minimizing space usage, CPU and I/O usage, and network transfer overhead. Traditional algorithms are usually optimized for only one of these metrics. Taking good parallelism of the algorithm as the starting point, this paper proposes design criteria for optimized MapReduce algorithms. Through a theoretical analysis of sorting, the most important algorithm in the data processing field, it presents an optimized sorting algorithm under the constraints of multiple metrics and proves that this algorithm satisfies the proposed criteria. Finally, experiments verify the effectiveness and efficiency of the optimized sorting algorithm.
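The abstract does not describe the algorithm itself, so the following is only a minimal single-process sketch of one common way to sort in the MapReduce style: sample-based range partitioning, which tries to keep reducer loads balanced. It is not the paper's algorithm. All names (`choose_splitters`, `map_phase`, `reduce_phase`, `mapreduce_sort`) and the parameters `num_reducers` and `sample_size` are assumptions made for illustration.

```python
import bisect
import random


def choose_splitters(records, num_reducers, sample_size, seed=0):
    """Pick num_reducers - 1 split points from a sorted random sample."""
    rng = random.Random(seed)
    sample = sorted(rng.sample(records, min(sample_size, len(records))))
    step = len(sample) / num_reducers
    # Evenly spaced sample elements approximate the key quantiles.
    return [sample[int(step * i)] for i in range(1, num_reducers)]


def map_phase(records, splitters):
    """Route each key to the partition whose key range contains it."""
    partitions = [[] for _ in range(len(splitters) + 1)]
    for key in records:
        partitions[bisect.bisect_right(splitters, key)].append(key)
    return partitions


def reduce_phase(partitions):
    """Each (simulated) reducer sorts its own partition independently."""
    return [sorted(part) for part in partitions]


def mapreduce_sort(records, num_reducers=4, sample_size=100):
    """Sort by range-partitioning, local sorting, then concatenating."""
    if not records:
        return []
    splitters = choose_splitters(records, num_reducers, sample_size)
    partitions = map_phase(records, splitters)
    result = []
    # Partitions cover increasing key ranges, so concatenation is sorted.
    for part in reduce_phase(partitions):
        result.extend(part)
    return result
```

Because the partitions cover disjoint, increasing key ranges, no merge step is needed after the reducers finish. How evenly the load is spread depends on how well the sample reflects the key distribution, which is the kind of balance concern the abstract raises.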