Wang Jialin's 11th lecture on hadoop graphic training course: Analysis of the Principles, mechanisms, and flowcharts of mapreduce in "the path to a practical master of cloud computing distributed Big Data hadoop-from scratch"

Source: Internet
Author: User

This section mainly analyzes the principles and processes of mapreduce.


Complete release directory of "cloud computing distributed Big Data hadoop hands-on"


Cloud computing distributed Big Data practical technology hadoop exchange group:312494188Cloud computing practices will be released in the group every day. welcome to join us!


You must at least know the following points about mapreduce:

1. mapreduce runs on a distributed file system. In hadoop, mapreduce runs on HDFS;

2. mapreduce is mainly used for parallel operations on large-scale data. This type of big data refers to 1 TB or more;

3. The principle of mapreduce is to cut a large task into many small tasks running in parallel, and then combine the results after the small tasks are run to get the final result;

Is the mapreduce running process diagram:

Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.