"Ha"

Source: Internet
Author: User

Hadoop provides a reliable shared storage and analysis system: storage is provided by HDFS, and analysis by MapReduce.

MapReduce is a distributed data processing model and execution environment.

HDFS is a distributed file system.
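As a rough illustration of the MapReduce processing model, here is a minimal single-process sketch in plain Python. It does not use Hadoop's API; the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are hypothetical and chosen only for this example. In a real cluster the framework runs the map and reduce steps in parallel across machines and performs the grouping step itself.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group values by key, standing in for the framework's shuffle step."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Combine all values collected for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over a list of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["Hadoop stores data", "MapReduce analyzes data"]))
```

The point of the split is that each map call sees only one record and each reduce call sees only one key, so both steps can be distributed without coordination between workers.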

Why use MapReduce instead of a database with more disks?

1. Disk drive trends: seek time is improving much more slowly than transfer rate. Seeking is limited by the latency of the disk operation, while the transfer rate corresponds to the disk's bandwidth. If the data access pattern is dominated by seeks, reading or writing a large part of the dataset takes far longer than streaming through it.

2. For updating a small number of database records, a traditional B-tree works well. For updating most of the data in a database, a B-tree is less efficient than MapReduce, which rebuilds the database using sort/merge.

3. MapReduce is suited to problems that need to analyze the entire dataset in batch mode, and to applications where data is written once and read many times.
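The seek-versus-transfer argument in the first point can be made concrete with some back-of-the-envelope arithmetic. The sketch below is only an illustration: the 10 ms seek time, 100 MB/s transfer rate, 1 TB dataset, and 100-byte record size are assumed figures, not measurements from the source, and the function names are hypothetical.

```python
def streaming_seconds(total_bytes, transfer_rate):
    """Time to read the whole dataset sequentially at full bandwidth."""
    return total_bytes / transfer_rate


def seek_bound_seconds(num_records, record_bytes, seek_time, transfer_rate):
    """Time to touch records one by one, paying one seek per record."""
    return num_records * (seek_time + record_bytes / transfer_rate)


if __name__ == "__main__":
    # All figures below are assumptions chosen for illustration.
    dataset_bytes = 10**12        # 1 TB dataset
    record_bytes = 100            # size of one record
    transfer_rate = 10**8         # 100 MB/s sustained transfer
    seek_time = 0.01              # 10 ms per seek

    total_records = dataset_bytes // record_bytes
    touched = total_records // 100  # update only 1% of the records

    print("stream everything:", streaming_seconds(dataset_bytes, transfer_rate), "s")
    print("seek to 1% of records:",
          seek_bound_seconds(touched, record_bytes, seek_time, transfer_rate), "s")
```

Under these assumptions, streaming the full terabyte takes about 10,000 seconds (under three hours), while seeking to just 1% of the records takes about 1,000,000 seconds, roughly a hundred times longer. That gap is why a batch rewrite of the whole dataset can beat in-place updates once a large share of the records changes.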

