Hadoop Learning notes-which scenarios are not suitable for which scenarios

Source: Internet
Author: User

Hadoop is not a panacea, some scenarios apply, and some do not.

The purpose of Hadoop design consists of the following aspects, namely the so-called application scenario:

1: Extra Large file

Can be hundreds of m, hundreds of t this level of file.

2: Streaming data access

Hadoop works for a single write, multiple-read scenario, where data is copied and analyzed over a long period of time.

3: Commercial Hardware

That is to say the street everywhere can buy the kind of hardware, such hardware failure rate is high, so have a good fault-tolerant mechanism.

Next talk about the scenarios that are not applicable:

1: Low Latency data access

The purpose of Hadoop design is high throughput, so there are no optimizations for low latency data access, and if low latency is required, you can look at hbase.

2: A large number of small files

Because Namenode stores the file's metadata in memory, a large number of small files generate a lot of metadata. In this case, the number of millions other documents is still feasible, and then there are a lot of problems.

3: Multi-user write, Arbitrary modification

Hadoop does not now support multi-person write, arbitrarily modified features. This means that each write is added at the end of the file.

Hadoop Learning notes-which scenarios are not suitable for which scenarios

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.