HDFS Benefits:
1. High fault tolerance.
A) data automatically saves multiple copies
b) Automatic recovery after a copy is lost
2. Suitable for batch processing
A) mobile computing rather than moving data
b) Data location exposure to the computing framework
3. Suitable for large data processing
A) GB, TB, or even petabytes of data
b) Number of documents of more than million
c) 10K + node
4. Can be component on the cheap machine
A) Increased reliability through multiple replicas
b) provides fault tolerance and recovery mechanisms
HDFs Disadvantages:
1. Low Latency Data access
A) such as the millisecond level
b) Low latency and high throughput
2. Small file access
A) consumes Namenode of memory
b) Seek time exceeds read time
3. Concurrent write, File random modification
A) A file can only have one writer
b) only support append
HDFs processing of a large number of small files: compression of small files, that is, 10,000 small files in HDFs only as a file for storage, compression processing.
Copyright NOTICE: This article for Bo Master original article, without Bo Master permission not reproduced.
HDFs Pros and cons