| Original book chapters |
Chapter title of the original book |
Translate article serial number |
Translating the article title |
Link |
| 4.1 |
Joining |
Hadoop (1) |
MapReduce Connection: Re-partitioned connection (repartition join) |
Http://www.cnblogs.com/datacloud/p/3578509.html |
| 4.1.1 |
Repartition Join |
Hadoop (1) |
MapReduce Connection: Re-partitioned connection (repartition join) |
Http://www.cnblogs.com/datacloud/p/3578509.html |
| 4.1.2 |
Replicated joins |
Hadoop (2) |
MapReduce Connection: Replication connection (Replication join) |
Http://www.cnblogs.com/datacloud/p/3579333.html |
| 4.1.3 |
Semi-joins |
Hadoop (3) |
MapReduce connection: Half connection (Semi-join) |
Http://www.cnblogs.com/datacloud/p/3579975.html |
| 4.1.4 |
Picking the best join strategy for your data |
Hadoop (4) |
MapReduce connections: Choosing the Best Connection Strategy |
Http://www.cnblogs.com/datacloud/p/3582113.html |
| 4.2 |
Sorting |
Hadoop (5) |
MapReduce Sort: Order (secondary sort) |
Http://www.cnblogs.com/datacloud/p/3584640.html |
| 4.2.1 |
Secondary sort |
Hadoop (5) |
MapReduce Sort: Order (secondary sort) |
Http://www.cnblogs.com/datacloud/p/3584640.html |
| 4.2.2 |
Total order Sorting |
Hadoop (6) |
MapReduce Sort: Total order (sorting) |
Http://www.cnblogs.com/datacloud/p/3586761.html |
| 4.3 |
Sampling |
Hadoop (7) |
MapReduce: Sampling (sampling) |
Http://www.cnblogs.com/datacloud/p/3588120.html |
| 6.1 |
Measuring MapReduce and your environment |
Hadoop (8) |
MapReduce Performance Tuning: Performance measurement (measuring) |
Http://www.cnblogs.com/datacloud/p/3589875.html |
| 6.2 |
Determining the cause of your performance woes |
Hadoop (9) |
MapReduce Performance Tuning: Understanding performance Bottlenecks and diagnosing map performance bottlenecks |
Http://www.cnblogs.com/datacloud/p/3591981.html |
| 6.2.1 |
Understanding what can impact MapReduce job performance |
Hadoop (9) |
MapReduce Performance Tuning: Understanding performance Bottlenecks and diagnosing map performance bottlenecks |
Http://www.cnblogs.com/datacloud/p/3591981.html |
| 6.2.2 |
MAP woes |
Hadoop (9) |
MapReduce Performance Tuning: Understanding performance Bottlenecks and diagnosing map performance bottlenecks |
Http://www.cnblogs.com/datacloud/p/3591981.html |
| 6.2.3 |
Reducer woes |
Hadoop (10) |
MapReduce Performance Tuning: Diagnosing reduce performance bottlenecks |
Http://www.cnblogs.com/datacloud/p/3595682.html |
| 6.2.4 |
General task woes |
Hadoop (11) |
MapReduce Performance Tuning: Diagnosing a general bottleneck |
Http://www.cnblogs.com/datacloud/p/3596294.html |
| 6.2.5 |
Hardware woes |
Hadoop (12) |
MapReduce Performance Tuning: Diagnosing hardware performance bottlenecks |
Http://www.cnblogs.com/datacloud/p/3597909.html |
| 6.4.3 |
Optimizing the shuffle and sort phase |
Hadoop (13) |
MapReduce Performance Tuning: Optimizing Shuffle (shuffle) and sequencing stages |
Http://www.cnblogs.com/datacloud/p/3599920.html |
| 6.4.4 |
Skew Mitigation |
Hadoop (14) |
MapReduce Performance Tuning: Reducing the performance penalty for data skew |
Http://www.cnblogs.com/datacloud/p/3601624.html |
| 6.4.5 |
Optimizing User space Java in MapReduce |
Hadoop (15) |
MapReduce Performance Tuning: Optimizing user Java code for MapReduce |
Http://www.cnblogs.com/datacloud/p/3603191.html |
| 6.4.6 |
Data serialization |
Hadoop (16) |
MapReduce Performance Tuning: Optimizing Data serialization |
Http://www.cnblogs.com/datacloud/p/3608591.html |
| 6.5 |
Chapter Summary |
Hadoop (16) |
MapReduce Performance Tuning: Optimizing Data serialization |
Http://www.cnblogs.com/datacloud/p/3608591.html |
| 5.1 |
Working with small files |
Hadoop (17) |
MapReduce file processing: Small files |
Http://www.cnblogs.com/datacloud/p/3611459.html |
| 5.2 |
Efficient storage with compression (tech 25, 26) |
Hadoop (19) |
MapReduce file Processing: compression-based efficient storage (i) |
Http://www.cnblogs.com/datacloud/p/3612817.html |
| 5.2 |
Efficient storage with compression (Tech 27) |
Hadoop (19) |
MapReduce file Processing: compression-based efficient storage (i) |
Http://www.cnblogs.com/datacloud/p/3616544.html |
| Appendix A.10 |
LZOP |
Hadoop (20) |
Appendix A.10 compression format LZOP compiling installation configuration |
Http://www.cnblogs.com/datacloud/p/3617586.html |
| Appendix D.1 |
An optimized repartition join framework |
Hadoop (21) |
Appendix D.1 optimized re-partitioning framework |
Http://www.cnblogs.com/datacloud/p/3617079.html |
| Appendix D.2 |
A replicated Join Framework |
Hadoop (22) |
Appendix D.2 Replication Connection Framework |
Http://www.cnblogs.com/datacloud/p/3617078.html |