HBase Official Documentation (Chinese Version)



Source: http://www.yankay.com/wp-content/hbase/book.html

HBase Official Documentation

Copyright © 2010 Apache Software Foundation. Translated by Yenkai (Shanda Games Data Warehouse team).

Revision History
Revision 0.90.4
Configuration, getting started with data model

Abstract

This is the official documentation for Apache HBase. HBase is a distributed, versioned, column-oriented database built on top of Apache Hadoop and Apache ZooKeeper.

I (the translator) am familiar with the HBase source code and work on HBase development. If anything here is unclear, feel free to get in touch at yankaycom@gmail.com.

Table of Contents

Preface
1. Getting Started
  1.1. Introduction
  1.2. Quick Start
    1.2.1. Download and unpack the latest release
    1.2.2. Start HBase
    1.2.3. Shell Exercises
    1.2.4. Stop HBase
    1.2.5. Where to go next
  1.3. Not-so-quick Start Guide
    1.3.1. Requirements
    1.3.2. HBase run modes: standalone and distributed
    1.3.3. Example Configurations
2. Upgrading
  2.1. Upgrading to HBase 0.90.x from 0.20.x or 0.89.x
3. Configuration
  3.1. hbase-site.xml and hbase-default.xml
    3.1.1. HBase Default Configuration
  3.2. hbase-env.sh
  3.3. log4j.properties
  3.4. The Important Configurations
  3.5. Required Configurations
  3.6. Recommended Configurations
    3.6.1. zookeeper.session.timeout
    3.6.2. hbase.regionserver.handler.count
    3.6.3. Configuration for large-memory machines
    3.6.4. LZO compression
    3.6.5. Bigger Regions
    3.6.6. Managed Splitting
  3.7. Client configuration and dependencies for connecting to an HBase cluster
    3.7.1. Java Client Configuration
4. The HBase Shell
  4.1. Scripting
  4.2. Shell Tricks
    4.2.1. irbrc
    4.2.2. LOG time conversion
    4.2.3. Debug
5. Building HBase
  5.1. Adding an HBase release to Apache's Maven Repository
6. Developers
  6.1. IDEs
    6.1.1. Eclipse
  6.2. Unit Tests
    6.2.1. Mockito
7. HBase and MapReduce
  7.1. The default HBase MapReduce Splitter
  7.2. HBase Input MapReduce Example
  7.3. Accessing other HBase tables in a MapReduce job
  7.4. Speculative Execution
8. Schema Design for HBase
  8.1. Schema Creation
  8.2. Number of Column Families
  8.3. Monotonically increasing row keys / time series data (logs)
  8.4. Minimize row and column sizes
  8.5. Version timestamps
9. Metrics
  9.1. Metric Setup
  9.2. RegionServer Metrics
    9.2.1. hbase.regionserver.blockCacheCount
    9.2.2. hbase.regionserver.blockCacheFree
    9.2.3. hbase.regionserver.blockCacheHitRatio
    9.2.4. hbase.regionserver.blockCacheSize
    9.2.5. hbase.regionserver.compactionQueueSize
    9.2.6. hbase.regionserver.fsReadLatency_avg_time
    9.2.7. hbase.regionserver.fsReadLatency_num_ops
    9.2.8. hbase.regionserver.fsSyncLatency_avg_time
    9.2.9. hbase.regionserver.fsSyncLatency_num_ops
    9.2.10. hbase.regionserver.fsWriteLatency_avg_time
    9.2.11. hbase.regionserver.fsWriteLatency_num_ops
    9.2.12. hbase.regionserver.memstoreSizeMB
    9.2.13. hbase.regionserver.regions
    9.2.14. hbase.regionserver.requests
    9.2.15. hbase.regionserver.storeFileIndexSizeMB
    9.2.16. hbase.regionserver.stores
    9.2.17. hbase.regionserver.storeFiles
10. Cluster Replication
11. Data Model
  11.1. Conceptual View
  11.2. Physical View
  11.3. Table
  11.4. Row
  11.5. Column Family
  11.6. Cells
  11.7. Versions
    11.7.1. Versions and HBase operations
    11.7.2. Current Limitations
12. Architecture
  12.1. Client
    12.1.1. Connections
    12.1.2. Write buffering and batch operations
    12.1.3. Filters
  12.2. Daemons
    12.2.1. Master
    12.2.2. RegionServer
  12.3. Regions
    12.3.1. Region Size
    12.3.2. Region Splits
    12.3.3. Region Load Balancer
    12.3.4. Store
  12.4. Write Ahead Log (WAL)
    12.4.1. Purpose
    12.4.2. WAL Flushing
    12.4.3. WAL Splitting
13. Performance Tuning
  13.1. Java
    13.1.1. Garbage collection and HBase
  13.2. Configuration
    13.2.1. Number of Regions
    13.2.2. Managing Compactions
    13.2.3. Compression
    13.2.4. hbase.regionserver.handler.count
    13.2.5. hfile.block.cache.size
    13.2.6. hbase.regionserver.global.memstore.upperLimit
    13.2.7. hbase.regionserver.global.memstore.lowerLimit
    13.2.8. hbase.hstore.blockingStoreFiles
    13.2.9. hbase.hregion.memstore.block.multiplier
  13.3. Number of Column Families
  13.4. Data Clumping
  13.5. Batch Loading
    13.5.1. Table creation: pre-creating regions
  13.6. HBase Client
    13.6.1. AutoFlush
    13.6.2. Scan Caching
    13.6.3. Scan Attribute Selection
    13.6.4. Close ResultScanners
    13.6.5. Block Cache
    13.6.6. Optimal loading of row keys
14. Bloom Filters
  14.1. Configuration
    14.1.1. HColumnDescriptor option
    14.1.2. io.hfile.bloom.enabled global kill switch
    14.1.3. io.hfile.bloom.error.rate
    14.1.4. io.hfile.bloom.max.fold
  14.2. Bloom StoreFile footprint
    14.2.1. BloomFilter in the StoreFile FileInfo data structure
    14.2.2. BloomFilter entries in StoreFile meta data
15. HBase Troubleshooting and Debugging
  15.1. General Guidelines
  15.2. Logs
    15.2.1. Log Locations
  15.3. Tools
    15.3.1. search-hadoop.com
    15.3.2. tail
    15.3.3. top
    15.3.4. jps
    15.3.5. jstack
    15.3.6. OpenTSDB
    15.3.7. clusterssh+top
  15.4. Client
    15.4.1. ScannerTimeoutException
  15.5. RegionServer
    15.5.1. Startup Errors
