(EXT) HBase Official documents

(EXT) HBase Official documents _ Chinese version

Last Update:2018-07-21 Source: Internet

Author: User

Tags garbage collection zookeeper

Developer on Alibaba Coud: Build your first app with APIs, SDKs, and tutorials on the Alibaba Cloud. Read more ＞

Source: http://www.yankay.com/wp-content/hbase/book.html

HBase Official documents

Revision History
Revision 0.90.4
Configuration, getting started with data model

Abstract

This is the official document for Apache HBase, HBase is a distributed, versioned (versioned), built in a column database on Apache Hadoop and Apache zookeeper.

I am (translator) familiar with HBase source code, engaged in the development of hbase work, if there is no clear, welcome to discuss. Mailbox yankaycom@gmail.com

Table of Contents Preface 1. Getting Started 1.1. Introduction to 1.2. Start 1.2.1 quickly. Download unpack the latest version of 1.2.2. Start HBase 1.2.3. Shell Practice 1.2.4. Stop HBase 1.2.5. What to do next 1.3. Slow start (relatively fast start) 1.3.1. The required software 1.3.2. HBase operating mode: stand-alone and distributed 1.3.3. Configuration Example 2. Upgrade 2.1. Upgraded from HBase 0.20.x or 0.89.x to HBase 0.90.x 3. Configure 3.1. hbase-site.xml and hbase-default.xml 3.1.1. HBase default configuration 3.2. hbase-env.sh 3.3. log4j.properties 3.4. Important configuration 3.5. The required configuration 3.6. The recommended configuration 3.6.1. zookeeper.session.timeout 3.6.2. hbase.regionserver.handler.count 3.6.3. The configuration of the large memory machine 3.6.4. Lzo compression 3.6.5. The larger regions 3.6.6. Management splitting 3.7. Connects the client configuration and dependency 3.7.1 of the HBase cluster. Java Client Configuration 4. The HBase Shell 4.1. Use Script 4.2. Shell tips 4.2.1.&NBSP;IRBRC 4.2.2. LOG Time Conversion 4.2.3. Debug 5. Build HBase 5.1. Add a HBase release to the Apache ' Maven Repository 6. Developers 6.1. IDEs 6.1.1. Eclipse 6.2. Unit Test 6.2.1. Mocito 7. HBase and MapReduce 7.1. The default HBase MapReduce separator (Splitter) 7.2. HBase Input MapReduce Example 7.3. Access the other HBase Tables 7.4 in a mapreduce job. Forecast execution 8. Schema Design for HBase 8.1. Schema to create 8.2. Number of Column families 8.3. monotonically increasing row keys/time series data (log) 8.4. Minimize the size of row and column by 8.5. Version of the time 9. Metrics 9.1. Metric installation 9.2. Regionserver Metrics 9.2.1. hbase.regionserver.blockcachecount 9.2.2. hbase.regionserver.blockcachefree 9.2.3. hbase.regionserver.blockcachehitratio 9.2.4. hbase.regionserver.blockcachesize 9.2.5. Hbase.regionserver.compactionQueueSize 9.2.6. hbase.regionserver.fsreadlatency_avg_time 9.2.7. Hbase.regionserver.fsReadLatency_num_ops 9.2.8. hbase.regionserver.fssynclatency_avg_time 9.2.9. Hbase.regionserver.fsSyncLatency_num_ops 9.2.10. hbase.regionserver.fswritelatency_avg_time 9.2.11. Hbase.regionserver.fsWriteLatency_num_ops 9.2.12. hbase.regionserver.memstoresizemb 9.2.13. Hbase.regionserver.regions 9.2.14. hbase.regionserver.requests 9.2.15. Hbase.regionserver.storeFileIndexSizeMB 9.2.16. hbase.regionserver.stores 9.2.17. Hbase.regionserver.storeFiles 10. Replicate 11 across clusters. Data Model 11.1. Concept View 11.2. Physical View 11.3. Table 11.4. Line 11.5. Column Family 11.6. Cells 11.7. Version 11.7.1. HBase operation (contains version operation) 11.7.2. Existing limit of 12. Schema 12.1. Client 12.1.1. Connect 12.1.2. Write buffering and bulk Operation 12.1.3. Filters 12.2. Daemons 12.2.1. Master 12.2.2. Regionserver 12.3. Regions 12.3.1. Region size 12.3.2. Region splits 12.3.3. Region load balancing 12.3.4. Store 12.4. Write ahead Log (WAL) 12.4.1. Objective 12.4.2. WAL Flushing 12.4.3. WAL splitting 13. Performance Tuning 13.1. Java 13.1.1. Garbage collection and HBase 13.2. Configure 13.2.1. The number of regions is 13.2.2. Manage compressed 13.2.3. Compressed 13.2.4. hbase.regionserver.handler.count 13.2.5. hfile.block.cache.size 13.2.6. Hbase.regionserver.global.memstore.upperLimit 13.2.7. hbase.regionserver.global.memstore.lowerlimit 13.2.8. hbase.hstore.blockingstorefiles 13.2.9. hbase.hregion.memstore.block.multiplier 13.3. Number of Column families 13.4. Data gathers 13.5. Batch loading 13.5.1. Table creation: Pre-created regions 13.6. HBase client 13.6.1. AutoFlush 13.6.2. Scan Caching 13.6.3. Scan property to select 13.6.4. Close Resultscanners 13.6.5. Block Cache 13.6.6. Load optimization for Row keys 14. Bloom Filters 14.1. Configure 14.1.1. hcolumndescriptor configuration 14.1.2. io.hfile.bloom.enabled Global shutdown switch 14.1.3. io.hfile.bloom.error.rate 14.1.4. io.hfile.bloom.max.fold 14.2. The BLOOMFILTER,&NBSP;FILEINFO data structure 14.2.2 in Bloom storefile footprint 14.2.1. storefile. Bloomfilter entries 15 in the StoreFile meta data. HBase Troubleshooting and Debug 15.1. General guideline 15.2. Logs 15.2.1. Log position 15.3. Tool 15.3.1. Search-hadoop.com 15.3.2. Tail 15.3.3. Top 15.3.4. JPS 15.3.5. Jstack 15.3.6. Opentsdb 15.3.7. Clusterssh+top 15.4. Client 15.4.1. Scannertimeoutexception 15.5. Regionserver 15.5.1. Startup error

This article is an English version of an article which is originally in the Chinese language on aliyun.com and is provided for information purposes only. This website makes no representation or warranty of any kind, either expressed or implied, as to the accuracy, completeness ownership or reliability of the article or any translations thereof. If you have any concerns or complaints relating to the article, please send an email, providing a detailed description of the concern or complaint, to info-contact@alibabacloud.com. A staff member will contact you within 5 working days. Once verified, infringing content will be removed immediately.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

Get Started for Free

Sales Support

1 on 1 presale consultation

Chat Contact Sales
After-Sales Support

24/7 Technical Support 6 Free Tickets per Quarter Faster Response

Open a Ticket
Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.

Learn More