Name	Command-expression
Create a table	Create ' table name ', ' column name 1 ', ' Column Name 2 ', ' Column name n '
Add a record	Put ' table name ', ' Row name ', ' Column name: ', ' value '
View Records	Get ' table name ', ' Row name '
View the total number of records in a table	Count ' table name '
Deleting records	Delete ' table name ', ' Row name ', ' column name '
Delete a table	To block the table before it can be deleted, the first step disable ' table name ' Second step drop ' table name '
View all records	Scan "Table name"
View all data in a column of a table	Scan "table name", [' Column name: ']
Update record	is to rewrite it again to overwrite

If you are a novice team some of the commands of HBase are not very familiar, you can enter HBase Shell mode you can input the help command to see the commands you can execute and instructions to the command, for example, scan this command, in Help not only mention the command, It also explains in detail the parameters and functions that can be used in the scan command, such as how to query by column name and how to use the limit, StartRow:

Scan   Scan a table, pass table name and optionally a dictionary of scanner specifications.  
 Scanner specifications may include one or more of the  Following:limit, StartRow, Stoprow, TIMESTAMP, or COLUMNS.
 If no columns is specified, all columns'll be scanned.  To scan all members of a column family, leave the 
 qualifier empty as in  ' col_family: '.  Examples:
   hbase> scan '. META. '
   Hbase> Scan '. META. ', {COLUMNS = ' info:regioninfo '}
   hbase> scan ' t1 ', {COLUMNS = [' C1 ', ' C2 '], LIMIT = ten, StartRow = ' xyz '}

using the Java API to manipulate hbase servers

The following jar packages are required

     Hbase-0.20.6.jar Hadoop-core-0.20.1.jar Commons-logging-1.1.1.jar Zookeeper-3.3.0.jar log4j-1
 .2.91.jar import Org.apache.hadoop.conf.Configuration;
 Import org.apache.hadoop.hbase.HBaseConfiguration;
 Import Org.apache.hadoop.hbase.HColumnDescriptor;
 Import Org.apache.hadoop.hbase.HTableDescriptor;
 Import Org.apache.hadoop.hbase.KeyValue;
 Import Org.apache.hadoop.hbase.client.HBaseAdmin;
 Import org.apache.hadoop.hbase.client.HTable;
 Import Org.apache.hadoop.hbase.client.Result;
 Import Org.apache.hadoop.hbase.client.ResultScanner;
 Import Org.apache.hadoop.hbase.client.Scan;
 
Import Org.apache.hadoop.hbase.io.BatchUpdate;
     @SuppressWarnings ("deprecation") public class Hbasetestcase {static hbaseconfiguration cfg = null;
         static {Configuration Hbase_config = new configuration ();
         Hbase_config.set ("Hbase.zookeeper.quorum", "192.168.50.216");
   Hbase_config.set ("Hbase.zookeeper.property.clientPort", "2181");      CFG = new Hbaseconfiguration (hbase_config); }/** * Create a table */public static void Creattable (String tablename) throws Exception {HBas
         Eadmin admin = new hbaseadmin (CFG);
         if (admin.tableexists (tablename)) {System.out.println ("table Exists!!!");
             } else{Htabledescriptor Tabledesc = new Htabledescriptor (tablename);
             Tabledesc.addfamily (New Hcolumndescriptor ("Name:"));
             Admin.createtable (TABLEDESC);
         SYSTEM.OUT.PRINTLN ("CREATE table OK."); }}/** * Add a data */public static void AddData (String tablename) throws Exceptio
              n{htable table = new htable (CFG, tablename);  
             BatchUpdate update = new BatchUpdate ("Huangyi");  
             Update.put ("Name:java", "http://www.javabloger.com". GetBytes ());  
         Table.commit (update);
     SYSTEM.OUT.PRINTLN ("Add data ok.");
  }   
    /** * Show All data */public static void Getalldata (String tablename) throws exception{Htabl
          e table = new htable (CFG, tablename);
          Scan s = new scan ();
          Resultscanner SS = Table.getscanner (s);
                 for (Result r:ss) {KeyValue Kv:r.raw ()) {System.out.print (New String (Kv.getcolumn ()));
              System.out.println (New String (Kv.getvalue ())); }}} public static void Main (String [] agrs) {try {string ta
                 Blename= "TableName";
                 Hbasetestcase.creattable (tablename);
                 Hbasetestcase.adddata (tablename);
             Hbasetestcase.getalldata (tablename);
         } catch (Exception e) {e.printstacktrace ();  }
         
    }
     
}

3-hbase Optimization Tips

This article from a few aspects of the simple talk about hbase some of the optimization techniques, only as part of my study notes, because learning more afraid of forgetting, left to see for themselves.

1 modifying Linux system Parameters

The maximum number of open files in Linux system the default parameter value is 1024, if you do not make changes to the concurrency will appear when the "Too many Open files" error, resulting in the entire hbase is not operational, you can use the Ulimit-n command to modify, or modify/ etc/security/limits.conf and/proc/sys/fs/file-max parameters, specifically how to modify can go to google keyword "Linux limits.conf"

2 JVM Configuration

Modify the configuration parameters in the hbase-env.sh file to configure the appropriate parameters based on your machine hardware and the JVM (32/64 bits) of the current operating system

Hbase_heapsize 4000 HBase The size of the JVM heap used

Hbase_opts "‐SERVER‐XX:+USECONCMARKSWEEPGC" JVM GC option

Hbase_manages_zkfalse whether to use zookeeper for distributed management

3 HBase Persistence

After restarting the operating system the data in HBase is completely absent, you can create a table, write a piece of data without making any changes, then restart the machine, reboot and then you enter HBase's shell using the list command to see the current table, none of them. It's not quite a cup. It doesn't matter. You can set the Hbase.rootdir value in Hbase/conf/hbase-default.xml to specify a folder where the file is saved, for example: <value>file:///you/ Hbase-data/path</value>, the tables and data in HBase that you created are written directly to your disk, as shown in the figure:

You can also specify the path to your distributed file system HDFs such as: Hdfs://namenode_server:port/hbase_rootdir, which is written on your distributed file system.

4 Configuring HBase Run Parameters

Next you need to configure the Hbase/conf/hbase-default.xml file, the following are the more important configuration parameters I think

Hbase.client.write.buffer

Description: This parameter can be set to the size of the write data buffer, when the client and server to transmit data, the server in order to improve the performance of the system to open a write buffer to handle it, this parameter setting if the large set, will have a certain system memory requirements, directly affect the performance of the system.

Hbase.master.meta.thread.rescanfrequency

Description: How long hmaster the system table root and Meta scan once, this parameter can be set a bit longer, reduce the system energy consumption.

Hbase.regionserver.handler.count

Description: Because the Hbase/hadoop server is designed with multiplexed, non-blocking I/O mode, it can be processed through a thread, but because the method that the client side calls is blocking I/O, it is designed to place the object passed by the client first in the queue, and a heap of handler (Thread) is generated when the server is started, and the handler is polling to get the object and execute the corresponding method, which defaults to 25 , you can set a larger number according to the actual scenario.

Hbase.regionserver.thread.splitcompactcheckfrequency

Description: This parameter is the time interval for how often to regionserver a server to run a split/compaction, although a compact operation will be performed before split. The compact operation may be minor The compact may also be major compact.compact after Will take midkey from all the storefile files in all the stores. This midkey may not be in the full data mid. The following data for a row-key may cross different hregion.

Hbase.hregion.max.filesize

Description: Hstorefile maximum value in Hregion, the column family in any table will be sliced once it exceeds this size, and the default size of Hstroefile is 256M.

Hfile.block.cache.size

Description: Specifies the percentage that the Hfile/storefile cache allocates in the JVM heap, the default value is 0.2, which means 20%, and if you set it to 0, the option is masked.

Hbase.zookeeper.property.maxClientCnxns

Description: The option for this configuration is from zookeeper, which indicates the number of concurrent connections that the Zookeeper client accesses concurrently, and zookeeper is an entry for hbase. The value of this parameter can be enlarged appropriately.

Hbase.regionserver.global.memstore.upperLimit

Description: The size parameter configuration for all memstores in the region server, the default value is 0.4, which means 40%, and if set to 0, is the option to mask.

Hbase.hregion.memstore.flush.size

Description: The cached content in Memstore will be written to disk after it has exceeded the configured range, for example: The delete operation is written in Memstore, indicating that value, column or family are to be deleted, and hbase periodically makes a major of the stored file Compaction, at that time HBase will brush memstore into a new hfile storage file. If the major compaction is not done within a certain time frame, and the range of Memstore is written to disk.

5 log4j logs in HBase

The log output level in HBase is opened by default for debug, info-level logs, and the log level can be adjusted to suit your needs, and HBase's log4j log configuration file is in the Hbase\conf\log4j.properties directory.

4– Storage

A table created in HBase can be distributed across multiple hregion, saying that a table can be split into chunks, each of which calls us a hregion. Each hregion will save a table inside a contiguous data, the user created the large table of each hregion block is provided by the Hregion server maintenance, Access Hregion block is to go through the hregion server, A hregion block corresponds to a hregion server, and a complete table can be saved on multiple hregion. The correspondence between Hregion Server and region is a one-to-many relationship. Each hregion is physically divided into three parts: Hmemcache (Cache), Hlog (log), Hstore (persistence layer).
These relationships look like this in my mind, as shown in the figure:

The relationship between 1.HRegionServer, Hregion, Hmemcache, Hlog, Hstore, as shown in the figure:

The distribution of the data in the 2.HBase table with the Hregionserver, as shown in the figure:

HBase Read Data

HBase reads the data first and reads the contents of the Hmemcache, and if it does not fetch the data in the Hstore, it improves the performance of the data read.

HBase writes data

HBase writes the data to Hmemcache and Hlog, hmemcache the cache, hlog the transaction log for synchronization Hmemcache and Hstore, and when the flush cache is launched, the data is persisted to Hstore, and empty the Hmemecache.

When the client accesses the data through Hmaster, each hregion server maintains a long connection to the Hmaster server, and Hmaster is the manager of the HBase distributed system, his main task is to tell each hregion The server it wants to maintain which hregion. These data of the user can be saved on the Hadoop Distributed file system. If the primary server Hmaster freezes, the entire system will be invalid. Below I will consider how to solve the problem of hmaster SPFO, this problem is a bit similar to the SPFO problem of Hadoop, just a namenode maintenance of the global Datanode,hdfs once the crash all hung up, some people say that the use of heartbeat to solve the problem , but I always want to find out other solutions, more time, there is always a way.

Yesterday in the environment of hadoop-0.21.0, hbase-0.20.6 for a long time, has been an error message, the following:

 Exception in thread ' main ' Java.io.IOException:Call to localhost/serv6:9000 failed On local exception:java.io.EOFException 10/11/10 15:34:34 ERROR master. Hmaster:can Not start master java.lang.reflect.InvocationTargetException at sun.reflect.NativeConstructorAccess Orimpl.newinstance0 (Native Method) at Sun.reflect.NativeConstructorAccessorImpl.newInstance (nativeconstructoracce ssorimpl.java:39) at Sun.reflect.DelegatingConstructorAccessorImpl.newInstance (Delegatingconstructoraccessorimpl . java:27) at Java.lang.reflect.Constructor.newInstance (constructor.java:513) at Org.apache.hadoop.hbase. Master. 
  Hmaster.domain (hmaster.java:1233) at Org.apache.hadoop.hbase.master.HMaster.main (hmaster.java:1274)

Dead or Alive connection is not on HDFs, also can't connect hmaster, depressed ah.

I think about it, think slowly, my eyes bright java.io.EOFException This exception, is it possible that the RPC protocol format inconsistency caused. This means that the server side and the client version are inconsistent. For an HDFS server side, everything is good, sure enough is the version of the problem, and finally use hadoop-0.20.2 collocation hbase-0.20.6 more stable.

The final effect is as shown in the figure:

Some textual descriptions of the above figure: the Hadoop version is 0.20.2, the hbase version is 0.20.6, a table tab1 is created in HBase, exiting the HBase shell environment, viewed with the Hadoop command, Files in the file system that's a lot more. A newly created TAB1 directory, the above picture illustrates HBase running in the Distributed File System Apache HDFs.

5 (cluster)-pressure load and fail forward

In the previous article on HBase, which described the architecture of hbase in a distributed environment, this article will explain how HBase eliminates single point of failure in distributed environments (SPFO), doing a small experiment about HBase's high availability in a distributed environment, seeing some phenomena in its own eyes, Extend some of the topics of thought.

Let's recap. HBase main components: Hbasemaster hregionserver hbase Client hbase Thrift server hbase REST Server

Hbasemaster

Hmaster is responsible for assigning areas to Hregionserver, and is responsible for load balancing hreginserver in the cluster environment, Hmaster is also responsible for monitoring the hreginserver in the cluster environment, If a hreginserver down, Hbasemaster will not use the Hreginserver to provide services Hlo

This article is an English version of an article which is originally in the Chinese language on aliyun.com and is provided for information purposes only. This website makes no representation or warranty of any kind, either expressed or implied, as to the accuracy, completeness ownership or reliability of the article or any translations thereof. If you have any concerns or complaints relating to the article, please send an email, providing a detailed description of the concern or complaint, to info-contact@alibabacloud.com. A staff member will contact you within 5 working days. Once verified, infringing content will be removed immediately.

Related Keywords:

discover introductory rate chapter customizer chapter 7 chapter test b geometry walden chapter 1 all chapter books precalculus chapter 3 chapter 10 nervous system

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

What's Trending

Top 10 Tags

datastax versions naming convention zookeeper client class definition md5 microsoft sql server 2005 data structures exception handling error handling

Top 10 Keywords

microsoft download center down wordpress address url site address url wordpress address url windows installer 4 0 download 302 not found web address url definition site address url wordpress db2 integer mac os installation step by step pdf abbreviation for return

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

Get Started for Free

Sales Support

1 on 1 presale consultation

Chat Contact Sales
After-Sales Support

24/7 Technical Support 6 Free Tickets per Quarter Faster Response

Open a Ticket
Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.

Learn More

Base Introductory Chapter

Contact Us

What's Trending

Top 10 Tags

Top 10 Keywords

Trending Topic

A Free Trial That Lets You Build Big!

Sales Support

After-Sales Support