Hive entry 3-Integration of Hive and HBase

Source: Internet
Author: User
Opening remarks:
Hive and HBase are integrated by having the two systems communicate through their public API interfaces. That communication relies mainly on the utility classes in hive_hbase-handler.jar (see Hive Storage Handlers).

Aside:
I am still quite curious about hive_hbase-handler.jar and will dig into it when I have free time.

I. Two considerations:
1. The required software is Hadoop, Hive, HBase, and ZooKeeper. Integrating Hive with HBase requires Hive 0.6.0 or later, so do not download an older release. Hive 0.6.0 is the first version that supports connecting to HBase, which is why its lib directory contains the additional hive_hbase-handler.jar package; this jar is Hive's storage handler extension. HBase 0.20.6 is recommended. I did not start an HDFS cluster this time; the entire test environment runs on a single machine.

2. When you run Hive, the following error may occur, indicating that the JVM cannot allocate the requested maximum heap size. The error message is as follows:
Invalid maximum heap size: -Xmx4096m
The specified size exceeds the maximum representable size.
Could not create the Java virtual machine.

Solution:
In /work/hive/bin/ext, open util/execHiveCmd.sh (for example with vim util/execHiveCmd.sh) and go to line 33.
Change
HADOOP_HEAPSIZE=4096
to
HADOOP_HEAPSIZE=256

Also add export HIVE_HOME=/work/hive to /etc/profile.
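The heap-size change described above can also be scripted. The following is a minimal Python sketch, assuming the script contains a line of the form HADOOP_HEAPSIZE=<number>, optionally preceded by export. The function name set_heapsize and the sample text are illustrative only and are not part of Hive.

```python
import re

def set_heapsize(script_text: str, megabytes: int) -> str:
    """Return script_text with every HADOOP_HEAPSIZE assignment set to `megabytes`."""
    pattern = re.compile(
        r"^([ \t]*(?:export[ \t]+)?HADOOP_HEAPSIZE[ \t]*=[ \t]*)\d+",
        re.MULTILINE,
    )
    # A function is used as the replacement so the new number cannot be
    # misread as part of a group reference.
    return pattern.sub(lambda m: m.group(1) + str(megabytes), script_text)

# Hypothetical excerpt standing in for util/execHiveCmd.sh
sample = "# heap for the Hive CLI\nexport HADOOP_HEAPSIZE=4096\n"
print(set_heapsize(sample, 256))
```

To apply it to the real file, read the file contents, pass them through set_heapsize, and write the result back after making a backup.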

II. Start the runtime environment
1. Start Hive
hive --auxpath /work/hive/lib/hive_hbase-handler.jar,/work/hive/lib/hbase-0.20.3.jar,/work/hive/lib/zookeeper-3.2.2.jar -hiveconf hbase.master=127.0.0.1:60000
This loads the jars Hive needs and points Hive at the address of the HBase master. My HBase master and Hive run on the same machine, so I point to the local host.

2. Start HBase
/work/hbase/bin/hbase master start

3. Start ZooKeeper
/work/zookeeper/bin/zkServer.sh start
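The Hive launch command from step 1 is easy to mistype because the jar list must be comma-separated with no spaces. As a rough sketch, the argument list can be assembled in Python, assuming the Hive CLI flags --auxpath and -hiveconf. The helper name hive_command is hypothetical.

```python
def hive_command(aux_jars, hbase_master):
    """Build the argument list for launching the Hive CLI with extra jars."""
    return [
        "hive",
        "--auxpath", ",".join(aux_jars),          # comma-separated, no spaces
        "-hiveconf", f"hbase.master={hbase_master}",
    ]

# Jar paths and master address taken from the walkthrough above
jars = [
    "/work/hive/lib/hive_hbase-handler.jar",
    "/work/hive/lib/hbase-0.20.3.jar",
    "/work/hive/lib/zookeeper-3.2.2.jar",
]
print(" ".join(hive_command(jars, "127.0.0.1:60000")))
```

The returned list can be passed to subprocess.run as-is, which avoids shell quoting problems.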

III. Execution
Create a table in Hive that is associated with an HBase table
CREATE TABLE hbase_table_1(key int, value string) STORED BY 'org.apache.hadoop.hive.hbase.HBaseStorageHandler' WITH SERDEPROPERTIES ("hbase.columns.mapping" = ":key,cf1:val") TBLPROPERTIES ("hbase.table.name" = "xyz");
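Since the statement above is sensitive to spacing and capitalization inside the quoted strings, it can help to generate it. The following Python sketch builds the same kind of statement from a column list and a mapping list. The function name hbase_table_ddl and its parameters are illustrative assumptions, and the sketch does not validate identifiers.

```python
def hbase_table_ddl(hive_table, columns, mapping, hbase_table):
    """Build a CREATE TABLE statement for a Hive table backed by HBase.

    columns: list of (name, hive_type) pairs
    mapping: list of HBase targets (":key" or "family:qualifier"), one per column
    """
    if len(columns) != len(mapping):
        raise ValueError("each Hive column needs exactly one HBase mapping entry")
    cols = ", ".join(f"{name} {hive_type}" for name, hive_type in columns)
    mapping_str = ",".join(mapping)  # no spaces inside the mapping string
    return (
        f"CREATE TABLE {hive_table}({cols}) "
        "STORED BY 'org.apache.hadoop.hive.hbase.HBaseStorageHandler' "
        f'WITH SERDEPROPERTIES ("hbase.columns.mapping" = "{mapping_str}") '
        f'TBLPROPERTIES ("hbase.table.name" = "{hbase_table}");'
    )

print(hbase_table_ddl(
    "hbase_table_1",
    [("key", "int"), ("value", "string")],
    [":key", "cf1:val"],
    "xyz",
))
```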

Create an ordinary table in Hive and load data into it
Create the table
CREATE TABLE pokes (foo INT, bar STRING);
Import the data
LOAD DATA LOCAL INPATH '/work/hive/examples/files/kv1.txt' OVERWRITE INTO TABLE pokes;

Insert data into the table that links Hive and HBase
INSERT OVERWRITE TABLE hbase_table_1 SELECT * FROM pokes WHERE foo=98;

The insert runs as a MapReduce job, which writes the selected rows into HBase.

Run scan 'xyz' and describe 'xyz' in the HBase shell to view the table's data and structure.

xyz is the table created in HBase through Hive. The column mapping is specified in the Hive CREATE TABLE statement with "hbase.columns.mapping" = ":key,cf1:val", and the name of the table created in HBase is specified with "hbase.table.name" = "xyz".
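To make the mapping concrete: the entries in hbase.columns.mapping are matched to the Hive columns by position, so :key pairs with the first column (key) and cf1:val pairs with the second (value). The Python sketch below illustrates that pairing. It is a simplified model with a hypothetical function name, it handles only the :key and family:qualifier forms used in this article, and it is not the handler's actual parser.

```python
def parse_columns_mapping(hive_columns, mapping):
    """Pair each Hive column with its HBase target from a mapping string."""
    targets = [t.strip() for t in mapping.split(",")]
    if len(targets) != len(hive_columns):
        raise ValueError("mapping must have one entry per Hive column")
    result = {}
    for column, target in zip(hive_columns, targets):
        if target == ":key":
            # The Hive column is stored as the HBase row key
            result[column] = ("row key", None, None)
        else:
            # The Hive column is stored in the cell family:qualifier
            family, _, qualifier = target.partition(":")
            result[column] = ("cell", family, qualifier)
    return result

print(parse_columns_mapping(["key", "value"], ":key,cf1:val"))
```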

Insert a record in HBase by running the put command in the HBase shell (the row key is 10001, so that the Hive query below can find it)
put 'xyz', '10001', 'cf1:val', 'www.javabloger.com'

Run a query in Hive to check whether the record inserted through HBase is visible
SELECT * FROM hbase_table_1 WHERE key = 10001;

Final Effect
The integration process and operation steps above are now complete. Records added through Hive appear in HBase, and records added in HBase are visible from Hive, which shows that Hive and HBase are successfully integrated. This means we can write massive amounts of data into HBase and query it through Hive. HBase does not support complex queries, but it can fetch one or more rows by key, scan ranges of keys, and apply filters. Complex queries can be handled by Hive, so one system serves as the storage entry point (HBase) and the other as the query entry point (Hive).
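The three HBase access patterns mentioned above (get by key, scan a key range, filter) can be illustrated with a toy in-memory model. This Python sketch is not the HBase API. The class SortedRowStore and the sample rows are made up for illustration, and the only HBase behavior it imitates is that rows are kept sorted by row key in lexicographic order.

```python
import bisect

class SortedRowStore:
    """Toy stand-in for a table whose rows are kept sorted by row key."""

    def __init__(self):
        self._keys = []   # row keys, kept in sorted (lexicographic) order
        self._rows = {}   # row key -> value

    def put(self, key, value):
        if key not in self._rows:
            bisect.insort(self._keys, key)
        self._rows[key] = value

    def get(self, key):
        """Point lookup by row key."""
        return self._rows.get(key)

    def scan(self, start, stop, row_filter=None):
        """Yield (key, value) for start <= key < stop, optionally filtered."""
        lo = bisect.bisect_left(self._keys, start)
        hi = bisect.bisect_left(self._keys, stop)
        for key in self._keys[lo:hi]:
            value = self._rows[key]
            if row_filter is None or row_filter(key, value):
                yield key, value

# Made-up sample rows; keys are strings, so "10001" sorts before "98"
store = SortedRowStore()
for k in ["98", "100", "10001"]:
    store.put(k, "v" + k)

print(store.get("98"))
print(list(store.scan("100", "2")))
print(list(store.scan("0", "99", lambda k, v: k.endswith("8"))))
```

Anything beyond these patterns, such as joins or aggregations, is the kind of query the article delegates to Hive.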


Haha, the above is just my own point of view.

I will keep updating this series later. Thank you for reading.

-End-

Original article address: Hive entry 3-Integration of Hive and HBase. Thank you for sharing it with me.
