Read about hive hbase integration performance, The latest news, videos, and discussion topics about hive hbase integration performance from alibabacloud.com
The purpose of configuring hive and HBase integration is to use the HQL syntax to implement additions and deletions of hbase database, the basic principle is to use the API interface between them to communicate with each other, the communication is mainly dependent on the Hive_hbase-handler.jar tool class. Note, howeve
Opening Remarks: Hive and HBase integration function is to use their own external API interface communication, mutual communication is mainly rely on hive_hbase-handler.jar tool class (HiveStorageHandlers: there is still some interest in hive_hbase-handler.jar this stuff, free to grind.
Opening Remarks: Hive and
Document directory
1. Hadoop and Hbase have been installed successfully.
2. Copy the hbase-0.90.4.jar and zookeeper-3.3.2.jar to hive/lib.
3. Modify the hive-site.xml file in hive/conf and add the following content at the bottom:
4. Copy the
Hive integration with hbase
ConfigurationSet hbase. jar package with the actual installation of hbase jar package replacement CD/opt/hive/lib/ls hbase-0.94.2 * Rm-RF
, b.aae140,b.aae149, b.baa044,b.baa035, b.baa036,b.aae034 from Jinan_si3u_ac01 a right JOIN jinan_si3u_ac06_temp b On a.aac001 =b.aac001;
Table name
Column Name
View Name
Jinan_si3u_ac01
AAC001AAC003AAA109
Fact_view
Jinan_si3u_ac06_temp
AAE140AAE149BAA044BAA035BAA036AAE034
Through select * from Fact_view; You can query the valid data.KYLIN use of the hive vie
First, Prepare the EnvironmentThe prerequisites have been installed. HBase , HiveDetection Hive/lib does the directory contain: hive-hbase-handler- version . Jar ( through this file and hbase communication )Second, Copy Files Enter the lib directory of
Hive Integration with HBaseCREATE Table Lectrure.hbase_lecture10 (sname string, score int) stored by ' Org.apache.hadoop.hive.hbase.HBaseStorageHandler ' whth serdeproperties ("hbase.columns.mapping" = ': Key,cf1:score ')Tblproperties ("hbase.table.name" = "hbase_lecture10");With serdeproperties: Specifies the attribute, which specifies the field mappings for the HBase
Hive and HBase integration, query exception, re-execute the Hive statistical command, there may be exceptions, because my MapReduce has been used before, and has already added pro to Hadoop
Hive and HBase
The integration of hive and hbase is to use their own external API interface to communicate with each other, communication is mainly dependent on hive-hbase-handler.jar tools; hive-hbase
Written in front one:
In this paper, hive and hbase are integrated so that hive can read the data in HBase, so that the two most commonly used frameworks in the Hadoop ecosystem are combined to complement each other.
Written in front two:
Use software description
To contract all software storage directory:
/ho
$ sudo/etc/init.d/hbase-master Start$ sudo/etc/init.d/hbase-thrift Start$ sudo/etc/init.d/hbase-rest Start
Only on the Hadoop-node.
The code is as follows
Copy Code
$ sudo/etc/init.d/hbase-regionserver Start
10. View the status of the serviceView http://hado
Hive Integrated HBase principle
Hive is a data Warehouse tool based on Hadoop that maps structured data files to a database table and provides complete SQL query functionality that translates SQL statements into MapReduce tasks. The advantage is that the learning cost is low, the simple mapreduce statistic can be realized quickly by SQL statement, and it is very
Hive Integrated HBase effectively leverages the storage attributes of the HBase database, such as row updates and column indexes. Keep in mind the consistency of the HBase jar packages during integration. the implementation of the integr
: hive is highly-delayed, structured, and analysis-oriented, while hbase is low-latency, unstructured, and programming-oriented. Hive Data Warehouses have high latency on hadoop.
Hbase is located at the structured storage layer, and hadoop HDFS provides hbase with high-reli
Label:Execute SQL statements using hive or Impala to manipulate data stored in HBaseHiveImpalaHBase HiveQL大数据
Execute SQL statements using hive or Impala to manipulate data stored in HBase
0. Abstract
First, the basic environment
Ii. data stored in HBase, using
. Merge. mapfiles = false
Set hive. merge. mapfiles = false; insert into Table hive_user_info partition (Dt = '$ {date}') Select udid, if (jailbreak = 0, 1), Concat (DT, '', hour, ':', time_minute), 0, device_id, '2', null from show_log where dt = '$ {date}' and udid! = 'Null' and udid! = "";
This problem found in hive-0.13.0 and integration testing for
latency of MapReduce.To achieve Impala and HBase integration, we can obtain the following benefits:
We can use familiar SQL statements. Like traditional relational databases, it is easy to provide SQL Design for complex queries and statistical analysis.
Impala query statistics and analysis is much faster than native MapReduce and Hive.
To integrate Impala wi
Warehouse is high latency on Hadoop.Where HBase is located in a structured storage tier, Hadoop HDFS provides high-reliability, low-level storage support for HBase, and Hadoop MapReduce provides high-performance computing power for HBase. Zookeeper provides a stable service and failover mechanism for hbase.In addition
ConferenceJDBC-based implementation? Works with your popular database vendorsAuto-generation of tedious user-side code? Write MapReduce applications to work with your data, fasterIntegration with Hive? Allows you to stay in a SQL-based environmentExtensible backend? Database-specific code paths for better performance
Detailed operation manual:Http://archive.cloudera.com/cdh/3/sqoop/SqoopUserGuide.html (of
The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion;
products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the
content of the page makes you feel confusing, please write us an email, we will handle the problem
within 5 days after receiving your email.
If you find any instances of plagiarism from the community, please send an email to:
info-contact@alibabacloud.com
and provide relevant evidence. A staff member will contact you within 5 working days.