What's Hive?
When you meet something new, the first question to answer is: what is it?
Here is the introduction from the Hive wiki:
Hive is a data warehouse infrastructure built on top of Hadoop. It provides tools to enable easy data ETL, a mechanism to put structures on the data, and the capability to query and analyze large data sets stored in Hadoop files. Hive defines a simple SQL-like query language, called QL, which enables users familiar with SQL to query the data. At the same time, this language also allows programmers who are familiar with the MapReduce framework to plug in their custom mappers and reducers to perform more sophisticated analysis that may not be supported by the built-in capabilities of the language.
In other words, Hive is a data warehouse infrastructure built on Hadoop. It provides a set of tools for extract, transform, and load (ETL) work, along with a mechanism for storing, querying, and analyzing large data sets held in Hadoop. Hive defines a simple SQL-like query language, called QL, that lets users who know SQL query the data. The same language also lets developers who are familiar with MapReduce plug in their own custom mappers and reducers to handle complex analysis that the built-in capabilities cannot.
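To make the custom-mapper idea more concrete, here is a minimal sketch of the kind of script a query could stream rows through. It assumes rows reach the script as tab-separated text, one row per line, which is the usual convention for streaming scripts. The column layout (a user id and a Unix timestamp) and the names `map_row` and `transform` are hypothetical and chosen only for illustration.

```python
from datetime import datetime, timezone


def map_row(line):
    """Turn 'user_id<TAB>unix_time' into 'user_id<TAB>weekday' (1 = Monday, 7 = Sunday)."""
    # Hypothetical two-column layout: user id, then a Unix timestamp in seconds.
    user_id, unix_time = line.rstrip("\n").split("\t")
    weekday = datetime.fromtimestamp(int(unix_time), tz=timezone.utc).isoweekday()
    return f"{user_id}\t{weekday}"


def transform(lines):
    """Apply map_row to every non-blank input line."""
    for line in lines:
        if line.strip():
            yield map_row(line)


if __name__ == "__main__":
    # In a real streaming job the input would be sys.stdin; a small in-memory
    # sample is used here so that the sketch runs on its own.
    sample = ["42\t0\n", "7\t86400\n"]
    for out in transform(sample):
        print(out)
```

In a query, a script like this would typically be attached with a clause of the form TRANSFORM (...) USING 'python mapper.py' AS (...); the script name and the columns here are placeholders, so check the Query Language guide for the exact syntax supported by your Hive version.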
Hive does not mandate that read or written data be in a "Hive format"; there is no such thing. Hive works equally well on Thrift, control-delimited, or your own specialized data formats. Please see File Format and SerDe in the Developer Guide for details.
In other words, Hive has no special data format of its own. It works well on Thrift data, on control-delimited text, and on data formats that users specify themselves.
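As an illustration of what "control-delimited" means in practice, the sketch below parses one line of such a text file. It assumes the field separator is Ctrl-A (byte 0x01), the default for Hive text tables, and that missing values are written as the two characters \N. The column names and the function name `parse_record` are hypothetical.

```python
FIELD_SEP = "\x01"   # Ctrl-A, assumed field separator
NULL_MARKER = "\\N"  # assumed text representation of a missing value


def parse_record(line, columns):
    """Split one control-delimited line into a dict keyed by column name."""
    values = line.rstrip("\n").split(FIELD_SEP)
    if len(values) != len(columns):
        raise ValueError(
            f"expected {len(columns)} fields, got {len(values)}"
        )
    # Every value stays a string; only the null marker is converted to None.
    return {
        name: (None if value == NULL_MARKER else value)
        for name, value in zip(columns, values)
    }


if __name__ == "__main__":
    # Hypothetical three-column record with a missing email.
    row = parse_record("1\x01alice\x01\\N\n", ["id", "name", "email"])
    print(row)
```

Because the separator is a control character that rarely appears in ordinary text, fields usually need no quoting or escaping, which is the main reason this layout is convenient for bulk data.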
Hive Resources
Hive itself comes with fairly rich documentation, and the following links cover the basics:
* Facebook mirror (may be blocked): http://mirror.facebook.com/facebook/hive
* Wiki page: http://wiki.apache.org/hadoop/Hive
* Getting Started Guide: http://wiki.apache.org/hadoop/Hive/GettingStarted
* Query Language guide: http://wiki.apache.org/hadoop/Hive/HiveQL
* Presentations: http://wiki.apache.org/hadoop/Hive/Presentations
* Roadmap: http://wiki.apache.org/hadoop/Hive/Roadmap
Most questions about using Hive and its features can be answered from the links above. Because Hive is under constant development, the documentation often lags behind the code. If you want to follow the latest Hive developments, or if you run into bugs, you can join the Hive mailing lists:
* user:hive-user@hadoop.apache.org
* developer:hive-dev@hadoop.apache.org
Downloading, Installing, and Configuring Hive
Please refer to the Getting Started Guide for details. The basic outline is:
* Install and configure Hadoop.
* Install and configure a database (MySQL, etc.).
* Get the Hive source or binaries:
    wget http://www.apache.org/dist/hadoop/hive/hive-0.5.0/hive-0.5.0-bin.tar.gz
    tar xzf hive-0.5.0-bin.tar.gz
    cd hive-0.5.0
* Configure how Hive accesses the database and how it accesses Hadoop.
* Run Hive.
When you see the Hive prompt 'hive> ', congratulations: you can start your journey with Hive.