Cloudera impala is an engine that runs distributed queries on HDFS and hbase.
This source is a snapshot of our internal development version. We regularly update the version.
This readme document describes how to use this source to build cloudera Impala. For more information, see:
Https://ccp.cloudera.com/display/IMPALA10BETADOC/Cloudera+Impala+1.0+Beta+Documentation
Build cloudera Impala on centos 6.2
Prerequisites:
Install necessary packages:
Sudo Yum install boost-test boost-Program-options libevent-devel automake libtool flex bison gcc-C ++ OpenSSL-devel \
Make cmake doxygen. x86_64 glib-devel boost-devel Python-devel bzip2-devel SVN libevent-devel cyrus-sasl-devel \
Wget git unzip
Install thrift 0.7.0
Note: We will support the updated thrift version in the future, but now we have only tested it for 0.7.0.
Wget http://archive.apache.org/dist/thrift/0.7.0/thrift-0.7.0.tar.gz
Tar xvzf thrift-0.7.0.tar.gz
D thrift-0.7.0
Chmod 755 configure
./Configure -- With-pic
Make
Sudo make install
CD contrib/fb303
Chmod 755./Bootstrap. Sh
./Bootstrap. Sh
Chmod 755 configure
./Configure
Make
Sudo make install
Install llvm
Wget http://llvm.org/releases/3.0/llvm-3.0.tar.gz
Tar xvzf llvm-3.0.tar.gz
CD llvm.3.0.src/tools
SVN Co http://llvm.org/svn/llvm-project/cfe/tags/RELEASE_30/final/ clang
CD ../Projects
SVN Co http://llvm.org/svn/llvm-project/compiler-rt/tags/RELEASE_30/final/ compiler-RT
CD ..
./Configure -- With-pic
Make
Sudo make install
Install JDK
Confirm that Oracle Java Development Kit 6 is installed, and the environment variable has been set.
Http://blog.csdn.net/larrylgq/article/details/7261187
Install Maven
Wget http://mirrors.tuna.tsinghua.edu.cn/apache/maven/maven-3/3.0.4/binaries/apache-maven-3.0.4-bin.tar.gz
Tar xvf apache-maven-3.0.4.tar.gz & sudo MV apache-maven-3.0.4/usr/local
Add environment variables to. bashrc
Export m2_home =/usr/local/apache-maven-3.0.4
Export m2 = $ m2_home/bin
Export Path = $ M2: $ path
Run:
Source ~ /. Bashrc
MVN-version
Print:
Apache Maven 3.0.4 (r1232337; 16:44:56 + 0800)
Maven home:/usr/local/apache-maven-3.0.4
Java version: 1.7.0, vendor: Oracle Corporation
Java home:/usr/Java/jdk1.7.0/JRE
Default locale: zh_cn, platform encoding: UTF-8
OS name: "Linux", version: "2.6.32-279.14.1.el6.x86 _ 64", arch: "amd64", family: "Unix"
Clone Impala source code
Git clone https://github.com/cloudera/impala.git
After cloning the file, you must change the file owner to the current Installer (root by default)
Set Impala Environment Variables
CD impala
. Bin/impala-config.sh
Confirm the environment variables:
Env | grep "Impala. * version"
Print:
Impala_cyrus_sasl_version = 2.1.23
Impala_hbase_version = 0.92.1-cdh4.1.0
Impala_snappy_version = 1.0.5
Impala_gtest_version = 1.6.0
Impala_gperftools_version = 2.0
Impala_gflags_version = 2.0
Impala_glog_version = 0.3.2
Impala_hadoop_version = 2.0.0-cdh4.1.0
Impala_hive_version = 0.9.0-cdh4.1.0
Impala_mongoose_version = 3.3
Impala_thrift_version = 0.7.0
Download required third-party packages
CD thirdparty
./Download_thirdparty.sh
Construct impala
CD $ {impala_home}
./Build_public.sh-build_thirdparty
Package
After successful build, there should be a binary package under $ {impala_home}/be/build/debug/service
You can run the following script to start an Impala background.
$ {Impala_home}/bin/ start-impalad.sh-use_statestore = false
Note: The start-impalad.sh sets the environment variables required for Impala to run successfully
Configure Impala to use HDFS, hbase, or hive MetaStore to store related configuration files in classpath through the bin/set-classpath.sh. We use Fe/src/test/resources internally for this purpose.
Impala Shell
Impala shell is a convenient command interface. You canProgramStart it:
$ {Impala_home}/bin/impala-shell.sh
By: https://github.com/cloudera/impala