Original connection http://xiguada.org/carbondata_compile/
What's carbondata? Carbondata is a fully indexed columnar and Hadoop native data-store for processing heavy analytical workloads and detailed Queries on big data. In customer benchmarks, carbondata have proven to manage petabyte of data running on extraordinarily low-cost hardware and Answers queries around times faster than the current open source solutions (column-oriented SQL on Hadoop data-stores). Compile and install I want to try it quickly, but the official website actually did not have a ready to compile the project, no way, can only compile a. Installation takes three steps (of course, jdk7 or jdk8,,maven more than 3.3)-Download the Spark 1.5.0 or the updated version. -Download and install Apache Thrift 0.9.3, and confirm the addition to the system path. -Download apache carbondata code and compile. 1 Spark can be downloaded directly, after decompression set path executable Spark-submit. 2 Install thrift before you need to install dependencies on my virtual machine ah Ubuntu under install dependent commands are as follows. Sudo apt-get install libboost-dev libboost-test-dev libboost-program-options-dev libevent-dev automake libtool flex bison pkg-config g++ Libssl-dev then compile the installation under Thrift./configuresudo Makesudo make install 3 compile carbondatamvn-dskiptests-pspark-1.6-dspark.version=1.6.2 clean package 4 into the bin directory, modify the/bin/spark-submit in the Carbon-spark-sql file, and change to Spark-submit 5 Generate sample.csv Files CD carbondatacat > Sample.csv << eofid,name,city,age1,david,shenzhen,312,eason,shenzhen,273, JARRY,WUHAN,35EOF&NBSP;6 executes ./carbon-spark-sqlspark-sql> CREATE table if not exists test_table (ID string, name string, City string, age Int) STORED by ' carbondata ' spark-sql> load Data inpath '. /sample.csv ' into table test_tablespark-sql> select City, AVG (age), sum (age) from test_table Group by city execution result she Nzhen 29.0 58wuhan 35.0 35 look and execute SP Arksql like, carbondata this middle do what, have what effect? Continue analysis later.
Carbondata Compilation and installation