Apache Kylin Single-node and multi-node installation

Source: Internet
Author: User

Kylin installation and usage guide
Kylin Introduction
What is Kylin?
Apache Kylin is an open source distributed analytics engine that was originally developed by eBay and contributed to the open source community. It provides a SQL query interface and multidimensional analysis (OLAP) capabilities on top of Hadoop to support large-scale data processing. It can handle analytic workloads on terabytes and even petabytes of data, query very large Hive tables with sub-second latency, and support high concurrency. Put simply, the core idea of Kylin is precomputation: the metrics used for multidimensional analysis are calculated in advance, and the results are saved as cubes that queries read directly. Expensive operations such as aggregations and multi-table joins are thereby turned into lookups of precomputed results, which is what gives Kylin its fast queries and high concurrency.
-Extensible, extremely fast OLAP engine:
Kylin is designed to reduce query latency on Hadoop for datasets with billions of rows
-ANSI SQL interface on Hadoop:
Kylin provides standard SQL on Hadoop and supports most query functions
-Interactive query capabilities:
With Kylin, users can interact with Hadoop data at sub-second latency, with better performance than Hive on the same data set
-Multidimensional cubes (MOLAP cube):
Users can define data models in Kylin and build cubes over datasets of more than ten billion rows
-Seamless integration with BI tools:
Kylin integrates with BI tools such as Tableau, and integration with other tools is planned
-Other features:
  -Job management and monitoring
  -Compression and encoding
  -Incremental updates
  -Use of HBase coprocessors
  -Distinct count approximation algorithm based on HyperLogLog
  -Friendly web interface to manage, monitor and use cubes
  -Access control at project and cube level
  -LDAP integration support

Kylin is a distributed analysis engine that trades space for time. Its core is the cube: for the chosen dimensions, the results of expected queries are computed in advance and stored in HBase. Different cubes are set up for different business scenarios so that the vast majority of query requirements can be met.

Kylin Usage Scenarios
Kylin is suitable for recurring queries over large volumes of data.

Advantages and disadvantages of Kylin
Cons: Every combination of dimensions has to be precomputed into a multidimensional index. Building this index offline requires a large amount of computation and time, and the finished index also occupies considerable disk space.
Pros: Kylin's cube precomputation significantly reduces the amount of data that has to be processed at query time, which makes it especially advantageous for very large data sets.
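To give a rough sense of the build and storage cost: with N dimensions and no pruning, a full cube contains one cuboid for every subset of the dimensions, that is 2^N cuboids. The sketch below only does this arithmetic. The function name cuboid_count is illustrative, and real cubes usually build fewer cuboids because Kylin offers optimizations such as aggregation groups.

```shell
#!/bin/sh
# Number of cuboids in an unpruned cube with N dimensions: 2^N.
# cuboid_count is a hypothetical helper name used only for this illustration.
cuboid_count() {
    n="$1"
    echo $((1 << n))
}

for n in 3 10 20; do
    echo "$n dimensions -> $(cuboid_count "$n") cuboids"
done
```

The count doubles with every added dimension, which is why the choice of dimensions has such a large effect on build time and disk usage.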
Kylin Architecture

Installation of Kylin
Single-node installation
Installing Kylin itself is simple, but Kylin runs on top of a big data cluster, so the following prerequisites must be met:
-Hadoop: 2.4+
-Hive: 0.13+
-HBase: 0.98+, 1.x
-JDK: 1.7+
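Before installing, it can help to confirm that the client tools for these prerequisites are available on the node. The following is a minimal sketch, assuming the hadoop, hive, hbase and java commands are expected on PATH. The helper name missing_tools is made up for this example, and the script only prints the versions for a manual comparison with the list above.

```shell
#!/bin/sh
# Report which of the given command-line tools are missing from PATH.
# Prints the missing names (space separated); prints nothing if all are found.
# missing_tools is a hypothetical helper, not part of Kylin.
missing_tools() {
    missing=""
    for tool in "$@"; do
        if ! command -v "$tool" >/dev/null 2>&1; then
            missing="$missing $tool"
        fi
    done
    echo "${missing# }"
}

result=$(missing_tools hadoop hive hbase java)
if [ -n "$result" ]; then
    echo "Missing from PATH: $result"
else
    # Print the first version line of each tool for a manual check.
    hadoop version 2>&1 | head -n 1
    hive --version 2>&1 | head -n 1
    hbase version 2>&1 | head -n 1
    java -version 2>&1 | head -n 1
fi
```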
Then download the binary package from http://kylin.apache.org/download/
Upload the binary package to a node
Switch to the target directory and unpack the file
tar -xvzf file
Configure the installation path in the profile
vi /etc/profile
export KYLIN_HOME=/path/
For example:
export KYLIN_HOME=/home/apache-kylin-2.2.0-bin
Start Kylin
cd to the directory where the package was unpacked and execute: ./bin/kylin.sh start
Stop the service:
Execute: ./bin/kylin.sh stop
Note: The user that runs these commands must have permission to create HBase tables.
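After the start command returns, the web application may still need some time before it answers requests. The helper below is a small sketch for waiting until a URL responds. It assumes curl is installed, and the function name wait_for_http, the attempt count and the delay are all illustrative choices rather than anything Kylin provides.

```shell
#!/bin/sh
# Poll a URL until it answers with an HTTP status below 400, or give up.
# Usage: wait_for_http URL [ATTEMPTS] [DELAY_SECONDS]
# wait_for_http is a hypothetical helper for this example.
wait_for_http() {
    url="$1"
    attempts="${2:-30}"
    delay="${3:-2}"
    i=0
    while [ "$i" -lt "$attempts" ]; do
        # -f: fail on HTTP errors, -s: silent, -o /dev/null: discard the body
        if curl -fs -o /dev/null "$url"; then
            return 0
        fi
        i=$((i + 1))
        sleep "$delay"
    done
    return 1
}
```

Assuming Kylin listens on port 7070 and serves its web interface under /kylin, a call such as wait_for_http http://localhost:7070/kylin && echo "Kylin is up" could follow ./bin/kylin.sh start.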

Multi-node Installation
The installation steps are the same as for the single-node installation, except that the configuration file needs to be modified on each node
Modify kylin.properties under conf
Because there can only be one job node, set
kylin.server.mode=all on only one node, and kylin.server.mode=query on the other nodes
kylin.server.cluster-servers=localhost:7070 lists the cluster members; add the address of every node to this line, for example
kylin.server.cluster-servers=192.168.3.251:7070,192.168.3.252:7070
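The two settings can also be applied with a small script instead of editing the file by hand. The sketch below works on a scratch copy so that it can be tried safely; set_property is a hypothetical helper, and it assumes the values contain no "|" or "&" characters, since those would interfere with the sed expression.

```shell
#!/bin/sh
# Set key=value in a properties file: replace the line if the key is
# already present, otherwise append it. Hypothetical helper for this example.
set_property() {
    file="$1"; key="$2"; value="$3"
    if grep -q "^${key}=" "$file"; then
        tmp="${file}.tmp.$$"
        sed "s|^${key}=.*|${key}=${value}|" "$file" > "$tmp" && mv "$tmp" "$file"
    else
        echo "${key}=${value}" >> "$file"
    fi
}

# Demonstrate on a scratch file that stands in for conf/kylin.properties.
conf=$(mktemp)
printf 'kylin.server.mode=all\nkylin.server.cluster-servers=localhost:7070\n' > "$conf"

# Settings for a query node in the two-node example above.
set_property "$conf" kylin.server.mode query
set_property "$conf" kylin.server.cluster-servers 192.168.3.251:7070,192.168.3.252:7070

cat "$conf"
```

On a real node the first argument would be the path to kylin.properties under conf, and the mode would stay all on the single job node.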
Installing the load balancer Nginx
To configure load balancing:
Nginx ("engine x") is a high-performance HTTP and reverse proxy server, and also an IMAP/POP3/SMTP proxy server. Nginx was developed by Igor Sysoev for Rambler.ru, the second most visited site in Russia, and the first public version, 0.1.0, was released on October 4, 2004. Its source code is published under a BSD-like license, and it is known for its stability, rich feature set, simple configuration files, and low consumption of system resources. Nginx 1.0.4 was released on June 1, 2011.
1 Download dependencies
wget https://ftp.pcre.org/pub/pcre/pcre-8.13.tar.gz
yum -y install make zlib zlib-devel gcc-c++ libtool openssl openssl-devel
OpenSSL may need to be installed separately
Installing PCRE (the commands below use pcre-8.38 as the file name; substitute the version you actually downloaded)
1 tar -zxvf soft/pcre-8.38.tar.gz
2 cd pcre-8.38/
3 ./configure
4 make
5 make install
Check whether the installation was successful:
Switch to the installation directory
pcre-config --version


2 Nginx download
wget http://nginx.org/download/nginx-1.5.9.tar.gz
1 tar -xzvf nginx-1.5.9.tar.gz
2 cd nginx-1.5.9
3 ./configure
4 make
5 make install
Check whether the installation was successful
cd /usr/local/nginx

If the directory exists, the installation was successful
3 Configuring Load Balancing:
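As a minimal sketch of what such a configuration might look like, the script below writes an Nginx fragment that balances requests across the two Kylin nodes from the cluster-servers example. The upstream name kylin_cluster, the output file name, and the use of ip_hash (so that a client keeps reaching the same node and its login session) are assumptions made for this illustration, not settings taken from the original article.

```shell
#!/bin/sh
# Write a load-balancing fragment for Nginx to a scratch location.
# The file name and the upstream name kylin_cluster are illustrative.
out="${TMPDIR:-/tmp}/kylin_upstream.conf"

cat > "$out" <<'EOF'
# Place inside the http { } block of nginx.conf, or include it from there.
upstream kylin_cluster {
    ip_hash;                      # assumption: keep each client on one node
    server 192.168.3.251:7070;
    server 192.168.3.252:7070;
}

server {
    listen 80;

    location / {
        proxy_pass http://kylin_cluster;
        proxy_set_header Host $host;
    }
}
EOF

echo "Wrote $out"
```

Once the fragment is part of the active configuration, /usr/local/nginx/sbin/nginx -t checks the syntax before Nginx is started or reloaded.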

Start
Start, stop, reload, and test
1) Start
# Method 1
/usr/local/nginx/sbin/nginx -c /usr/local/nginx/conf/nginx.conf
# Method 2
cd /usr/local/nginx/sbin
./nginx

2) Stop
# Find the Nginx master process ID
ps -ef | grep nginx
# Stop gracefully
kill -QUIT <master process ID>
# Stop quickly
kill -TERM <master process ID>
# Force stop
pkill -9 nginx

3) Reload (Nginx must already have been started with: /usr/local/nginx/sbin/nginx -c /usr/local/nginx/conf/nginx.conf)
/usr/local/nginx/sbin/nginx -s reload
For installation and configuration details, please refer to: http://www.linuxidc.com/Linux/2016-08/134110.htm
Access Nginx on port 80 and append the Kylin path to the URL, for example http://<nginx-host>/kylin



Use of Kylin
The default login account is ADMIN and the password is KYLIN (both upper case)
Please refer to the official website for details on using Kylin. Cubes are built in close conjunction with business scenarios, so cube design needs to be explained together with the business requirements.


