APACHE kylin™ Overview
Apache kylin™ is an open-source, distributed analytics engine that provides SQL query interface and Multidimensional Analysis (OLAP) capabilities on top of Hadoop to support hyper-scale data, originally developed and contributed by ebay Inc to the open source community. It can query large hive tables in sub-second.
What is Kylin?
-Extensible hyper-fast OLAP engine:
Kylin is designed to reduce the latency of billions of data queries on Hadoop
-Hadoop ANSI SQL Interface:
Kylin provides standard SQL support for most query functions for Hadoop
-Interactive query capabilities:
With Kylin, users can interact with Hadoop data in sub-second, providing better performance on the same data set than hive
-Multidimensional cubes (MOLAP cube):
Users can define data models and build cubes for more than tens of billions of datasets in Kylin
-Seamless integration with BI tools:
Kylin provides integration capabilities with BI tools such as tableau, which will soon provide integration with other tools
-Other Features:
-Job management and monitoring
-Compression and coding
-Incremental update
-Use HBase coprocessor
-Dinstinc count approximation algorithm based on Hyperloglog
-Friendly web interface to manage, monitor and use cubes
-Project and cube-level access control security
-Support LDAP
KYLIN Ecological Circle
Kylin Core: Kylin OLAP engine infrastructure, including metadata (Metadata) engine, query engine, job engine, and storage engine, including rest servers in response to client requests
Extensions: plugins that support additional features and features
Integration : integration with scheduling system, ETL, monitoring and other life cycle management systems
user interface: third-party user interface extended on top of Kylin core
Drive: ODBC and JDBC drivers to support different tools and products, such as tableau
APACHE KYLIN? Overview