The Lens provides a unified data analysis interface. Data Analysis task segmentation is achieved by providing a single view across multiple data stores while optimizing the environment for execution. Seamless integration of Hadoop implements functionality similar to traditional data warehouses.
Key features of the project:
The simple metadata layer provides an abstract view layer for data storage
A single shared-mode server, based on Hive meta-storage. Patterns are shared through data pipeline Hcatalog and analytics applications:
OLAP Cube QL SQL-like high-level language for querying and describing datasets stored in different data cubes (Cubes)
JDBC Driver and Java client library to process queries
Lens Application Server -This is a REST server that allows users to query data, change the data model, schedule queries and quota limits for queries
driver-based architecture allows embedding in report systems such as Hive, column data storage, Redshift, etc.
engine selection based on cost algorithm -this algorithm optimizes the use of resources and automatically selects the best execution engine by the complexity of the query.
The architecture of Apache Lens is as follows:
Apache Open Source Project--lens