good elasticity, support a variety of operating systems and database systems, can operate a variety of heterogeneous data sources;
Open Architecture and API. has an open architecture and easy-to-use two-time development interface.
Currently more well-known open source ETL
1, Ali Open source software: datax
Datax is a heterogeneous data source offline Synchronization tool that is dedicated to achieving stable and efficient data synchronization between heterogeneous data sources including relational databases (MySQL, Oracle, etc.), HDFS, Hive, ODPS, HBase, FTP, and more. (Excerpt from Wikipedia)
2. Apache
ETL Tool Pentaho Kettle's transformation and job integration
1. Kettle
1.1. Introduction
Kettle is an open-source etl Tool written in pure java. It extracts data efficiently and stably (data migration tool ). Kettle has two types of script files: transformation and job. tran
Label:The first knowledge Talend, the feeling function is very powerful, can synchronize many kinds of databases, simultaneously can clean, the filter, the Java Code processing data, the data import and export.Talend is an open source software for ETL (data extraction extract, transfer transform, load load) for the data integration tools market. Talend provides a
Pentaho
Pentaho is the world's most popular open-source business intelligence software. It is a workflow-oriented Bi suite that focuses on solutions rather than tool components. It integrates multiple open-source projects, the go
relatively large frameworks, integrated with a considerable number of open-source projects, jfreereport, Mondrian, kettle, WEKA are basically used. It is particularly suitable for the development of large-scale and complex projects.
PentahoIn China, there are a lot of users and more documents. In particular, it is worth mentioning that on the Internet his Chinese support is quite good, and many vol
, conversion, and loading (ETL) operations and support the increasing data warehouses, provides online analysis and extended report analysis functions.
Undoubtedly, you can integrate many of these features with different open-source software products. ETL products such as Pentaho
the search.
Quick response to complex aggregate class queries: For complex analytical SQL queries such as SUM, COUNT, AVG, GROUP by
Infobright the value
Save design overhead. No complex Data Warehouse model design requirements (such as star model, snowflake model), no need materialized views, data partitioning, index building
Conserve storage resources. High compression ratios are usually 10:1, and some applications can reach 40:1
Integrated utilization is extensive. Compat
the block, instead of indexing, to accelerate search.
Quick response to complex aggregate queries: Suitable for complex analytical SQL queries, such as SUM, COUNT, AVG, and GROUP
InfobrightValue
Save design costs. No complex data warehouse model design requirements (such as star model and snowflake model), no materialized view, Data Partition, and index creation required
Save storage resources. The high compression ratio is usually, and some applications may reach 40: 1
Integration is
avoid the problem of maintaining indexes and indexes expanding with data. Data in each column is compressed and stored in blocks. Each Knowledge Grid node records the statistical information in the block, instead of indexing, to accelerate search.
Quick response to complex aggregate queries: suitable for complex analytical SQL queries, such as SUM, COUNT, AVG, and GROUP
InfobrightValue
Save design costs. No complex data warehouse model design requirements (such as Star model and snowflake
open source database software in the world. It can run in almost any operating system environment and drag from one platform to another without any configuration changes. MySQL is applicable to enterprise-level applications, Internet websites, and Zenoss. It is comparable to the most expensive commercial relational database system.
8. Pentaho
applicable to enterprise-level applications, Internet websites, and zenoss. It is comparable to the most expensive commercial relational database system.
8. pentaho
Pentaho is a commercial company that provides open-source business intelligence product community versions. Their products can be used for free, developed
own products with several Hadoop partners, including the NoSQL company. In 2012, the project achieved the desired financing results while also gaining the favor of customers. Although Pentaho has become famous in particular circles, it should also be a new and noteworthy business Intelligence project for most friends who are unfamiliar with the market.
PostgreSQL
Although for some time since the NoSQL database criticism of the sound of the upro
for OLAP and dynamic reports written in Java/J2EE. it combines static reports (based on jasperreports), a swing tables for OLAP analysis, and charts (based on jfreechart ). it reads from external data sources as SQL, Excel, XML, and others, and produces export outputs as PDF, XML, and application specific files for later off-line visualization of reports.
8. freereportbuilder
Freereportbuilder is a Java report tool that can work with any database that has a JDBC driver.
9. openreports
Op
1. jasperreports is a Java-based open-source report tool that can be used to create reports in the Java environment like other ide report tools. Jasperreports supports PDF, HTML, xls, CSV, and XML file output formats. Jasperreports is currently the most common reporting tool for Java developers.
2. pentaho is a workflow-oriented Bi suite that focuses on solutions
features they want. In addition, they are concerned about program quality and version control. Open-source software is developed based on communities, so it is updated frequently.
Other enterprises have similar experiences. They do not want to re-develop the Bi platform, but integrate mature platform products.
However, for many other ISVs, the price of open-
Opentaps open source ERP + CRM is a well-designed and popular CRM system built based on Apache OFBiz (the open for business project. This project provides many Chinese documents.
Main features:1. Provides a comprehensive suite that allows you to grasp your business situation 360 degrees. From customer to order to inventory to finance. Opentaps is a complete set
Infobright is a column-based database based on unique patented knowledge grid technology. Infobright is an open-source MySQL Data Warehouse solution that introduces the column storage solution, high-strength data compression, and optimized Statistical Computing (Class
Infobright is a column-based database based on unique patented knowledge grid technology. Infobright is an
Summary of open-source Bi KIT tools
There are a lot of commercial Bi suites. Similarly, there are also a lot of open-source Bi suites, but they are not shared, so many excellent Bi suites are not used. The open-source B
The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion;
products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the
content of the page makes you feel confusing, please write us an email, we will handle the problem
within 5 days after receiving your email.
If you find any instances of plagiarism from the community, please send an email to:
info-contact@alibabacloud.com
and provide relevant evidence. A staff member will contact you within 5 working days.