[Post] Open-source Bi system Classification

Source: Internet
Author: User
Open-source Bi SYSTEM

Directory

Open-source Bi system Classification
Bi application tools
  1. ETL tools
  2. Table tools
Eclipse Birt
OLAP tools
Open source database
Open-source Bi suite
  1. Bizgre
  2. Openi
  3. Pentaho
  4. Spagobi
Open-source Bi system Classification
Bi application tools
  1. ETL tools
  2. Table tools
Eclipse Birt
OLAP tools
Open source database
Open-source Bi suite
  1. Bizgre
  2. Openi
  3. Pentaho
  4. Spagobi
Expand
Open-source Bi system Classification

In terms of scale and the improvement of Bi system support, these many Bi projects can be divided into three types: Framework, stand-alone tools, and Bi suit.

Framework

Open-source framework, which is not available in commercial BI systems. We can use them to build our own Bi tools or to enhance and expand our BI solutions.

Stand-alone tools

An independent bi tool is one of the most open-source projects. Many tools only focus on some links and aspects of the Bi system, such as ETL, report, OLAP, and database.

Bi suit

A collection of tools that provide features for multiple BI systems under a unified architecture. As far as the current situation is concerned, no package provides a complete end-to-end Bi solution for both commercial and open-source software. These open-source Bi suit forms a suite by connecting multiple other components and tools. Because the Bi system involves a lot of tools, therefore, it is very difficult to integrate a complete set of BI solutions.

Tools in BI solutions

A complete Bi solution has multiple tools to complete the work at various stages of the Bi system.

Edit this section Bi application tools ETL tools

Data extraction, conversion, and loading tools. Excellent ETL tools should have the following features:

1. Workflow Management, job execution and scheduling manager. You can easily define processes and automate ETL tasks.

2. centralized metadata repository and management. Centralized storage and management of industry-standard metadata.

3. Data Profile and validation. Data quality can be verified.

4. High Performance. It still has good performance in the execution of heavy-load tasks.

5. scalable and platform independent. It has good elasticity, supports a variety of operating systems and database systems, and can operate a variety of heterogeneous data sources.

6. Open Architecture and API. It has an open architecture and an easy-to-use secondary development interface.

Well-known open-source ETL tools include:

1. ketl, developed by Kinetic networks with the IBM and KPMG backgrounds, has more than three years of product application history. It has been successfully applied to some products. In clickstream) outstanding Performance in analysis applications. Ketl adopts the plug-in architecture and uses Java for development.

2. Kettle is a metadata-driven ETL tool. Already added pentaho.

3. Clover ETL is a Java-based ETL framework that can be used to develop your own ETL applications.

4. enhydra octopus is a Java-based ETL tool that uses JDBC to connect to various data sources for ease of use and deployment. It has been used in telecommunication network resource analysis systems.

Table tools

Excellent reporting tools generally have the following features:

1. multiple data sources are supported.

2. intuitive visual designer and easy-to-use report customization functions.

3. convenient data access and formatting and rich data presentation methods.

4. meets the general data presentation standards, capabilities and applicationsProgramGood combination.

5. Easy to scale and deploy.

Well-known open-source reporting tools include:

1. jasperreports, an excellent Java report tool, started in 2001. Now jaspersoft continues to develop and support this tool. Similar to the commercial software Crystal Report, this tool supports PDF, HTML, xls, CSV, and XML file output formats. Now it is the most commonly used report tool for Java developers.

2. openreports: provides a web-based flexible report solution that automatically generates dynamic PDF, xls, htmlcsv, and chart reports in a browser. It is developed in Java and uses jasperreports as the report engine, the open-source technologies used include hibernate, veloctiy, and webwork.

3. jfreereport, now part of pentaho, is an excellent Java class library used to generate reports. It provides a flexible printing function for Java applications and supports output to printers and PDF, Excel, HTML and XHTML, plaintext, XML and CSV files.

Edit this section Eclipse Birt

Is an enterprise intelligence and report tool under eclipse. It can create beautiful and eye-catching reports in PDF or HTML format for J2EE Web applications. It provides core report functions.

Edit this section OLAP tools

Online analysis and processing tools. Currently, open-source OLAP tools include molap, ROLAP, and holap. Excellent OLAP tools generally have the following features:

1. Good execution performance and quick analysis and processing.

2. good applicability and scalability.

3. open interfaces and rich APIs.

Well-known open-source OLAP tools include:

1. Mondrian is part of pentaho. As an OLAP server developed using Java, it implements MDX, XML parsing, and jolap specifications, you can analyze the large data sets stored in the SQL database without writing SQL, encapsulate the JDBC data source, and display the data in multiple dimensions.

2. July is a JSP custom tag library that can draw an OLAP table and chart. You can perform typical OLAP navigation, such as drill, slice, and square. It uses Mondrian as its OLAP server. It uses WCF (Web Component Framework) to render Web UI components based on XML/XSLT. July's overly simplified overall initialization loading in metadata cache will limit that it can only handle smallCubeBody (cube ).

Edit this section Open source database

There are also many open-source databases, most of which are relational databases, and a few of which are specially optimized for the data warehouse environment. Bizgres optimizes the data warehouse environment based on PostgreSQL, improving the analysis and query performance.

Edit this section Open-source Bi suite

The following lists open-source Bi suites that are relatively mature and complete and have reference significance.

Bizgre

Bizgree is an open-source project led by greenplum and has a partnership with sun. Bizgres is B

I optimized PostgreSQL for application and improved the parallel computing capability of high load. In the Bi environment, PostgreSQL has superior data processing performance compared with common relational databases. Bizgres's database platform can be integrated with ketl and jasperreports to form a bi suite:

1. Database: Bi professional database bizgres, or bizgres MPP, a high-performance server in large applications, which is 20 times faster than normal relational databases

2. ETL Tool: ketl

3. Report tool: jasperreports

Openi

Openi is a Java-developed web application that analyzes and displays reports on OLAP servers, relational databases, and data mining servers, making it easy to use and deploy. The interface is beautiful and friendly, data Mining and ETL will be supported in the future. Openi mainly includes:

1. OLAP Presentation: July

2. Report tool: jfreechart

3. analyze the data source Connector

Pentaho

Pentaho is a workflow-oriented Bi suite that focuses on solutions rather than tool components. It integrates multiple open-source projects to compete with commercial bi. It includes:

1. workflow engine: Shark and JaWE

2. Database: Firebird RDBMS

3. Integrated Management and Development Environment: Eclipse

4. Report tool: Eclipse Birt

5. ETL Tool: enhydra/kettle

6. OLAP Server: Mondrian

7. OLAP Presentation: July

8. Data Mining Component: WEKA

9. Application Server and Portal Server: JBoss

10. Single Sign-on service and LDAP authentication: josso

11. Support for custom scripts: Mozilla rhino Javascript script Processor

It can be seen from the above that pentaho is a perfect Bi solution. Pentaho tends to be a bi solution combined with business processes, focusing on applications of large and medium-sized enterprises.

Spagobi

Spagobi integrates mondrain and jprovit to generate real-time reports through openlaszlo. Spagobi is developed using Java and has strong scalability without relying on specific operating systems. It mainly includes:

1. Report tool: jasperreports/eclipse Birt/ireport

2. OLAP Server: Mondrian

3. OLAP Presentation: July

4. Data Mining Component: WEKA

5. Map engine: Geo

6. ETL: BIE

7. Search Engine: Lucene

8. dashboard: openlaszlo

9. Portal Server: JBoss/tomcat/Jonas

According to its roadmap, we can see that spagobi will integrate more bi functions, and even functions other than bi.

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.