Greenplum (GPDB) Is Now Open Source!
The Greenplum Database (GPDB) is a shared-nothing, massively parallel processing (MPP) database designed for large-scale data analysis tasks, including data warehousing, business intelligence (OLAP), and data mining. It uses an advanced cost-based query optimizer, is one of the most advanced open source databases currently available, and can query and analyze petabyte-scale data quickly and efficiently.
Greenplum, previously a commercial database built on PostgreSQL, has been officially open-sourced. Its source code is hosted on GitHub at https://github.com/greenplum-db/gpdb, so database enthusiasts can more easily study how some of its advanced SQL query and analysis features are implemented.
The Greenplum Database server software is an advanced, full-featured open source data warehouse. It provides powerful and efficient analysis capabilities for petabyte-scale data. For big data analytics in particular, Greenplum ships with what the project calls the world's most advanced cost-based query optimizer, which delivers higher query and analysis performance on large data sets.
The Greenplum open source project is now released under the Apache 2 license. Greenplum thanks its community contributors and other enthusiasts for the contributions they have made to the product. For the Greenplum community, every form of contribution is meaningful, and all forms of contribution are appreciated and encouraged.
"Open source massively parallel Data Warehouse"
About Greenplum Database
Greenplum is based on PostgreSQL and adds many important innovations for data warehousing workloads:
- Massively parallel processing architecture: Greenplum automatically parallelizes all data and queries.
- Petabyte-scale data loading: with MPP technology, high performance is maintained under heavy load, and up to 10 TB of data per rack can be loaded every hour.
- Innovative query optimizer: Greenplum provides the industry's first cost-based query optimizer designed for big data workloads. It supports both interactive and batch-mode analytics and scales them to petabyte-sized data sets without degrading query performance or data processing throughput.
- Polymorphic data storage and execution: storage, execution, and compression settings can be configured flexibly per table or partition to match the access pattern. Users can choose row-oriented or column-oriented storage and processing according to their needs.
- Advanced machine learning: with the Apache MADlib library, Greenplum's in-database analytics are extended through user-defined functions.
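
To make the shared-nothing, massively parallel idea more concrete, here is a minimal Python sketch of how a grouped sum could be computed when rows are spread across independent segments. It is a toy model only, not Greenplum's implementation: the segment count, the CRC32-based hash, and every function name below are assumptions made for illustration.

```python
from collections import defaultdict
from zlib import crc32

NUM_SEGMENTS = 4  # hypothetical number of segments, chosen for illustration


def segment_for(key, num_segments=NUM_SEGMENTS):
    """Pick a segment by hashing the distribution key (toy hash, not Greenplum's)."""
    return crc32(key.encode("utf-8")) % num_segments


def distribute(rows, num_segments=NUM_SEGMENTS):
    """Scatter (key, amount) rows so that each segment owns its own slice of the data."""
    segments = [[] for _ in range(num_segments)]
    for key, amount in rows:
        segments[segment_for(key, num_segments)].append((key, amount))
    return segments


def local_sum(segment_rows):
    """Work done independently on one segment: a partial SUM grouped by key."""
    partial = defaultdict(int)
    for key, amount in segment_rows:
        partial[key] += amount
    return dict(partial)


def gather(partials):
    """Coordinator step: merge the partial results returned by the segments."""
    total = defaultdict(int)
    for partial in partials:
        for key, amount in partial.items():
            total[key] += amount
    return dict(total)


def parallel_group_sum(rows, num_segments=NUM_SEGMENTS):
    """Scatter the rows, aggregate on each segment, then gather the results."""
    segments = distribute(rows, num_segments)
    return gather(local_sum(segment) for segment in segments)


rows = [("east", 10), ("west", 5), ("east", 7), ("north", 3), ("west", 1)]
result = parallel_group_sum(rows)
print(sorted(result.items()))  # [('east', 17), ('north', 3), ('west', 6)]
```

Because every row with the same key hashes to the same segment, each segment can aggregate its slice without talking to the others, and the final answer does not depend on how many segments there are. In a real MPP database the per-segment work runs on separate processes or hosts; here it runs sequentially to keep the sketch short.
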
Related links:
1. Greenplum source, documentation, and related information: http://greenplum.org/
2. Greenplum source code: https://github.com/greenplum-db
3. Website of Pivotal, the company that generously contributed the code: https://pivotal.io/big-data/pivotal-greenplum