Postgres-XL: a scalable, open-source Big Data cluster database based on PostgreSQL
http://www.postgres-xl.org/
Postgres-XL is a horizontally scalable, open-source SQL database cluster based on PostgreSQL. It is flexible enough to handle a variety of database workloads:
Write-intensive OLTP workloads
Business intelligence that requires MPP parallelism
Operational data stores
Key-value stores
GIS and geospatial data
Mixed-workload environments
Multi-tenant, provider-hosted environments
Scalability
Postgres-XL (eXtensible Lattice) lets you either partition tables across multiple nodes or replicate them. Partitioning (or distributing) a table provides write scalability across multiple nodes, as well as massively parallel processing (MPP) for Big Data workloads.
Replicated tables typically hold static data that does not change frequently. Replicating the data provides read scalability.
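The difference between a partitioned and a replicated table can be sketched in a few lines of Python. This is only an illustration of the idea, not how Postgres-XL is implemented: the node names, the helper functions, and the use of CRC32 as the hash are all assumptions made for the example. In Postgres-XL itself the choice is made per table with the DISTRIBUTE BY clause of CREATE TABLE.

```python
import zlib

NODES = ["datanode1", "datanode2", "datanode3"]  # hypothetical node names


def node_for_key(key, nodes=NODES):
    """Pick the node that owns a row by hashing its distribution key."""
    # crc32 is used because it is stable across runs; it is a stand-in,
    # not the hash function Postgres-XL uses internally.
    return nodes[zlib.crc32(str(key).encode("utf-8")) % len(nodes)]


def distribute(rows, key, nodes=NODES):
    """Partitioned table: each row is stored on exactly one node."""
    placement = {node: [] for node in nodes}
    for row in rows:
        placement[node_for_key(row[key], nodes)].append(row)
    return placement


def replicate(rows, nodes=NODES):
    """Replicated table: every node stores a full copy."""
    return {node: list(rows) for node in nodes}


# A large, frequently written table is partitioned by its key.
orders = [{"order_id": i, "amount": 10 * i} for i in range(1, 10)]
# A small, mostly static lookup table is replicated.
countries = [{"code": "DE"}, {"code": "JP"}]

partitioned = distribute(orders, "order_id")
replicated = replicate(countries)

for node in NODES:
    print(node, len(partitioned[node]), "order rows,",
          len(replicated[node]), "country rows")
```

Writes to the partitioned table touch a single node each, which is where the write scalability comes from. Reads of the replicated table can be served by any node, at the cost of applying every change on all of them.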
Fully ACID
Postgres-XL is a fully ACID-compliant transactional database. Not only does it give you a fully consistent view at all times, it also uses cluster-wide multi-version concurrency control (MVCC).
When you start a transaction or query in Postgres-XL, you see a consistent version of the data across the entire cluster. While you read data in one connection, another connection can update the same table without any locking. Thanks to global transaction identifiers and snapshots, each connection works with its own versions of the rows. Readers do not block writers, and writers do not block readers.
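The snapshot behaviour described above can be illustrated with a toy model. The sketch below is a deliberately simplified, single-row, single-writer version of MVCC written for this explanation; the class and field names (ToyStore, RowVersion, xmin, xmax) are illustrative assumptions and the visibility rule is far simpler than the one a real PostgreSQL-based system applies.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class RowVersion:
    value: str
    xmin: int                   # transaction that created this version
    xmax: Optional[int] = None  # transaction that replaced it, if any


class ToyStore:
    """Single-row store with global transaction ids and snapshots."""

    def __init__(self):
        self.next_xid = 1
        self.committed = set()
        self.versions = []

    def begin(self):
        xid = self.next_xid
        self.next_xid += 1
        return xid

    def snapshot(self):
        # A snapshot freezes the set of transactions committed so far.
        return frozenset(self.committed)

    def write(self, xid, value):
        # Mark the newest version as replaced and append a new one.
        # No lock is taken, so readers are never blocked.
        if self.versions:
            self.versions[-1].xmax = xid
        self.versions.append(RowVersion(value, xmin=xid))

    def commit(self, xid):
        self.committed.add(xid)

    def read(self, snap):
        # A version is visible if its creator is committed in the snapshot
        # and its replacer (if any) is not.
        for version in self.versions:
            if version.xmin in snap and (
                version.xmax is None or version.xmax not in snap
            ):
                return version.value
        return None


store = ToyStore()
t1 = store.begin()
store.write(t1, "v1")
store.commit(t1)

reader_snap = store.snapshot()            # reader starts here
t2 = store.begin()
store.write(t2, "v2")                     # writer updates concurrently
before_commit = store.read(reader_snap)
store.commit(t2)
after_commit = store.read(reader_snap)    # same snapshot, same answer
fresh = store.read(store.snapshot())      # a new snapshot sees the update
```

The reader keeps seeing the version that was current when its snapshot was taken, while the writer proceeds without waiting. Only a connection that takes a new snapshot after the commit sees the updated value.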
Components
- Global Transaction Monitor (GTM)
The Global Transaction Monitor ensures cluster-wide transactional consistency. The GTM is responsible for issuing transaction IDs and snapshots as part of its multi-version concurrency control.
The cluster can optionally be configured with a standby GTM to improve availability. In addition, a GTM proxy can be configured at the coordinators to improve scalability by reducing the volume of traffic to the GTM.
- Coordinator
The coordinator manages user sessions and interacts with the GTM and the data nodes. It parses and plans queries, then sends a serialized global plan to each component involved in the statement.
- Data Node
The data node is where the data is actually stored. The DBA can configure how the data is distributed. To improve availability, standby data nodes can be configured so that they are ready for failover.
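How the three components cooperate can be sketched as a small Python model. Everything in it is an assumption made for illustration: the class names (ToyGTM, ToyCoordinator, ToyDataNode), the CRC32-based routing, and the single aggregate query stand in for the real protocol, which exchanges serialized plans and snapshots rather than Python method calls.

```python
import itertools
import zlib


class ToyGTM:
    """Hands out cluster-wide transaction ids from one place."""

    def __init__(self):
        self._xids = itertools.count(1)

    def next_xid(self):
        return next(self._xids)


class ToyDataNode:
    """Stores its share of the rows and runs the plan fragment it is sent."""

    def __init__(self, name):
        self.name = name
        self.rows = []

    def partial_sum(self, column):
        return sum(row[column] for row in self.rows)


class ToyCoordinator:
    """Routes writes by hash and combines per-node results for reads."""

    def __init__(self, gtm, nodes):
        self.gtm = gtm
        self.nodes = nodes

    def insert(self, row, key):
        # Every write gets its transaction id from the single GTM.
        xid = self.gtm.next_xid()
        index = zlib.crc32(str(row[key]).encode("utf-8")) % len(self.nodes)
        self.nodes[index].rows.append(dict(row, xid=xid))
        return xid

    def total(self, column):
        # Each node aggregates locally; the coordinator adds the partials.
        return sum(node.partial_sum(column) for node in self.nodes)


gtm = ToyGTM()
nodes = [ToyDataNode(f"dn{i}") for i in range(1, 4)]
coordinator = ToyCoordinator(gtm, nodes)

for order_id in range(1, 7):
    coordinator.insert({"order_id": order_id, "amount": 10 * order_id},
                       key="order_id")

grand_total = coordinator.total("amount")
print(grand_total)
```

The client only ever talks to the coordinator. The data nodes do the bulk of the work in parallel on their own rows, and the GTM stays a small central service that orders transactions across the whole cluster.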