MPP Schema Database

Source: Internet
Author: User

Greenplum is a distributed database based on PostgreSQL . It uses the shared nothing architecture (MPP), the host, the operating system, the memory, the storage is self-controlled, there is no sharing. That is, each node is a separate database. The information interaction between nodes is realized through the node internetwork. By distributing data across multiple nodes to achieve storage of scale data, the query performance can be improved by parallel query processing.

How this feels is like organizing small databases together into a large database. The data is fragmented and stored on each node. Each node only queries its own data. The results obtained are then processed by the master node to obtain the final result. The linear expansion of the system is achieved by increasing the number of nodes.

??

Summary-New the value of the MPP database

??

Technology: A new type of database based on Columnstore +mpp architecture has great difference from traditional database in core technology, and it is designed and developed for structural data analysis, and can deal with petabytes of data effectively. Technology for many industry users to solve the problem of data processing performance.

??

User value: The new database is run on the x-86 PC server, which can greatly reduce the cost of data processing (1 orders of magnitude).

??

Future trends: The new database will gradually The Hadoop ecosystem, combined with a mix and match, uses MPP to process petabyte -level, high-quality structured data while providing rich SQL and transactional support for applications; with Hadoop achieves semi-structured and unstructured data processing. This can meet the needs of structured, semi-structured, and unstructured data processing.

??

?

Big Data focuses more on user behavior, group trends, correlations between events, and more than just past KPIs. This puts forward new requirements and challenges for the data analysis platform to analyze the ability and performance of the data.

??

?

The MPP (massively Parallel processing), massively parallel processing system, is made up of a number of loosely coupled processing units, and it is important to note that this refers to the processing unit rather than the processor. The CPU in each unit has its own private resources, such as bus, memory, hard disk, etc. Each unit has an operating system and an instance replica of the management database. The biggest feature of this structure is the non-sharing of resources.

features that the MPP schema database should have:

task parallel execution ;

Data Distributed Storage ( localization );

distributed computing ;

Private Resources ;

horizontal expansion ;

Shared Nothing schema.

??

Organize from the network

MPP Schema Database

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.