Data Warehouse is a subject-oriented (Subject oriented), integrated (integrate), relatively stable (non-volatile), data collection that reflects historical changes (time Variant). Used to support management decisions.
The so-called (1) topic-oriented: Index data in the warehouse is organized according to a certain subject domain.
(2) Integration: Refers to the original distributed database data through the system processing, collated to eliminate inconsistencies in the source data.
(3) Relative Stability: Once a data into the data warehouse only need to periodically load, refresh.
(4) Reflect historical changes: through this information, the enterprise's development process and future trends to make quantitative prediction.
Data Warehouse construction is a project, a process, not a product that can be purchased. The enterprise data processing method is the form of online transaction processing information, and the use of information for decision-making, in the information application process management information.
The advent of the Data warehouse is not to replace the database. At present, most of the data warehouse is still using the relational database pipe
Management system. The main difference between a data warehouse and a database is that:
(1) database is a transaction-oriented design, the Data Warehouse is subject-oriented design.
(2) database is generally stored online transaction data, Data Warehouse storage is generally historical data.
(3) database design is to avoid redundancy as far as possible, data warehouse design is intended to introduce redundancy.
(4) The database is designed for capturing data, and the Data Warehouse is designed to analyze the data.
The difference between a data warehouse and a database