A data warehouse is a subject-oriented, integrated, relatively stable (non-volatile) collection of data that reflects historical changes (time-variant) and is used to support management decisions.
(1) Subject-oriented: data in the warehouse is organized around particular subject domains.
(2) Integrated: data from the originally scattered source databases is processed and consolidated by the system to eliminate inconsistencies in the source data.
(3) Relatively stable: once data enters the data warehouse it is generally not modified; it only needs to be loaded and refreshed periodically.
(4) Reflects historical changes: with this information, the enterprise's development history and future trends can be analyzed and predicted quantitatively.
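The integration, periodic-load, and history properties above can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the two source systems (crm_rows, billing_rows), their field names, and the helpers integrate and load_snapshot are hypothetical and do not come from any particular product.

```python
from datetime import date

# Hypothetical extracts: two source systems encode the same attribute differently.
crm_rows = [{"customer_id": 1, "gender": "M"}, {"customer_id": 2, "gender": "F"}]
billing_rows = [{"cust": 3, "sex": "male"}, {"cust": 4, "sex": "female"}]

# One agreed encoding for the warehouse (assumed for this example).
GENDER_CODES = {"M": "male", "F": "female", "male": "male", "female": "female"}


def integrate(crm, billing):
    """Map both sources onto one schema with one gender encoding."""
    unified = [
        {"customer_id": r["customer_id"], "gender": GENDER_CODES[r["gender"]]}
        for r in crm
    ]
    unified += [
        {"customer_id": r["cust"], "gender": GENDER_CODES[r["sex"]]}
        for r in billing
    ]
    return unified


def load_snapshot(warehouse, rows, load_date):
    """Append a dated snapshot; rows already in the warehouse are not updated."""
    for r in rows:
        warehouse.append({**r, "load_date": load_date.isoformat()})


warehouse = []
# Two periodic loads: the January snapshot is kept when February arrives.
load_snapshot(warehouse, integrate(crm_rows, billing_rows), date(2024, 1, 1))
load_snapshot(warehouse, integrate(crm_rows, billing_rows), date(2024, 2, 1))
```

Because each load only appends rows stamped with a load_date, earlier snapshots stay available for trend analysis, which is roughly what "time-variant" refers to.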
The main differences between a database and a data warehouse are:
(1) A database is designed around transactions; a data warehouse is designed around subjects.
(2) A database generally stores online transaction data; a data warehouse generally stores historical data.
(3) A database is designed to avoid redundancy as far as possible; a data warehouse deliberately introduces redundancy.
(4) A database is designed for capturing data; a data warehouse is designed for analyzing data.
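The contrast in points (3) and (4) can be sketched with Python's built-in sqlite3 module. This is only an illustration under assumed names: the tables customers, orders, and sales_history and their columns are hypothetical, and a single in-memory SQLite database stands in for both the transactional system and the warehouse.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# Transaction-oriented side: normalized tables, each fact stored once.
cur.executescript("""
CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, region TEXT);
CREATE TABLE orders (
    order_id    INTEGER PRIMARY KEY,
    customer_id INTEGER REFERENCES customers(customer_id),
    order_date  TEXT,
    amount      REAL
);
INSERT INTO customers VALUES (1, 'north'), (2, 'south');
INSERT INTO orders VALUES
    (10, 1, '2024-01-05', 100.0),
    (11, 1, '2024-02-07', 50.0),
    (12, 2, '2024-02-09', 70.0);
""")

# Warehouse side: one wide table that deliberately repeats the region
# (and a derived month) on every row, so analysis needs no joins.
cur.executescript("""
CREATE TABLE sales_history AS
SELECT o.order_id,
       o.order_date,
       substr(o.order_date, 1, 7) AS month,
       c.customer_id,
       c.region,
       o.amount
FROM orders o
JOIN customers c ON c.customer_id = o.customer_id;
""")

# Analytical query over the redundant table: revenue per month and region.
monthly = cur.execute("""
    SELECT month, region, SUM(amount)
    FROM sales_history
    GROUP BY month, region
    ORDER BY month, region
""").fetchall()
conn.close()
```

The normalized tables suit capturing individual orders, while the redundant sales_history table trades storage for simpler, join-free analytical queries.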
Comparison of a database and a data warehouse: HBase vs. Hive