In this part, following the process of designing and implementing an ETL system, the subsystems described in the previous part are classified under four aspects: extracting data, cleaning and conforming, delivering to the presentation layer, and managing the ETL environment. (Isn't learning ETL mainly a matter of mastering these four areas?) In the table below, an asterisk marks each aspect that a step involves.
| ETL processing step | Extract data | Clean and conform | Deliver to presentation layer | Manage ETL environment |
|---|---|---|---|---|
| Plan | | | | |
| Create a high-level, single-page source-to-target flow diagram | * | | | |
| Test, select, and implement an ETL tool | | | | |
| Design default policies for dimension management, error handling, and other related processes | * | * | * | |
| Drill down by target table, sketch any complex data restructuring or transformations graphically, and design a preliminary job sequence | | * | * | * |
| Design the one-time historical load process | | | | |
| Build and test the historical fact table load, including surrogate key lookup and substitution | * | * | * | |
| Design the incremental load process | | | | |
| Build and test the dimension table incremental load process | * | * | * | |
| Build and test the fact table incremental load process | * | * | * | |
| Build and test aggregate table loads and OLAP processing | | | * | |
| Design, build, and test ETL system automation | | * | * | * |
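The historical fact table step mentions looking up and substituting surrogate keys. As a rough illustration only, here is a minimal in-memory sketch of that idea in Python. The function names (`build_key_map`, `substitute_keys`), the column names, and the sample rows are all hypothetical and do not come from any particular ETL tool. Sending fact rows with an unknown natural key to a reject list is just one possible default error-handling policy, assumed here for simplicity.

```python
def build_key_map(dimension_rows, natural_key):
    """Assign a sequential surrogate key (1, 2, 3, ...) to each distinct natural key."""
    key_map = {}
    for row in dimension_rows:
        # setdefault leaves an existing key untouched, so duplicates are ignored.
        key_map.setdefault(row[natural_key], len(key_map) + 1)
    return key_map


def substitute_keys(fact_rows, key_maps):
    """Replace natural keys in fact rows with surrogate keys.

    key_maps maps each natural key column to a pair:
    (surrogate key column name, {natural key: surrogate key}).
    Rows whose natural key is missing from a map are rejected, not loaded
    (an assumed policy; a real system might create a placeholder member instead).
    """
    loaded, rejected = [], []
    for row in fact_rows:
        out = dict(row)  # work on a copy so the source row stays intact
        try:
            for nat_col, (sk_col, key_map) in key_maps.items():
                out[sk_col] = key_map[out.pop(nat_col)]
        except KeyError:
            rejected.append(row)
            continue
        loaded.append(out)
    return loaded, rejected


# Hypothetical sample data.
customers = [{"customer_id": "C-17"}, {"customer_id": "C-42"}]
products = [{"sku": "A1"}, {"sku": "B2"}]
key_maps = {
    "customer_id": ("customer_key", build_key_map(customers, "customer_id")),
    "sku": ("product_key", build_key_map(products, "sku")),
}
sales = [
    {"customer_id": "C-42", "sku": "A1", "amount": 30.0},
    {"customer_id": "C-99", "sku": "B2", "amount": 12.5},  # unknown customer
]

loaded, rejected = substitute_keys(sales, key_maps)
print(loaded)
print(rejected)
```

In a real load the key maps would normally be read from the dimension tables that were loaded first, which is one reason the job sequence needs to be designed before the fact table load is built.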
Design and Development of the ETL System (I): ETL Process Overview