ETL is responsible for the distribution, heterogeneous data sources such as relational data, flat data files, such as the extraction of the temporary middle tier after the cleaning, transformation, integration, and finally loaded into the data warehouse or data mart, to become the basis of online analytical processing, data mining.
If the frequency of data conversion or not high requirements can be manually implemented ETL functions, on the contrary, if the requirements for data conversion is high, you need specialized ETL tools.
The advantages are as follows:
First: Simplifies the user's operation. Special ETL is usually through a graphical configuration, simple, flexible, so that users do not care about the various internal details of the database, but focus on the function.
Second: Support a variety of data sources, especially planar data sources. ETL In addition to supporting all common data sources, such as ORACLE,SQLSERVER,DB2,,MYSQL,ACCESS,VF, but also provides a variety of flat data sources, such as txt,excel,csv,xml support.
Third: Support more hardware and software platform, software platform such as operating system, support Windows,linux and domestic operating system, support all kinds of hardware platform, such as x86, godson and so on.
Four: More powerful, data processing components are very rich, more versatile, components easy to reuse
V: Provide flexible custom rules to better control data quality
Sixth: Provide powerful management functions, such as rights Management, log management