I have recently been learning about BI. Since my main focus is the data warehouse, I am starting with the basic concepts :)
BI
Strictly speaking, BI (business intelligence) is not a new technology. It combines technologies such as data warehousing (DW), online analytical processing (OLAP), and data mining (DM) with applications such as customer relationship management (CRM), and applies them to actual business activities so that technology serves decision-making. Mark Hammond looks at BI from a management perspective and holds that BI is "fundamentally helping you transform your company's operational data into valuable, accessible information (or knowledge), and delivering the appropriate information to the appropriate person at the appropriate time by the appropriate means".
ETL
ETL is the process of data extraction (Extract), transformation (Transform), and loading (Load). It is an important part of building a data warehouse. A data warehouse is a subject-oriented, integrated, stable (non-volatile), and time-variant collection of data that supports the decision-making process in business management. A data warehouse system may contain a large amount of noisy data, mainly caused by misused abbreviations, idioms, data entry errors, duplicate records, missing values, and spelling variations. Even a well-designed and well-planned database system is of little use if it contains a large amount of noisy data: because of "garbage in, garbage out", the system cannot provide any support for decision analysis. To remove noisy data, the data in the database system must be cleaned. At present there is a good deal of research on data cleansing and on ETL separately, but not much on how to clean data effectively within the ETL process and how to visualize that process.
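To make the three ETL stages and the cleaning step more concrete, here is a minimal sketch in Python. The sample data, the column names, the `CITY_ALIASES` table, and the function names are all assumptions made up for illustration. A real pipeline would read from source systems and write to warehouse tables, and would probably log rejected rows instead of silently skipping them.

```python
import csv
import io

# Hypothetical raw export; the columns and values are made up for illustration.
RAW_CSV = """customer,city,amount
Alice ,New York,100
alice,new york,100
Bob,NYC,
Carol,Boston,250
"""

# Assumed lookup table that maps abbreviations to one canonical spelling.
CITY_ALIASES = {"nyc": "new york"}


def extract(text):
    """Extract: read the raw rows from a CSV source."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: normalize spelling, skip missing values, drop duplicates."""
    seen = set()
    cleaned = []
    for row in rows:
        customer = row["customer"].strip().lower()
        city = row["city"].strip().lower()
        city = CITY_ALIASES.get(city, city)  # expand known abbreviations
        amount = row["amount"].strip()
        if not amount:  # missing value: skip the record
            continue
        key = (customer, city, amount)
        if key in seen:  # duplicate record after normalization
            continue
        seen.add(key)
        cleaned.append({"customer": customer, "city": city, "amount": float(amount)})
    return cleaned


def load(rows, warehouse):
    """Load: append the cleaned rows to the target (a plain list here)."""
    warehouse.extend(rows)
    return warehouse


warehouse = load(transform(extract(RAW_CSV)), [])
```

In this sketch the two spellings of the first customer collapse into one record, and the row with no amount is rejected, so only two of the four raw rows reach the warehouse.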
Online Transaction Processing (OLTP) and Online Analytical Processing (OLAP)
The concept of OLAP was first proposed in 1993 by E. F. Codd, the father of the relational database, who also put forward 12 rules for OLAP. The proposal drew a strong response, and OLAP, as a class of products, became clearly distinguished from online transaction processing (OLTP).
Today's data processing can be roughly divided into two categories: online transaction processing (OLTP, On-Line Transaction Processing) and online analytical processing (OLAP, On-Line Analytical Processing). OLTP is the main application of traditional relational databases and mainly handles basic, day-to-day transactions, such as bank transactions. OLAP is the main application of data warehouse systems: it supports complex analytical operations, focuses on decision support, and provides intuitive, easy-to-understand query results.
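The difference between the two workloads can be sketched with Python's built-in `sqlite3` module. The `accounts` and `transfers` tables and the `deposit` helper are hypothetical, and a real data warehouse would normally be a separate system from the transactional database. The point here is only the shape of the queries: short keyed writes versus a read-only aggregate.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE accounts (id INTEGER PRIMARY KEY, owner TEXT, balance REAL)")
conn.execute(
    "CREATE TABLE transfers ("
    "id INTEGER PRIMARY KEY, account_id INTEGER, month TEXT, amount REAL)"
)
conn.executemany(
    "INSERT INTO accounts VALUES (?, ?, ?)",
    [(1, "alice", 500.0), (2, "bob", 300.0)],
)


def deposit(account_id, month, amount):
    """OLTP-style work: a short transaction that touches a few rows by key."""
    with conn:  # commits on success, rolls back if an exception is raised
        conn.execute(
            "UPDATE accounts SET balance = balance + ? WHERE id = ?",
            (amount, account_id),
        )
        conn.execute(
            "INSERT INTO transfers (account_id, month, amount) VALUES (?, ?, ?)",
            (account_id, month, amount),
        )


deposit(1, "2024-01", 100.0)
deposit(2, "2024-01", 50.0)
deposit(1, "2024-02", 25.0)

# OLAP-style work: a read-only query that scans many rows and aggregates them.
monthly_totals = conn.execute(
    "SELECT month, SUM(amount) FROM transfers GROUP BY month ORDER BY month"
).fetchall()
alice_balance = conn.execute("SELECT balance FROM accounts WHERE id = 1").fetchone()[0]
```

Each `deposit` call changes one account and records one transfer, while `monthly_totals` summarizes the whole transfer history in a single query.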
OLAP is a kind of software technology that enables analysts, managers, and executives to access information quickly, consistently, and interactively from multiple perspectives, in order to gain a deeper understanding of the data. OLAP is designed to meet decision-support needs, as well as specific query and reporting requirements, in a multidimensional environment. Its technical core is the concept of a "dimension".
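As a rough illustration of what a "dimension" means in practice, the sketch below treats each sales record as a point described by three dimensions (region, product, quarter) plus one measure (sales). The data and the names `FACTS`, `roll_up`, and `slice_cube` are assumptions for this example and do not belong to any particular OLAP product.

```python
from collections import defaultdict

# Hypothetical fact rows: (region, product, quarter, sales)
FACTS = [
    ("east", "laptop", "Q1", 120),
    ("east", "phone", "Q1", 80),
    ("west", "laptop", "Q1", 60),
    ("east", "laptop", "Q2", 150),
    ("west", "phone", "Q2", 90),
]
DIMENSIONS = ("region", "product", "quarter")


def roll_up(facts, keep):
    """Sum the measure over every dimension that is not listed in `keep`."""
    idx = [DIMENSIONS.index(d) for d in keep]
    totals = defaultdict(int)
    for row in facts:
        totals[tuple(row[i] for i in idx)] += row[-1]
    return dict(totals)


def slice_cube(facts, dimension, value):
    """Keep only the facts where one dimension is fixed to a single value."""
    i = DIMENSIONS.index(dimension)
    return [row for row in facts if row[i] == value]


# The same facts viewed from different perspectives.
by_region = roll_up(FACTS, ["region"])
by_product_quarter = roll_up(FACTS, ["product", "quarter"])
q1_by_region = roll_up(slice_cube(FACTS, "quarter", "Q1"), ["region"])
```

Here `by_region` answers "how much did each region sell?", while `q1_by_region` asks the same question for the first quarter only. Looking at one set of facts along different dimensions is the multi-perspective access described above.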