"Data Mining concepts and technologies" reading notes-Introduction to Chapter I.

Source: Internet
Author: User

1.1 Why Data Mining

Data mining transforms large datasets into knowledge.

A data warehouse is a multi-heterogeneous data source that organizes storage in a single site in a unified pattern to support management decisions.

Online analytical Processing (OLAP) is an analytical technique that has the ability to summarize, merge, and aggregate and to view information from different angles. (Note: Different from online transaction processing OLTP)

1.2 What is Data mining

Data mining is the process of mining interesting patterns and knowledge from a large amount of data.

Data mining Process:

    1. Data cleansing (eliminating noise and deleting inconsistent data)
    2. Data integration (multiple data sources can be grouped together)
    3. Data selection (extracts data from the database that are related to the analysis task)
    4. Data transformations (by aggregating or aggregating operations, transforming and unifying data into the form of appropriate mining)
    5. Data mining (basic steps, using intelligent methods to extract data patterns)
    6. Pattern assessment (identifying a truly interesting pattern representing knowledge, based on a measure of interest)
    7. Knowledge representation (using visualization and knowledge presentation techniques to provide users with knowledge of mining)
1.3 What types of data can be mined
    • Database data
    • Data Warehouse Data
    • Transactional data
    • Other types of data (data flow, ordered/sequential data, graph or network data, spatial data, text data, multimedia, and the World Wide Web)
1.4 What types of patterns can be mined
    • Characterization and differentiation
    • Frequent pattern, association, and correlation mining
    • Classification and regression
    • Cluster analysis
    • Off-Group Point analysis
1.5 What technology to use
    • Statistics
    • Machine learning
    • Database systems and data warehouses
    • Information retrieval
1.6 What types of applications are you looking for?
    • Business Intelligence
    • Web search engine
1.7 Key issues in data mining
    • Mining methods
    • User interaction
    • Availability and Scalability
    • Diversity of data types
    • Data Mining and social

"Data Mining concepts and technologies" reading notes-Introduction to Chapter I.

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.