Understand what a Web data mining

Source: Internet
Author: User

Intermediary transaction http://www.aliyun.com/zixun/aggregation/6858.html ">seo diagnose Taobao guest cloud host technology Hall

The goal of Web mining is to explore useful information from the web's hyperlink structure, Web page content, and usage logs. Although web mining uses many data mining techniques, it is not just a simple application of traditional data mining. In the past 20 years, many new mining tasks and algorithms have been invented successively. Web mining tasks can be divided into three main types, based on the data categories used during the mining process: Web structure mining, Web content mining, and web usage mining.

· Web structure Mining: Web structure mining seeks useful knowledge from hyperlinks (short links) that characterize web structures. For example, from these links, we can find out what are important pages, which is an important technology used by search engines. We can also explore community of users with common interests. These tasks do not exist in traditional data mining because there is no link structure in the relational table.

· Web content Mining: Web content mining extracts useful information and knowledge from Web content. For example: According to the theme of the Web page, we can do automatic clustering and classification. For example: www.g8g5.com, this station, the biggest theme is the QQ expression. Although these tasks are similar to the tasks of traditional data mining, we can extract useful information from a Web page for a variety of purposes, such as product descriptions, forum replies, and so on. This information can be used as a further analysis to exploit user attitudes. These tasks are also not traditional data mining tasks.

· Web usage Mining: Web uses mining to mine the user's access mode from the use log that records each user's click. This task also uses many algorithms for data mining. One of the important issues is to click on the preprocessing of stream data to generate the appropriate data that can be used for mining.

Search Engine optimization is a technology related to web data mining, because most of the search engine engineers think about how to design search engines, but also pay attention to or a large part of the search results in order to solve the problem of justice.

Author: Hangzhou SI billion Network Technology Co., Ltd.

Original load: http://www.seo.com.cn/

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.