Using the search engine analysis system to guide website optimization

Source: Internet
Author: User

Mainstream search engines today can be divided by function into four major systems: download, analysis, index, and query. Within this architecture, the analysis system undertakes four basic tasks: structuring web pages, deduplicating pages, segmenting text into words, and calculating page importance (such as Google's PR). The analysis system can be said to play a decisive role in how a site ranks, so studying it can better guide our site optimization work. Here the author shares some personal views.

First of all, here is a brief introduction to the working steps of the search engine analysis system:

First, read the original pages from the page library, where the download system stored them after the crawler fetched them.

Second, build a tag tree from each original page, extract the valuable attributes of the page, and package them into a web page object.

Third, discard redundant pages: among identical or similar pages, keep only one and pass it to the word segmentation module. This achieves page deduplication.

Fourth, the text segmentation module divides the body of the web page into a set of words.

Finally, the results of the analysis are sent to the index module for indexed storage.
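The five steps above can be sketched as a small pipeline. This is only an illustrative sketch, not how any real search engine is implemented: the page library is assumed to be a plain dictionary of URL to raw HTML, the "segmenter" is a simple word split standing in for a real word segmentation module, and all function names (`extract_text`, `segment`, `analyze`) are hypothetical.

```python
import re
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects visible text, skipping <script> and <style> content."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.parts.append(data)


def extract_text(raw_html):
    """Step 2 (simplified): parse the page and keep only its text."""
    parser = _TextExtractor()
    parser.feed(raw_html)
    parser.close()
    # Collapse all runs of whitespace so formatting differences vanish.
    return " ".join(" ".join(parser.parts).split())


def segment(text):
    """Step 4 (stand-in): split text into lowercase words."""
    return re.findall(r"\w+", text.lower())


def analyze(page_library):
    """Run steps 1-5 over a dict of url -> raw HTML.

    Returns a toy inverted index: word -> sorted list of URLs.
    """
    seen_texts = set()
    index = {}
    for url, raw_html in page_library.items():      # step 1: read pages
        text = extract_text(raw_html)               # step 2: page object
        if text in seen_texts:                      # step 3: drop duplicates
            continue
        seen_texts.add(text)
        for word in segment(text):                  # step 4: segmentation
            index.setdefault(word, set()).add(url)  # step 5: indexing
    return {word: sorted(urls) for word, urls in index.items()}
```

For example, calling `analyze` on two pages whose text is the same but whose markup differs indexes only the first of them, which mirrors the third step.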

Having understood the workflow of the search engine analysis system, I think we should do the following optimization work with that system in mind.

1. The first and second steps of the analysis system tell us to be clear about which information needs to be retained

A web page is written in HTML and is a semi-structured object. The analysis system preserves the valuable information, such as headings and body text, and discards what it does not need, such as the HTML markup itself. Generally, the title tag, meta tags, and H (heading) tags carry the page information that search engines consider most important. For example, while a search engine spider crawls a page, the content between <title> and </title> is often the first text of the page that the spider obtains. In addition, anchor text and the page body are valuable information that should be preserved and valued.
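As a rough illustration of what "retaining the valuable information" can look like, the sketch below pulls the title, named meta tags, headings, and anchor text out of a page with Python's standard `html.parser`. The class name `PageInfoParser` and its fields are hypothetical, and the sketch deliberately ignores nested cases such as a link inside a heading; a real analysis system would build a full tag tree.

```python
from html.parser import HTMLParser


class PageInfoParser(HTMLParser):
    """Extracts the title, meta tags, headings, and anchor text."""

    HEADING_TAGS = {"h1", "h2", "h3", "h4", "h5", "h6"}

    def __init__(self):
        super().__init__()
        self.title = ""
        self.meta = {}        # meta name (lowercased) -> content
        self.headings = []    # list of (tag, text)
        self.anchors = []     # list of (href, anchor text)
        self._current = None  # tag whose text is being captured
        self._buffer = []
        self._href = None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "meta":
            name, content = attrs.get("name"), attrs.get("content")
            if name and content is not None:
                self.meta[name.lower()] = content
        elif tag == "title" or tag == "a" or tag in self.HEADING_TAGS:
            # Simplification: starting a new captured tag replaces the
            # previous one, so nested captures are not supported.
            self._current = tag
            self._buffer = []
            self._href = attrs.get("href") if tag == "a" else None

    def handle_data(self, data):
        if self._current:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag != self._current:
            return
        text = " ".join("".join(self._buffer).split())
        if tag == "title":
            self.title = text
        elif tag == "a":
            self.anchors.append((self._href, text))
        else:
            self.headings.append((tag, text))
        self._current = None
```

After `parser.feed(html)`, the fields `parser.title`, `parser.meta`, `parser.headings`, and `parser.anchors` hold roughly the pieces of a page that the text above describes as most valued.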

2. The third step of the analysis system tells us to pay attention to the content of our web pages

The Web holds hundreds of millions of pages, and storing and processing them is a daunting task. Many of these pages are identical or similar to one another. Therefore, the first thing the analysis system does before formally analyzing pages is to deduplicate them. Search engines treat the following four cases as identical or similar pages:

(1) the content and format of two pages are completely identical;

(2) the content of two pages is completely identical, but the format is different;

(3) two pages share part of their important content and have the same format;

(4) two pages share part of their important content, but the format is different.

Seen from the analysis system, the uniqueness of page content is clearly very important for site optimization, so producing original content is worthwhile.
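To make the idea of identical versus similar pages concrete, here is a minimal sketch of one common approach, not the method of any particular search engine. Exact duplicates are detected by hashing the normalized text (so formatting differences do not matter), and similar pages are detected by comparing sets of three-word shingles with Jaccard similarity. The function names and the 0.8 threshold are assumptions chosen for illustration, and the input is assumed to be page text with the markup already stripped.

```python
import hashlib
import re


def normalize(text):
    """Lowercase the text and reduce it to single-spaced words."""
    return " ".join(re.findall(r"\w+", text.lower()))


def fingerprint(text):
    """Hash of the normalized text; equal for same content, any format."""
    return hashlib.sha256(normalize(text).encode("utf-8")).hexdigest()


def shingles(text, k=3):
    """Set of overlapping k-word sequences in the normalized text."""
    words = normalize(text).split()
    if len(words) < k:
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a, b):
    """Jaccard similarity of the shingle sets of two texts (0.0 to 1.0)."""
    sa, sb = shingles(a), shingles(b)
    if not sa and not sb:
        return 1.0
    return len(sa & sb) / len(sa | sb)


def deduplicate(texts, threshold=0.8):
    """Keep the first of each group of identical or similar texts."""
    kept = []
    seen = set()
    for text in texts:
        fp = fingerprint(text)
        if fp in seen:
            continue  # same content, possibly different formatting
        if any(jaccard(text, other) >= threshold for other in kept):
            continue  # largely overlapping content
        seen.add(fp)
        kept.append(text)
    return kept
```

Comparing every page against every kept page is quadratic and only workable for a toy example, but it shows why a page that merely reformats or lightly edits existing content may be treated as a duplicate.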

3. The calculation of web page importance, that is, the fourth and fifth parts of the analysis system, tells us that building page weight is worthwhile

Take Google's PR value as an example: Google uses it to rate the level or importance of a web page. Baidu has a similar system. We should therefore work to raise the importance of our pages in line with these algorithms, for example by acquiring high-quality inbound links, by writing high-quality promotional articles that link to our pages and publishing them on large sites, and by providing valuable page content. All of these can improve the weight of a page. Webmaster friends already know the specific practices, so they are not detailed here.
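The intuition that inbound links raise a page's importance can be shown with the textbook form of PageRank, computed by simple iteration. This is a simplified sketch under stated assumptions: the link graph is a small dictionary, the damping factor of 0.85 is the commonly quoted value, and the function name `pagerank` is hypothetical. It is not a description of how Google or Baidu compute rankings in production.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Textbook PageRank by power iteration.

    links: dict mapping each page to the list of pages it links to.
    Returns a dict mapping each page to a score; scores sum to 1.
    """
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    n = len(pages)
    if n == 0:
        return {}

    rank = {page: 1.0 / n for page in pages}
    for _ in range(iterations):
        # Every page receives a small baseline share.
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page splits its score evenly among its outbound links.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A page with no outbound links spreads its score to all.
                share = damping * rank[page] / n
                for target in pages:
                    new_rank[target] += share
        rank = new_rank
    return rank
```

In a graph where several pages link to one page, that page ends up with the highest score, while a page that nobody links to keeps only the baseline share. This is the effect that link building aims for.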

By analyzing the search engine analysis system, we can see clearly how site optimization should be done. The above is purely the author's personal view, and I hope to discuss and learn together with you. Article copyright: Guangzhou People Hospital: http://www.gzrlw.net/. Reprinting is welcome, but please retain the link when you do. Thank you for your understanding and cooperation!
