How search engines judge the core content of a page

Source: Internet
Author: User
Keywords: search engine, core content


After a search engine spider sends a page's code back to the search engine's server, how does the search engine (SE) determine the core content of the page?

First, here is how KYW understands the first few steps of a search engine's process:

1. The spider downloads a page and sends it back to the server;

2. The server locates the core content of the page and then strips the HTML code;

3. The core content of the page is extracted;

......

I'm not sure whether Google, Baidu, or Yahoo! really have a "core content" step, but I believe there must be a similar mechanism, because without it a search engine would spend a lot of resources on repetitive work. Of course, KYW is not a search engine engineer; what follows is only some thinking about how search engines might work.

Faced with a whole page of HTML code, how does an SE judge where the core content is?

Step one: Compare the page with other pages under the same domain name, in the same directory, and with the same file format, and remove the parts they share. After this step, the navigation bar at the top, the copyright information at the bottom, and fixed-position ads have been removed. I estimate that whenever a new website is added, the search engine builds a comparison template for it to improve efficiency. If a site is redesigned often, the search engine may not have caught up at the start of each redesign, which may leave new pages with less-than-ideal rankings.
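The comparison in step one can be sketched roughly as follows. This is only an illustration of the idea, not how any real search engine works: the function name `strip_shared_lines`, the sample pages, and the choice to compare whole lines are all assumptions made for the example.

```python
def strip_shared_lines(page: str, siblings: list[str]) -> list[str]:
    """Keep only the lines of `page` that appear in none of the sibling pages.

    Assumption: pages built from the same template repeat their navigation,
    footer, and ad markup line for line, so an exact line match is enough.
    """
    shared = set()
    for other in siblings:
        shared.update(line.strip() for line in other.splitlines())
    return [
        line.strip()
        for line in page.splitlines()
        if line.strip() and line.strip() not in shared
    ]


# Two hypothetical pages from the same directory, sharing one template.
page_a = """<div id="nav">Home | Products | About</div>
<div id="main">Blue widgets are waterproof and ship in two days.</div>
<div id="footer">Copyright Example Co.</div>"""

page_b = """<div id="nav">Home | Products | About</div>
<div id="main">Red widgets come with a five-year warranty.</div>
<div id="footer">Copyright Example Co.</div>"""

unique = strip_shared_lines(page_a, [page_b])
print(unique)
```

With these two sample pages, only the `main` block of `page_a` survives, because the navigation and footer lines are identical in both.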

Step two: Remove the parts that consist mostly of links. After this step, sections such as "Related articles" and "Recommended articles" have been removed, leaving some code that contains text content.
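One way to picture step two is a "link density" check: the share of a block's text that sits inside `<a>` tags. The sketch below uses Python's standard `html.parser`; the names `link_density` and `drop_link_heavy`, the 0.5 threshold, and the sample blocks are assumptions chosen for illustration.

```python
from html.parser import HTMLParser


class LinkDensityParser(HTMLParser):
    """Counts visible characters, and how many of them are inside <a> tags."""

    def __init__(self):
        super().__init__()
        self.anchor_depth = 0
        self.link_chars = 0
        self.total_chars = 0

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self.anchor_depth += 1

    def handle_endtag(self, tag):
        if tag == "a" and self.anchor_depth > 0:
            self.anchor_depth -= 1

    def handle_data(self, data):
        n = len(data.strip())
        self.total_chars += n
        if self.anchor_depth > 0:
            self.link_chars += n


def link_density(fragment: str) -> float:
    """Fraction of the fragment's text that is link text (0.0 if no text)."""
    parser = LinkDensityParser()
    parser.feed(fragment)
    parser.close()
    if parser.total_chars == 0:
        return 0.0
    return parser.link_chars / parser.total_chars


def drop_link_heavy(blocks: list[str], threshold: float = 0.5) -> list[str]:
    """Keep blocks whose link density is at or below the (assumed) threshold."""
    return [b for b in blocks if link_density(b) <= threshold]


# Hypothetical blocks left over after template removal.
related = ('<ul><li><a href="/a">Related article one</a></li>'
           '<li><a href="/b">Related article two</a></li></ul>')
body = ('<div>Blue widgets are waterproof. See the '
        '<a href="/faq">FAQ</a> for details.</div>')

kept = drop_link_heavy([related, body])
print(kept)
```

Here the "related articles" list is all link text and is dropped, while the body block, which has only one short link, is kept.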

Step three: In the remaining code, determine which tag (it may be a DIV, TABLE, P, or other tag) holds the most text, because the core content generally contains the most text.

After these three steps, the search engine should be able to locate the core content of each page. From this line of thinking, we can draw a few conclusions:
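Step three amounts to picking the remaining block with the most visible text. A minimal sketch, again using the standard `html.parser`, might look like this; `visible_text`, `pick_core_block`, and the sample blocks are hypothetical names and data, and measuring plain character count is an assumption.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects the visible text of an HTML fragment, ignoring the tags."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        if data.strip():
            self.parts.append(data.strip())


def visible_text(fragment: str) -> str:
    """Return the fragment's text with all markup removed."""
    parser = TextExtractor()
    parser.feed(fragment)
    parser.close()
    return " ".join(parser.parts)


def pick_core_block(blocks: list[str]) -> str:
    """Return the block with the most visible text (blocks must be non-empty)."""
    return max(blocks, key=lambda b: len(visible_text(b)))


# Hypothetical candidate blocks that survived the first two steps.
sidebar = "<div>Hot this week</div>"
article = ("<div><p>Blue widgets are waterproof and ship in two days.</p>"
           "<p>Each one is tested before it leaves the factory.</p></div>")

core = pick_core_block([sidebar, article])
print(visible_text(core))
```

Note that the comparison uses the length of the extracted text, not the length of the raw HTML, so a block stuffed with markup but little text does not win.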

1. Pages under the same directory should ideally use the same template.

2. The core content should not be too small. This applies especially to company and corporate websites: rather than showing only prices and pictures, it is better to include a good amount of descriptive text.

3. If there are many errors in the HTML code, rankings may suffer, because the search engine may misjudge where the core content is.

4. I look forward to your additions ^_^

KYW believes the main work of SEO is to help search engines understand a site and the content of its pages more efficiently. So when you have spare time, it is worth pondering how search engines work; the more thoroughly you think these problems through, the more adaptable your SEO methods become. Of course, there is no need to obsess over it, and feel free to leave me a message.
