Say the search engine will first crawl and index which pages

Source: Internet
Author: User

Intermediary transaction http://www.aliyun.com/zixun/aggregation/6858.html ">seo diagnose Taobao guest cloud host technology Hall

For search engines, the theory can crawl and index all the pages on the Internet, can be unrestricted, but actually is not the case, the search engine due to some technical factors, such as bandwidth, mass storage space, response speed and other factors, will always give priority to crawl and index some pages, It is impossible to crawl and index all the meeting, then it will first crawl and index which pages? In its view, and is reasonable, it will try to crawl some of the more important pages, then the index is how to determine which pages are more important to the priority of crawling and indexing it? It generally takes into account the following factors:

1, the weight of a relatively high site has been said before, search engine due to a number of factors, will always first crawl some Web pages, the quality of the site is relatively high, the eligibility of the older site is it that the weight is relatively high, such a site search engine spiders will first go crawling and indexing, so we have to find ways to improve the weight of the site. The weight of the website is a comprehensive index, need to work from many aspects. Search engines are not only priority crawling and index of higher weight of the site, and for the weight of high sites, search spiders will often crawl deeper. For example, some big websites, such as Sina, NetEase, A5, stragglers, etc., for the new site, will always be included soon. Because they have high weights.

2, the updated frequency of the page search engine spiders crawl the web every time, will record the data, if the next time to crawl, the page does not change, then it in order to save bandwidth, there is no need to come so frequently, if it is once a week, then it may be half a month, update the less, The fewer times it comes, if you update it more frequently, and the more aggressive, search engine spiders will be more frequent, for example, if the previous is once a week, if you update fast enough, enough frequent, it may be changed to two times a week, three times a week .... Even daily snapshots. This requires you to cultivate spiders.

3, import links search engine spiders are crawling along the link to the Web page, to want the page to be Spiders crawl, the page must be imported links, if there is no link, spiders will not know the existence of your page. and the high quality of the import link to the page included a great help. We need to do the chain to the homepage, and to the inside page also do outside the chain, and the site's internal links to do a good job, the source of the article Taobao Crown Shop: http://www.suptb.cn/related pages to link up with each other, the homepage to have to the column page links, column page to have to the first page of the link to the content page link , the content page must have to the column page and to the homepage link, thus forms a flat right type NET structure. This helps search engines crawl and index as many pages as possible.

4, the page and the homepage of the click Distance we know in general, the highest weight on the site is the home page, and most of the external links also point to the homepage of the site, search engine spiders crawl The most frequent is also the home page, it is crawling other pages of the portal. Page from the number of effective clicks on the page, the higher the weight of the pages, so that the search engine spiders crawling opportunities will be greater. So we have to find a way to bring the new page of the link on the homepage more revealing. So as to accelerate the opportunity to be included.

Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.