Two basic methods of grasping search engine

Source: Internet
Author: User
Keywords SEO

Intermediary transaction http://www.aliyun.com/zixun/aggregation/6858.html ">seo diagnose Taobao guest cloud host technology Hall

Hello everyone, here is Yaan SEO optimization blog. Today we say that the search engine is included in the process taken by the capture strategy.

When the spider completes the visit to the robots.txt file, it will begin to judge whether the page entered is in conformity with the standard, if it is satisfied, then extract its content and link. After the completion of this page crawl, still not finished, the spider will follow the extracted links to explore, from this link to the next page, and from the next page of the link to climb to the next page ...

Because the Web page link structure is extremely complex, spiders need to adopt a certain strategy to crawl to all the pages on the web. The simplest search engine crawling strategy has two kinds:

1. Depth Priority strategy

  

As shown above, the simple point is to crawl down a line vertically, until the task is complete.

2. Breadth Priority strategy

  

As shown above, it is simply to crawl all the links on a given page first, and then crawl from each link in the same parallel.

In practice, these two strategies happen at the same time, theoretically, as long as enough time, search engine spiders can crawl through all the pages. But the spider's bandwidth resources, time is not unlimited, so spiders can only crawl a certain time, the higher the weight of the site natural crawling longer.

Search the purpose of spiders is to explore the value of the page and included, which is why the weight of the station crawling long time, crawl degree deep reason. Therefore, we suggest that the new station site link level should not be too deep, lest spiders crawl in a short time.

After the crawler engine spider crawling finished, will collect the Web page data to the data analysis system, the entire collection process is over. OK, today's SEO Foundation is here.

This article is from: http://www.lxmseo.com/search-engines3.html

Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.