Analysis of spider footprint in Baidu

Source: Internet
Author: User
Keywords Baidu Spider

Intermediary transaction http://www.aliyun.com/zixun/aggregation/6858.html ">seo diagnose Taobao guest cloud host technology Hall

After writing "recent Baidu and Google's collection of differences www.admin5.com/article/20080619/89812.shtml", began to think, looking for ... Why does Baidu not bird me?

Online netizens wrote that Baidu has a 15-day observation period, do not know if it is true. Log in to the server this afternoon to view the analysis IIS log files. found that Google and Baidu are on my website on the third day, that is, June 14 visited my website, and the first access to the files are robots.txt, explain how important robots.txt documents. Until today found Yaho visited my robots.txt files, other search engines did not find footprints, which is why the domestic search market Baidu and Google accounted for more than 80% of the market. I reckon that if I didn't apply for Yahoo's traffic statistics tool, it wouldn't know if I would be visiting.

Baidu and Google's action speed is very fast, Baidu's action is no less than Google weak. Starting from number 14th, these two guys basically visit my website every day, of course, Google's access volume is relatively frequent, but Baidu is not weak to where, basically is daily visit. Specific observation of today's log, starting from 0:8 A.M., Baidu Spider constantly to harass me to sleep, until 17 o'clock in the afternoon, access interval is basically 1 hours, from the beginning to visit the first page, access to the channel pages, basically are successful. Casually picked a few data as follows:

2008-06-16 15:07:23 w3svc1 202.104.188.69 get/plus/rssmap.html-80-220.181.32.5 baiduspider+ (+http://www.baidu.com/ search/spider.htm) 200 0 0

2008-06-17 04:09:07 w3svc1 202.104.188.69 get/index.html-80-61.135.168.39 baiduspider+ (+http://www.baidu.com/search /spider.htm) 200 0 64

2008-06-17 10:44:48 w3svc1 202.104.188.69 get/html/info/index.html-80-220.181.32.5 baiduspider+ (+http:// www.baidu.com/search/spider.htm) 304 0 0

The red digit 200 indicates the normal request completes, two 0 does not know what meaning, 64 also does not know what meaning, who knows please explain, extremely thanks. 304 is not modified, is not as expected to modify the document, Baidu also want to see your content is often updated, so often update the content of the site is also very important. Basically did not find 4xx (error in the client) and 5xx (server error) and other error messages, can be said to be relatively friendly.

So why Baidu has been reluctant to include me? What is it waiting for? My own thoughts are:

First, Baidu to the new station must observe a period of time, no matter what you are content, are not included, but the spider as usual visit, and so after this period of observation, will soon let go;

Second, the site is less original content, the idea is a bit wrong, because the fun Fly Business Network (www.trip36.com) in addition to the air information channel, Special airfares page is the original content to come, home is also, why the beginning of the climb home, but not included it? A little confused, Can only be explained with the first thought;

Third, declare that my domain name is newly registered, excluding the previous penalty records, my server is using a stand-alone IP, excluding multiple sites using the same IP is implicated in the possibility.

So, when it comes to the end, is it really like the Netizen said to wait for 15 days? We discuss together, hope to have experienced veteran analysis, or give some advice, extremely grateful! I am the new bird, I also continue to observe, continue to share, thank you for your support!

Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.