"Deep spider" Baidu spider IP section detailed

Source: Internet
Author: User
Tags dedicated ip

everyone to the site log analysis, common to a lot of different IP segments of the Baidu Spider, in order to facilitate better log analysis, the following list of Baidu different IP segments of the common spider some details, and so-called down the right spider , sand box spider, high-weight spiders and so on

The following Baidu Spider IP visit, ready to crawl your things, crawl Web page Baidu Spider.
60.172.229.61
61.129.45.72
61.135.162.*


Baidu bidding Spider
61.135.165.134
117.34.74.66
118.122.188.194
119.63.196.9
125.39.78.185



(Baidu Alliance Crawler), plainly speaking is Baidu statistics.
61.135.186.*



Webmaster Tools Imitation of Baidu Spider.
61.147.98.146
61.188.39.16
113.98.254.245
117.21.220.245
117.28.255.42


114 Webmaster Toolbox (This is your website instability often come)
119.147.114.213
121.10.141.*


Baidu Image crawler
123.15.**.**



This spider often comes, others come less, indicating that the site may have to enter the sandbox, or be down right.
123.125.68.*


Crawl within the page included, the weight is lower, crawling over this paragraph of the inside page of the article is included but not put out (meaning to be determined), because it is not original or collection of articles. (Baidu web crawler (Baidu image crawler)
123.125.71.*


search outside webmaster tools spider.
124.248.34.52


Also belongs to the Baidu spider IP mainly caused by the composition, is the new line station more, there have used webmaster tools, or SEO comprehensive testing caused, not much use.
125.90.88.*


Baidu Spider
159.226.50.*
180.76.5.*
180.76.5.87
220.181.158.107



Camouflage Baidu Spider IP
180.149.130.*  


This IP segment appears after the new station and the site has abnormal phenomenon.
183.91.40.144
203.208.60.*


This IP segment patrol each station continuously, is passing by a moment.
210.72.225.*


Every day this IP segment only increases and is likely to go into the sandbox or K station
218.30.118.102
220.181.68.*
123.125.68.*
220.181.68.*  

Mainly crawl home accounted for 80%, the page accounted for 30%, this crawl of the article or home, absolutely 24 hours to put out and overnight snapshots! The general successful crawl return code is 200 0 0 return 304 0 0 for the site is not updated, the spider came, if 200 0 64 don't worry it's not K station, It may be that the site is dynamic, so the return is this code.
220.181.108.*


On behalf of Baidu Spider IP visit ready to crawl your things
220.181.7.*
123.125.66.*



This IP segment is used as a time to spend a new station
121.14.89.*



This IP segment appears after a new station or site has abnormal behavior
203.208..60.*



This IP segment patrols the stations continuously
210.72.225.*


This is the home page of Baidu crawl dedicated IP if 220.181.108 segment of the IP to basically say that the site will be a daily overnight snapshot, absolutely wrong
220.181.108.95



98% Crawl The first page may also crawl other "not referring to the inner page" is the weight of the IP segment this paragraph crawled over the article or the first page of the basic 24 hours released.
220.181.108.92


Crawl inside page ingest weight lower crawl over this section of the inside page article will not be released soon, because not original or collect articles
123.125.71.106


belong to the synthesis. The main crawl of the first and inner pages or other pages. belongs to the weight IP segment, grabbed the article or the homepage basically 24 hours to put out
220.181.108.91


Focus crawl update article within the page to reach 90%, 8% crawl home, 2% other weight IP segment, grabbed the article or home base 24 hours to put out
220.181.108.75


dedicated Crawl Home IP weight segment, general return code 304 0 0 for not updated
220.181.108.86


Crawl inside the page included, the weight is low, crawl this paragraph of the inside page article will not be released soon, because not original
123.125.71.95
123.125.71.97


Dedicated Crawl Home IP weight segment, general return code 304 0 0 for not updated
220.181.108.89
220.181.108.94
220.181.108.97
220.181.108.80
220.181.108.77


Crawl inside the page included, the weight is low, crawl this paragraph of the inside page article will not be released soon, because not original
123.181.108.77


dedicated Crawl Home IP weight segment, general return code 304 0 0 for not updated

220.181.108.83

This article by Whchina (Jiangcheng old temperature) original release, reproduced please indicate the source, Jiangcheng old temperature as a thinker. 877313758

"Deep spider" Baidu spider IP section detailed

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.