The last author in the "Forgotten by the SEO fundamental" mentioned in the Web site log some insights, many readers feel very confused, and even a lot of readers do not know how to view the site log, today I will explain in detail under the site log in the SEO role, as well as some common analysis methods.
In the previous article, I mentioned that the decision to rank the site is every time the spider crawled through your site to bring back to the database of that comprehensive value. Many readers have been biased towards this understanding. Many people think that this comprehensive value is the site log every data, or each IP represents the meaning, in fact, this is a wrong understanding. First of all, the Web log can only represent the traces of spiders crawled. There is also a lot of IP understanding of the information on the internet I do not agree. The author on the internet to see a lot of what a news story climbed 220.181.108.*ip, the next day will be immediately included in the author's view this is completely nonsense. Please remember very important point, Baidu Spider's IP is to crawl your site before deciding, and not first know that you this page is a high quality page, and then use the right spider to crawl. This is a very big misunderstanding.
Of course the importance of the journal is understandable. It doesn't fully see your site's ranking, but you can find a lot of trends. So today I would like to talk about some of the Baidu spider some common judgments. Why is Baidu Spider? Because now the website SEO basically already can understand for Baidu engine optimization. The following is a detailed explanation of how to analyze the site log.
2013-09-09 00:07:16 59.60.7.125 get/news/news2013524236.html-80-123.125.71.16 http/1.1 mozilla/5.0+ (Linux;u; ANDROID+2.3.7;ZH-CN) +applewebkit/533.1+ (Khtml,like+gecko) +version/4.0+mobile+safari/533.1+ (compatible;++http ://www.baidu.com/search/spider.html)--www.jinh.cn 200 0 0 16143 296 140
The above paragraph of text is the author from the website log to intercept a complete short section. Get in front of two data, the first half is time, the second half is your site's domain IP, get back to-the front is represented by spiders crawling page. Most of the time is nothing, on behalf of it climbed the homepage of your site. Behind--The back is very important to the crawling spider's IP. Generally common IP in fact, two kind is the ip220.181.108.* of the spider, the other is the view of the garbage content spider 123.125.71.*. For the new station, we also need to pay special attention to a ip:121.14.89.*. This IP represents your new station has been out of the new station of the inspection period, officially become a common site to look at. There are also some need to pay attention to is the IP of the third interval of 68 or 51 of IP, when these IP appear in a large number of your site, I can responsibly tell you: Pro, you can prepare your site for the funeral.
Of course, many times you will find a lot of you can not understand the IP, most of the time you are using some Web site monitoring tools, they simulate Baidu Spider generated IP, such as webmaster home, Love Station network. These counterfeit goods with nslookup command a check then know, there is no need to too much care.
Then IP followed by a lot of the only thing to note is that Web site. It represents the spider to find your site's entrance. For example, you have published an article on a blog or forum, and later found that the spider is from there to find your site. So, to show that the chain is the effect is better, you can continue to do it. And then there's the final return value, which is 200 0 0. The returned code generally has 200, 301, 304, 403, 404 These codes can be found on the internet on the first close to explain the author does not do more introduction. The last three values represent downloads, uploads, and time consuming.
Said so much, I think a lot of people have a preliminary understanding of the site's log. Here, the author again stressed that the spider's IP is not crawling to the site before the decision, so do not see 220.181.108.* IP is the angel of Providence, it is also likely to be the sickle of death! Next time I will share with you, how to in-depth analysis of the Web site log.
This article by the Union Science and Technology http://www.lianke.cn despair of peanuts to provide, reproduced please indicate the source, thank you!