Beginner's Guide to Web log analytics

Source: Internet
Author: User

Intermediary transaction SEO diagnosis Taobao guest Cloud host technology Hall

Web site log analysis is very easy for the old webmaster, from the Log Code view to log analysis of the problem solved is at hand. In fact, Web site log analysis is not difficult, but the novice does not understand the meaning, will feel a problem. Slowly read the website log you will understand the meaning of it. Here I take the lucky Racing website's log as an example, how to give beginners how to quickly get started this job.

In fact, the code is not as difficult as everyone imagined, I just did the optimization of the lottery site work, contact code, I really think that the webmaster in good cow ah, so difficult lucky Racing Http://www.7ien.cn/xysc.aspx website Log code are understood, worship the feelings of no words. After watching a period of time Web log, only to find that it is not their cattle, is I or rookie. Explore their own words, the site log analysis of the introduction time will be longer, here to share some experience, I believe that beginners can quickly get started.

1, Spider name Introduction

In the website log, the spider name generally has a few kinds: Baidu->baiduspider, Google->googlebot, Msn->msnbot, Yahoo->slurp, Yodao->youdaobot, Sogou->sogou+get+spider. In the log, as long as the search above spider name, you can see the spider type of crawl traces.

2, Spider crawl return type

After crawling the spider will return the code, by looking at the loan status can see the crawl results, the main HTTP status code has several categories:

(1) 200 code, indicating that the spider crawled normally.

(2) 304 code, which indicates that the content has not been updated since the last crawl. Typically, a picture of a Web site often returns that value.

(3) 404 Code, access to this link is the wrong link. This error link, on the one hand from the original existence later deleted the page, on the other hand may not exist, but other people outside the chain of such a dead link.

(4) 302 Code, which represents temporary redirection.

(5) 301 Code, which indicates permanent redirection.

(6) 500 code, indicating that the program is wrong.

3, the Log Code interpretation

#Software: Microsoft Internet Information Services 6.0

#Version: 1.0

#Date: 2013-12-22 16:00:39

#Fields: Date Time s-sitename s-computername s-ip cs-method cs-uri-stem cs-uri-query s-port cs-username c-ip cs-version CS ( User) CS (Cookie) CS (Referer) cs-host sc-status sc-substatus sc-win32-status sc-bytes cs-bytes Time-taken

Date indicates record access dates;

Time access times;

S-sitename represents your virtual host.

S-ip visitor IP;

Cs-method means the access method, there are two common, one is get, is usually we open a URL to access the action, the second is post, submit the form when the action;

Cs-uri-stem is to access which file;

Cs-uri-query refers to the access to the address of the accompanying parameters, such as ASP file? The string id=12, and so on, if no parameters are used-represented;

S-port access Port;

Cs-username visitor name;

C-ip source IP;

CS (user) Access source;

Sc-status State, 200 means success, 403 means no permissions, 404 means that the page is not hit, 500 indicates that the program is wrong;

The byte size that the Sc-substatus server delivers to the client;

Cs–win32-statu the byte size that the client delivers to the server;

Specific case Study:

2013-12-22 18:47:12 w3svc2137573334 d-901195c886694 119.147.151.150 get/a.aspx id=2230&typeid=91 80-123.125.71.28 http/1.1 mozilla/5.0+ (compatible;+baiduspider/2.0;++http://www.baidu.com/search/spider.html)--www.7ien.cn 200 0 0 59004 243 2250

This log said, Baiduspider: said spider crawl type for Baidu Spider, get/a.aspx id=2230&typeid=91 said spider crawl file named A.aspx id=2230&typeid=91. The return result is 200.

Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.