What is a website log?
As webmasters, we have few ways to understand the health of a site beyond using a traffic statistics tool to see how many visitors it receives. The website log helps fill this gap. A website log is a data file in which the server records, in real time, every request it handles. By analyzing the log we can see which users visited the site and which pages they visited, and we can also see how search engine spiders are crawling the site. The log also records the HTTP status code returned for each request. Analyzing these status codes over the long term can reveal details that are hurting the site, so that site managers can better manage and optimize it.
Where are site logs stored?
Site logs are typically stored in the log folder or the LogFiles folder under the root of the website; the folder name varies by virtual host provider. A site log is a plain text file with a .txt extension. You can download the logs for local analysis with an FTP upload and download tool such as FlashFXP or LeapFTP.
Site log case analysis:
1. Log syntax:
#Software: Microsoft Internet Information Services 6.0
#Version: 1.0
#Date: 2010-08-11 00:00:17
#Fields: date time s-sitename s-ip cs-method cs-uri-stem cs-uri-query s-port cs-username c-ip cs(User-Agent) sc-status sc-substatus sc-win32-status sc-bytes cs-bytes
Description:
#Software: the name of the server software that produced the log;
#Version: the version number of the log format;
#Date: the date and time at which the log was started;
#Fields: the fields recorded in each entry, described as follows:
date: the date of the request;
time: the time of the request;
s-sitename: the name of the site instance on your virtual host or machine;
s-ip: the server IP address;
cs-method: the request method. Two are common: GET, which is what normally happens when we open a URL, and POST, which is used when submitting a form;
cs-uri-stem: the file or page the visitor requested;
cs-uri-query: the parameters attached to the requested address, such as the string id=12 after the ? in a request for an ASP file; "-" is recorded if there are no parameters;
s-port: the server port accessed;
cs-username: the visitor's user name; "-" is recorded if there is none;
c-ip: the visitor's IP address;
cs(User-Agent): the user agent of the client, which identifies the visitor's browser or the name of the search engine spider;
sc-status: the HTTP status code; 200 means success, 403 means no permission, 404 means the page was not found, and 500 means the server program failed;
sc-substatus: the HTTP substatus code, which refines the status code (0 if there is none);
sc-win32-status: the Windows status code for the request (0 means success);
sc-bytes: the number of bytes the server sent to the client;
cs-bytes: the number of bytes the client sent in its request;
The values that follow the HTTP status code do not have a fixed format, because they depend on which fields the server is set to record; if only one byte count appears, it is the size in bytes of the downloaded data.
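The field list above suggests a simple way to read a log: take the field names from the #Fields line and pair them with the values of each entry. The following is only a minimal sketch in Python, not a full log parser. It assumes that values are separated by whitespace and contain no embedded spaces; the function names and the sample header and entry are hypothetical and were made up for illustration.

```python
def parse_fields_directive(line):
    """Return the list of field names from a '#Fields:' directive line."""
    prefix = "#Fields:"
    if not line.startswith(prefix):
        raise ValueError("not a #Fields directive")
    return line[len(prefix):].split()


def parse_entry(field_names, line):
    """Pair each whitespace-separated value of a log entry with its field name."""
    values = line.split()
    # zip stops at the shorter sequence, so fields that are missing at the
    # end of an entry are simply left out of the result.
    return dict(zip(field_names, values))


# Hypothetical sample data, for illustration only.
header = "#Fields: date time cs-method cs-uri-stem cs-uri-query c-ip sc-status sc-bytes"
entry = "2010-08-11 00:00:17 GET /news/list.asp id=12 192.0.2.10 200 5120"

fields = parse_fields_directive(header)
record = parse_entry(fields, entry)
print(record["cs-uri-stem"], record["cs-uri-query"], record["sc-status"])
```

Reading the names from the log's own #Fields line, rather than hard-coding them, keeps the sketch usable when a host records a different set of fields.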
2. HTTP status codes:
1**: The request was received and processing continues
2**: The request was successfully received, understood, and accepted
3**: Further action must be taken to complete the request
4**: The request contains bad syntax or cannot be fulfilled
5**: The server failed to fulfill an apparently valid request
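For long-term analysis it can help to group status codes by these five classes and count them. The sketch below is one possible way to do that in Python; the class labels, the function names, and the sample list of codes are assumptions made for illustration and do not come from a real log.

```python
from collections import Counter

# Labels for the five classes listed above, keyed by the first digit.
STATUS_CLASSES = {
    1: "informational",
    2: "success",
    3: "redirection",
    4: "client error",
    5: "server error",
}


def status_class(code):
    """Map a status code such as 304 or "304" to the name of its class."""
    return STATUS_CLASSES.get(int(code) // 100, "unknown")


def summarize(codes):
    """Count how many of the given status codes fall into each class."""
    return Counter(status_class(code) for code in codes)


# Hypothetical status codes, as they might be collected from the sc-status field.
sample_codes = ["200", "200", "304", "404", "500"]
for name, count in summarize(sample_codes).items():
    print(name, count)
```

A rising share of client error or server error entries over time is the kind of unfavorable detail that such a count makes visible.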
Case source: one of the log entries for web168.com is as follows:
2010-08-09 11:44:32 W3SVC622339 222.186.25.142 GET /index.html - 80 - 123.125.66.70 Baiduspider+(+http://www.baidu.com/search/spider.htm) 304 0 0 283
Description:
This record shows that the Baidu spider crawled the page "index.html" in the site's root directory at 11:44:32 on 2010-08-09. The 304 status code returned tells the spider that the page has not been updated or modified since its last visit, and 283 is the number of bytes the spider downloaded for this request.
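The same reading can be done in code. The sketch below, in Python, splits the entry above into its fields (written out here with one space between values) and reports a spider visit that returned 304. It rests on several assumptions: the field order follows the #Fields line shown earlier, the single trailing number is treated as sc-bytes, and the SPIDER_MARKERS list is a hypothetical, incomplete set of substrings for recognizing spiders.

```python
# Field names in the order of the #Fields line; the last number in the
# entry is assumed to be sc-bytes.
FIELDS = [
    "date", "time", "s-sitename", "s-ip", "cs-method", "cs-uri-stem",
    "cs-uri-query", "s-port", "cs-username", "c-ip", "cs(User-Agent)",
    "sc-status", "sc-substatus", "sc-win32-status", "sc-bytes",
]

# Hypothetical substrings used to recognize spiders; extend as needed.
SPIDER_MARKERS = ("baiduspider", "googlebot", "bingbot")


def parse_entry(line):
    """Pair each whitespace-separated value with its field name."""
    return dict(zip(FIELDS, line.split()))


def spider_name(user_agent):
    """Return the matching marker if the user agent looks like a spider, else None."""
    lowered = user_agent.lower()
    for marker in SPIDER_MARKERS:
        if marker in lowered:
            return marker
    return None


entry = (
    "2010-08-09 11:44:32 W3SVC622339 222.186.25.142 GET /index.html - 80 - "
    "123.125.66.70 Baiduspider+(+http://www.baidu.com/search/spider.htm) "
    "304 0 0 283"
)
record = parse_entry(entry)
spider = spider_name(record["cs(User-Agent)"])
if spider and record["sc-status"] == "304":
    print(f"{spider} requested {record['cs-uri-stem']}: not modified")
```

Matching on a substring of the user agent is only a rough check, since any client can send a user agent that claims to be a spider.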