The HTTP status codes recorded in your server logs show how search engines crawl your site.
The following table lists common HTTP status codes and their definitions.
Code | Description |
2xx | Success |
200 | OK: the request succeeded. |
201 | Created: the request succeeded and a new resource was created; typically returned after a POST request. |
202 | Accepted: the request was accepted for processing, but processing has not yet completed. |
203 | Non-Authoritative Information: the returned metadata comes from a local or third-party copy rather than from the origin server. |
204 | No Content: the server fulfilled the request, but there is no content to return. |
3xx | Redirection |
301 | Moved Permanently: the requested resource has a new location and the change is permanent. |
302 | Found: the requested resource temporarily resides under a different URI. |
303 | See Other: the response to the request can be found under another URI and should be retrieved with the GET method. |
304 | Not Modified: the document has not changed since the copy the client already holds. |
305 | Use Proxy: the requested resource must be accessed through the proxy given in the Location field. |
306 | Unused: no longer in use; the code is reserved. |
4xx | Client errors |
400 | Bad Request: the request has a syntax problem or cannot be satisfied. |
401 | Unauthorized: the client has not been authorized to access the resource; authentication is required. |
402 | Payment Required: reserved for future use; originally intended for payment schemes. |
403 | Forbidden: the server refuses the request, and authenticating will not help. |
404 | Not Found: the server could not find the given resource; the document does not exist. |
407 | Proxy Authentication Required: the client must first authenticate itself with the proxy. |
415 | Unsupported Media Type: the server refuses the request because the format of the request entity is not supported. |
5xx | Server errors |
500 | Internal Server Error: the server could not complete the request because of an unexpected condition. |
501 | Not Implemented: the server does not support the functionality needed to fulfill the request. |
502 | Bad Gateway: the server, acting as a gateway or proxy, received an invalid response from the upstream server. |
503 | Service Unavailable: the server could not process the request because of temporary overload or maintenance. |
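The first digit of a status code gives its class, which is how the table above is grouped. The following is a minimal Python sketch of that grouping, assuming the standard library's `http.HTTPStatus` for the reason phrases. The function name `describe_status` and the class labels are illustrative and do not come from the original article.

```python
from http import HTTPStatus

# Class labels mirror the groups in the table (2xx to 5xx only).
STATUS_CLASSES = {
    2: "success",
    3: "redirect",
    4: "client error",
    5: "server error",
}


def describe_status(code: int) -> str:
    """Return 'code phrase (class)' for an HTTP status code."""
    # Integer division by 100 isolates the leading digit, i.e. the class.
    status_class = STATUS_CLASSES.get(code // 100, "unknown")
    try:
        phrase = HTTPStatus(code).phrase
    except ValueError:
        # Codes the standard library does not define fall back to a generic phrase.
        phrase = "Unknown"
    return f"{code} {phrase} ({status_class})"


if __name__ == "__main__":
    for code in (200, 304, 404, 503):
        print(describe_status(code))
```

When reading crawler entries in a log, the class alone is often enough for a first pass: 2xx and 304 mean the crawler got what it needed, while 4xx and 5xx entries point to pages or server problems worth checking.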
For example:
2004-12-03 07:33:25 61.135.145.208 - *.*.*.* GET /index/119.htm - 304 Baiduspider+(+http://www.baidu.com/search/spider.htm)
This means that Baiduspider crawled the page /index/119.htm at 07:33:25 on 2004-12-03 and found that the page had not been updated (status 304).
Another example:
2004-12-03 07:33:25 61.135.145.208 - *.*.*.* GET /index/120.htm - Googlebot/2.1 (http://www.google.com/bot.html)
This means that Googlebot crawled the page /index/120.htm at 07:33:25 on 2004-12-03, found that the page was new, and fetched it in full. A successful full fetch of this kind is logged with status 200.
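Reading entries like these by hand does not scale, so a small script can pull out the crawler and status code from each line. The sketch below is one possible approach in Python. It assumes a field layout of date, time, client IP, server IP, method, path, status, and user agent, separated by spaces and hyphens as in the Baidu example. Real log formats vary with server configuration, and the names `parse_line` and `crawl_summary` are hypothetical.

```python
import re
from collections import Counter

# Assumed layout: date time client-ip - server-ip METHOD path - status user-agent
LOG_PATTERN = re.compile(
    r"^(?P<date>\d{4}-\d{2}-\d{2}) (?P<time>\d{2}:\d{2}:\d{2}) "
    r"(?P<client_ip>\S+) - (?P<server_ip>\S+) "
    r"(?P<method>[A-Z]+) (?P<path>\S+) - "
    r"(?P<status>\d{3}) (?P<agent>.+)$"
)


def parse_line(line):
    """Return the fields of one log line as a dict, or None if it does not match."""
    match = LOG_PATTERN.match(line.strip())
    if match is None:
        return None
    fields = match.groupdict()
    fields["status"] = int(fields["status"])
    return fields


def crawl_summary(lines, crawler_names=("Baiduspider", "Googlebot")):
    """Count (crawler, status) pairs for lines whose user agent names a crawler."""
    counts = Counter()
    for line in lines:
        fields = parse_line(line)
        if fields is None:
            continue  # skip lines that do not follow the assumed layout
        agent = fields["agent"].lower()
        for name in crawler_names:
            if name.lower() in agent:
                counts[(name, fields["status"])] += 1
    return counts


# The Baidu entry from the article, with the spacing assumed above.
sample = (
    "2004-12-03 07:33:25 61.135.145.208 - *.*.*.* GET /index/119.htm - 304 "
    "Baiduspider+(+http://www.baidu.com/search/spider.htm)"
)

if __name__ == "__main__":
    entry = parse_line(sample)
    print(entry["path"], entry["status"])
    print(crawl_summary([sample]))
```

Run over a full log file, a summary like this shows at a glance which pages each crawler fetched in full, which it found unchanged, and which returned errors.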
Author: anonymous Source: Network