How Web site IIS logs can help with optimization

Source: Internet
Author: User


A website's IIS logs are among its most important resources. From them you can see how search engines are crawling the site, learn about the state of the site itself, and trace where some visitors came from, all without relying on a traffic-statistics script. Access to IIS logs depends on the hosting provider, however: some providers enable logging only when asked, and some do not support it at all, in which case you can download and install code that records similar logs. Log files also take up hosting space, so a site with a small storage quota can suddenly exceed it; it pays to host the site with a better provider. So how does web log analysis help with optimization?

First, understand how often search engine spiders crawl

Spiders are the robots that search engines send out to crawl content. The number of spider visits tells us whether search engines like our site; if they do not, they stop crawling it. Compare the count with the previous week's to see how much spider activity has changed and whether external links or site updates caused the change, then adjust the pages accordingly. Spiders like original content. If the content is simply copied and pasted from elsewhere, the spider may not come back, because it will treat the site as a mirror of another site.
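As a rough illustration of counting spider visits, the sketch below tallies requests per day and per spider from a log in the IIS W3C extended format. The sample log lines, the IP addresses, and the short spider list are made up for the example; a real log would be read from the file your hosting provider supplies, and its `#Fields:` line may list different columns.

```python
from collections import Counter

# Hypothetical sample in IIS W3C extended format (spaces in the
# User-Agent are written as "+" by IIS). IPs are documentation addresses.
SAMPLE_LOG = """\
#Fields: date time cs-method cs-uri-stem c-ip cs(User-Agent) sc-status
2013-05-01 00:00:12 GET /index.html 203.0.113.5 Mozilla/5.0+(compatible;+Baiduspider/2.0) 200
2013-05-01 08:15:40 GET /news/1.html 198.51.100.9 Mozilla/5.0+(compatible;+Googlebot/2.1) 200
2013-05-02 09:30:05 GET /index.html 203.0.113.5 Mozilla/5.0+(compatible;+Baiduspider/2.0) 200
2013-05-02 10:01:00 GET /about.html 192.0.2.10 Mozilla/5.0+(Windows+NT+6.1) 200
"""

SPIDERS = ("Baiduspider", "Googlebot")  # illustrative list, not exhaustive


def parse_iis_log(text):
    """Yield one dict per request, using the #Fields: directive as the header."""
    fields = []
    for line in text.splitlines():
        if line.startswith("#Fields:"):
            fields = line.split()[1:]
        elif line and not line.startswith("#"):
            yield dict(zip(fields, line.split()))


def spider_hits_per_day(text):
    """Count requests per (date, spider name), matched on the User-Agent."""
    hits = Counter()
    for row in parse_iis_log(text):
        agent = row.get("cs(User-Agent)", "").lower()
        for name in SPIDERS:
            if name.lower() in agent:
                hits[(row["date"], name)] += 1
    return hits


daily = spider_hits_per_day(SAMPLE_LOG)
for (day, name), count in sorted(daily.items()):
    print(day, name, count)
```

Running the same count over this week's and last week's logs gives the comparison described above. Matching on the User-Agent alone cannot tell a genuine spider from an impostor.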

Harmful spiders should be blocked by blocking their IP addresses. There are many kinds of spiders; if one day the logs show an unknown spider's IP address and the site is then demoted or de-indexed ("K'd"), deny that IP address access.

Heavy spider traffic increases the load on the server. Frequent spider visits do help the site, but they also consume a lot of resources, so choose a good hosting provider. Otherwise the server may suddenly crash, and the provider may delete your site without notifying you.

Baidu's guidance: learn to identify fake spiders, which exist to steal other sites' data. The most important check is where the IP address traces to: a genuine Baidu spider's IP address traces back to Baidu, and an IP address that traces anywhere else does not belong to the real spider. Some spiders can also get a site demoted or de-indexed, so check carefully.
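One common way to check where a spider's IP address traces to is a reverse DNS lookup followed by a forward lookup, which is not necessarily the exact check the text has in mind. The sketch below assumes that genuine Baidu spider hostnames end in `.baidu.com` or `.baidu.jp`, as Baidu's published guidance has stated; verify that this is still current before relying on it. The function name and the stubbed resolvers are hypothetical and exist only so the example runs without network access.

```python
import socket

# Assumption: hostname suffixes taken from Baidu's published guidance.
BAIDU_SUFFIXES = (".baidu.com", ".baidu.jp")


def is_real_baiduspider(ip, reverse=socket.gethostbyaddr,
                        forward=socket.gethostbyname_ex):
    """Reverse-resolve the IP, check the hostname suffix, then confirm
    that the hostname resolves back to the same IP."""
    try:
        host = reverse(ip)[0]
    except OSError:  # no PTR record or lookup failure
        return False
    if not host.lower().endswith(BAIDU_SUFFIXES):
        return False
    try:
        return ip in forward(host)[2]
    except OSError:
        return False


# Stub resolvers (hypothetical data) standing in for real DNS answers.
def fake_reverse(ip):
    return ("baiduspider-203-0-113-5.crawl.baidu.com", [], [ip])


def fake_forward(host):
    return (host, [], ["203.0.113.5"])


genuine = is_real_baiduspider("203.0.113.5", fake_reverse, fake_forward)
spoofed = is_real_baiduspider("203.0.113.9", fake_reverse, fake_forward)
print(genuine, spoofed)
```

Calling `is_real_baiduspider(ip)` without the last two arguments performs real DNS lookups. The forward confirmation matters because anyone who controls the reverse DNS zone for an IP address can make it claim any hostname.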

Second, how thoroughly the site's pages are crawled

If spiders like to crawl the home page, its snapshot is updated the next day, and with frequent updates new pages are indexed within seconds. If some pages have never been crawled, check whether spiders are being blocked from them. Do not point most of your external links at the home page; build an appropriate number to internal pages as well, or few of them will be indexed. The IIS logs (supplied by the hosting provider) also show what is happening on the site: which pages spiders mainly crawl, which they crawl most often, and which they never crawl. Analyze these together and compare which directories have more pages indexed and which have fewer. Watch for changes over time too: search engines crawl differently in different periods, whether because content was reprinted elsewhere or because of external links.
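A minimal sketch of this kind of analysis follows: it counts how often a spider requested each page, totals the counts per directory, and lists the known pages that were never crawled. The sample log and the `SITE_PAGES` set are hypothetical; in practice the page list might come from a sitemap.

```python
from collections import Counter
import posixpath

# Hypothetical sample in IIS W3C extended format.
SAMPLE_LOG = """\
#Fields: date time cs-method cs-uri-stem c-ip cs(User-Agent) sc-status
2013-05-01 00:00:12 GET /index.html 203.0.113.5 Mozilla/5.0+(compatible;+Baiduspider/2.0) 200
2013-05-01 03:10:00 GET /index.html 203.0.113.5 Mozilla/5.0+(compatible;+Baiduspider/2.0) 200
2013-05-01 03:10:09 GET /products/a.html 203.0.113.5 Mozilla/5.0+(compatible;+Baiduspider/2.0) 200
2013-05-01 09:00:00 GET /about.html 198.51.100.20 Mozilla/5.0+(Windows+NT+6.1) 200
"""

# Hypothetical list of pages the site is known to contain.
SITE_PAGES = {"/index.html", "/about.html", "/products/a.html", "/products/b.html"}


def spider_page_counts(text, spider="Baiduspider"):
    """Count requests per URL path made by the named spider."""
    fields, counts = [], Counter()
    for line in text.splitlines():
        if line.startswith("#Fields:"):
            fields = line.split()[1:]
        elif line and not line.startswith("#"):
            row = dict(zip(fields, line.split()))
            if spider.lower() in row.get("cs(User-Agent)", "").lower():
                counts[row["cs-uri-stem"]] += 1
    return counts


page_counts = spider_page_counts(SAMPLE_LOG)

# Roll the per-page counts up to their parent directory.
dir_counts = Counter()
for page, n in page_counts.items():
    dir_counts[posixpath.dirname(page)] += n

never_crawled = sorted(SITE_PAGES - set(page_counts))

print(page_counts.most_common())
print(dict(dir_counts))
print(never_crawled)
```

Pages that show up in `never_crawled` are the ones to check for crawl restrictions or missing internal links.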

Third, analysis of HTTP status codes

A spider's visit generally leaves an HTTP status code in the log. A 200 response does not mean the page is released into the index immediately: some pages are released after a week, some after a month. But as long as this code is returned, the page will generally be released.

Two questions:

1. Should we make the error page return 404 or 200?

It should return 404. That way the search engine knows the page cannot be accessed. A 200 tells the search engine that the page can still be crawled, and once it finds that a large number of such pages are in fact inaccessible, it will penalize the site, which can lead to demotion or de-indexing. So set up a proper 404 page.

2. If our site is being registered or is still under construction, which status code should we return: 400, 404, 500, or 503?

Return 503. A 503 tells the search engine that the site is temporarily unavailable and will be restored, so it will come back to crawl again. With other status codes the search engine may not return: a 404 means the page no longer exists, so the search engine may conclude that the site is gone and delete it from the index.
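To make the 503 answer concrete, here is a minimal sketch of a maintenance response written as a Python WSGI application. It only illustrates the shape of the response; a site hosted on IIS would configure this in the server instead, and the one-hour `Retry-After` value and the function names are assumptions chosen for the example.

```python
def maintenance_app(environ, start_response):
    """Answer every request with 503 and a hint about when to retry."""
    body = b"Site under maintenance, please check back soon."
    start_response("503 Service Unavailable", [
        ("Content-Type", "text/plain; charset=utf-8"),
        ("Content-Length", str(len(body))),
        ("Retry-After", "3600"),  # assumed value: retry in one hour
    ])
    return [body]


# Call the app directly with a stub start_response to inspect its output.
captured = {}


def fake_start_response(status, headers):
    captured["status"] = status
    captured["headers"] = dict(headers)


response_body = b"".join(maintenance_app({}, fake_start_response))
print(captured["status"], captured["headers"]["Retry-After"])
```

To try it in a browser, the app can be served with `wsgiref.simple_server.make_server` from the standard library.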

If a site has set up a 301 permanent redirect but the logs do not show a 301 return code, check whether the redirect is configured correctly; otherwise the weight will not be transferred to the new domain name. It pays to get everything exactly right.
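The status codes discussed in this section can be checked in bulk. The sketch below, run on a made-up log, tallies the status codes returned to one spider and records which URLs produced anything other than 200, so that missing 301s or unexpected 404s stand out. The sample data and function name are hypothetical.

```python
from collections import Counter, defaultdict

# Hypothetical sample in IIS W3C extended format.
SAMPLE_LOG = """\
#Fields: date time cs-method cs-uri-stem c-ip cs(User-Agent) sc-status
2013-05-01 00:00:12 GET /index.html 203.0.113.5 Mozilla/5.0+(compatible;+Baiduspider/2.0) 200
2013-05-01 00:00:15 GET /old-page.html 203.0.113.5 Mozilla/5.0+(compatible;+Baiduspider/2.0) 301
2013-05-01 00:00:19 GET /missing.html 203.0.113.5 Mozilla/5.0+(compatible;+Baiduspider/2.0) 404
2013-05-02 00:00:21 GET /missing.html 203.0.113.5 Mozilla/5.0+(compatible;+Baiduspider/2.0) 404
2013-05-02 09:00:00 GET /index.html 198.51.100.20 Mozilla/5.0+(Windows+NT+6.1) 200
"""


def spider_status_report(text, spider="Baiduspider"):
    """Return (status totals, URLs grouped by non-200 status) for one spider."""
    fields = []
    totals = Counter()
    urls_by_status = defaultdict(set)
    for line in text.splitlines():
        if line.startswith("#Fields:"):
            fields = line.split()[1:]
        elif line and not line.startswith("#"):
            row = dict(zip(fields, line.split()))
            if spider.lower() not in row.get("cs(User-Agent)", "").lower():
                continue
            status = row["sc-status"]
            totals[status] += 1
            if status != "200":
                urls_by_status[status].add(row["cs-uri-stem"])
    return totals, urls_by_status


totals, urls_by_status = spider_status_report(SAMPLE_LOG)
for status in sorted(totals):
    print(status, totals[status], sorted(urls_by_status.get(status, [])))
```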

Fourth, professional log analysis tools

These let us see page views (PV) and the volume of malicious access.

PV reflects how well the site retains users. If the bounce rate is too high, because the site will not open or the content is not worth reading, the site will not escape a drop in the rankings. Also look at which pages get the most visits, so you can find out what users need and improve accordingly. If the site will not open or opens very slowly, check the logs for a large number of requests from unknown IP addresses: the site may be under attack. Then we can only put up with it or report it, or else change servers; if a new server still does not solve the problem, it is best to protect your interests by legal means.
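As a rough sketch of the check for unusual IP traffic, the code below counts requests per client IP address in a made-up log and flags any address at or above a threshold. The threshold of 3 fits only this tiny sample; a real value depends on the site's normal traffic and is an assumption you would have to tune.

```python
from collections import Counter

# Hypothetical sample in IIS W3C extended format.
SAMPLE_LOG = """\
#Fields: date time cs-method cs-uri-stem c-ip cs(User-Agent) sc-status
2013-05-01 10:00:00 GET /index.html 198.51.100.77 Mozilla/5.0+(Windows+NT+6.1) 200
2013-05-01 10:00:00 GET /index.html 198.51.100.77 Mozilla/5.0+(Windows+NT+6.1) 200
2013-05-01 10:00:01 GET /index.html 198.51.100.77 Mozilla/5.0+(Windows+NT+6.1) 200
2013-05-01 10:00:01 GET /login.asp 198.51.100.77 Mozilla/5.0+(Windows+NT+6.1) 200
2013-05-01 10:05:00 GET /index.html 203.0.113.5 Mozilla/5.0+(compatible;+Baiduspider/2.0) 200
2013-05-01 10:07:30 GET /about.html 192.0.2.10 Mozilla/5.0+(Windows+NT+6.1) 200
"""


def requests_per_ip(text):
    """Count requests per client IP (the c-ip field)."""
    fields, counts = [], Counter()
    for line in text.splitlines():
        if line.startswith("#Fields:"):
            fields = line.split()[1:]
        elif line and not line.startswith("#"):
            row = dict(zip(fields, line.split()))
            counts[row["c-ip"]] += 1
    return counts


def suspicious_ips(text, threshold):
    """Return the IPs whose request count reaches the threshold."""
    return {ip: n for ip, n in requests_per_ip(text).items() if n >= threshold}


suspects = suspicious_ips(SAMPLE_LOG, threshold=3)
print(suspects)
```

A high count alone does not prove an attack, since a busy search engine spider can look the same, so the flagged addresses are only a starting point for a closer look.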

(A site's quality directly determines whether users click.)

If users do not click through your site, the site is neither persuasive nor attractive, and it is not a good site. Leaving aside borderline and illegal sites: if your product descriptions are not detailed enough, your pictures are not clear enough, and your customer service is not helpful, who will stay on your site? Getting the user experience right is a discipline in itself.

Log analysis tools:

(1). AWStats

(2). Webalizer

Both can also analyze the site's status codes.

Website IIS logs are a great help for optimization, so do not ignore any of the details. They show not only whether your site is useful to users but also whether it suits search engines' preferences, and the warning signs of demotion or de-indexing can be read from the status codes in the logs. If you reprint this article, please credit www.bole110.com as the source. Thank you for your cooperation!
