Case Analysis: A Site De-indexed ("K'd") by Baidu Because Its 404 Page Returned the Wrong Status Code

Source: Internet
Author: User


Our site came through the August wave of Baidu de-indexing (the "K station" storm) unscathed, but this month our luck ran out and the site was finally K'd, that is, dropped from Baidu's index.

Baidu's algorithm adjustments over the last two months have hurt many high-quality sites. We still believe this is only a temporary adjustment and that Baidu will treat every site seriously: a site that keeps providing high-quality content recognized by its users will sooner or later be restored and rank better.

After studying the data provided by the webmaster tools and analyzing the Baidu spider access logs, we found the main cause of the K event: a large-scale site revision lost many files and produced a large number of dead links, and the error page that handles those dead links did not return a proper 404 status code. The secondary cause is external links planted on some high-weight websites that had been tampered with by hackers, a legacy of compromises that happened long ago.

The new server system adopted with the revision gives the site greater capacity, but we were unfamiliar with some of its functions. As a result, once configuration was complete, requesting a nonexistent URL displayed the 404 page as expected but returned a 200 status code.

Detailed technical analysis:

1. The K event dates back to October 17 (last Wednesday). According to the crawl pressure feedback tool on the Baidu webmaster platform, Baidu's crawl volume on October 17 was 0, which marks the start of the de-indexing. The next day the spider's crawl volume recovered and then rose significantly. The daily crawl frequency now exceeds 6,000, which is also an early sign of recovery.

  

2. Baidu Statistics shows that the site had in fact already been K'd on October 18: the index volume in the Baidu Statistics back end fell from 50,234 pages to fewer than 10. At that time, however, a site: query still showed 37,300 indexed pages, and keyword rankings were still normal.

  

3. The site's historical data in the webmaster tools shows that from October 19 the number of indexed pages displayed on Baidu's front end began to fall rapidly, although pages were still being indexed, including three newly indexed ones. Over the following 3 days the count dropped quickly, and by October 21 only 188 pages remained indexed.

  

4. The site was officially K'd this Monday, October 22. The indexed count fell to 0 that day, although a site: query still intermittently returned some indexed pages.

We then analyzed the site's log files and found that the spider's visits on October 22 were very unusual: the pages it crawled were all error pages, and some of the requested URLs contained sensitive keywords related to game plug-ins.

  

The access log shows that the paths requested by the Baidu spider were all pages from before the site revision. Because we kept only part of the generated static pages, most of those paths were no longer accessible. More puzzling, the spider also requested directories and files that never existed on the site, such as game downloads.
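The log review described above can be sketched in a few lines of Python. This is only an illustration: it assumes the server writes the common "combined" access log format, and the sample lines, paths, and IP addresses are invented for the example rather than taken from our logs.

```python
import re
from collections import Counter

# Assumption: "combined" log format. Adjust the pattern to your server's format.
LOG_PATTERN = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) [^"]*" '
    r'(?P<status>\d{3}) \S+ "[^"]*" "(?P<agent>[^"]*)"'
)


def baidu_hits(lines):
    """Yield (path, status) for requests whose User-Agent mentions Baiduspider."""
    for line in lines:
        match = LOG_PATTERN.match(line)
        if match and "baiduspider" in match.group("agent").lower():
            yield match.group("path"), int(match.group("status"))


def summarize(lines):
    """Count spider requests per status code and list the paths answered with 200."""
    hits = list(baidu_hits(lines))
    by_status = Counter(status for _, status in hits)
    ok_paths = sorted({path for path, status in hits if status == 200})
    return by_status, ok_paths


# Hypothetical sample lines (documentation IP range, made-up paths).
SAMPLE = [
    '192.0.2.10 - - [01/Jan/2000:10:00:00 +0000] "GET /old/page1.html HTTP/1.1" '
    '200 512 "-" "Mozilla/5.0 (compatible; Baiduspider/2.0)"',
    '192.0.2.10 - - [01/Jan/2000:10:00:05 +0000] "GET /download/game-plugin.zip HTTP/1.1" '
    '200 512 "-" "Mozilla/5.0 (compatible; Baiduspider/2.0)"',
    '192.0.2.11 - - [01/Jan/2000:10:00:09 +0000] "GET /missing.html HTTP/1.1" '
    '404 512 "-" "Mozilla/5.0 (compatible; Baiduspider/2.0)"',
    '192.0.2.50 - - [01/Jan/2000:10:01:00 +0000] "GET /index.html HTTP/1.1" '
    '200 2048 "-" "Mozilla/5.0 (Windows NT 10.0)"',
]

by_status, ok_paths = summarize(SAMPLE)
print(dict(by_status))
print(ok_paths)
```

Any path in `ok_paths` that does not exist on disk is a candidate "soft 404": a missing page that answers 200, which is the symptom described in this article.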

When we opened these paths in a browser, the 404 error page was displayed, and the paths could not be found in the server's file system.

These game-keyword URLs should not exist, and we could not find any external links pointing to them. The only plausible explanation is that someone manipulating Baidu keyword rankings, or a hacked high-weight site, carried anchor-text links pointing to these paths, which led the spider to our site. We can do nothing about this external problem. We can only remind fellow webmasters to pay more attention to site security.

We carefully checked the HTTP status codes returned to a simulated client and found the core of the problem: these invalid requests should have returned a 404 code so that Baidu would filter them out directly, but during the spider's crawl they returned 200 instead.
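A check of this kind can be reproduced with Python's standard library. The sketch below is not the tool we used. The User-Agent string is the commonly published Baiduspider identifier and is included here as an assumption, and `fetch_status` and `diagnose` are hypothetical helper names.

```python
import urllib.error
import urllib.request

# Assumption: a commonly published Baiduspider User-Agent string.
SPIDER_UA = (
    "Mozilla/5.0 (compatible; Baiduspider/2.0; "
    "+http://www.baidu.com/search/spider.html)"
)


def fetch_status(url, user_agent=SPIDER_UA, timeout=10):
    """Return the final HTTP status code for url (redirects are followed)."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as error:
        # urllib raises for 4xx/5xx responses; the code is the value we want.
        return error.code


def diagnose(status):
    """Interpret the status code returned for a URL known NOT to exist."""
    if status in (404, 410):
        return "ok: missing page is reported as missing"
    if 200 <= status < 300:
        return "soft 404: error page served with a success code"
    return "unexpected: status %d needs a manual check" % status
```

To use it, request a path that certainly does not exist on your own site, for example `diagnose(fetch_status("http://www.example.com/no-such-page-12345"))`, where the URL is a placeholder. Because redirects are followed, a missing page that redirects to an error page answering 200 is also reported as a soft 404.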

We then examined the server in detail and confirmed that our 404 error handling was misconfigured: any invalid request received the error page, but with a 200 status code. We corrected this immediately.
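The exact fix depends on the server software, which differs from site to site, so the following is only a generic illustration of the correct behavior using Python's built-in `http.server`. The page table, error page, and the `resolve` and `serve` names are hypothetical. The point is that the custom error page is sent together with a 404 status line, not a 200.

```python
from http.server import BaseHTTPRequestHandler, HTTPServer

# Hypothetical site content, for illustration only.
PAGES = {"/": b"<h1>Home</h1>"}
ERROR_PAGE = b"<h1>Sorry, this page does not exist.</h1>"


def resolve(path):
    """Return (status, body): a friendly error page still carries status 404."""
    body = PAGES.get(path)
    if body is None:
        return 404, ERROR_PAGE
    return 200, body


class SiteHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        status, body = resolve(self.path)
        self.send_response(status)  # the status line is what the spider reads
        self.send_header("Content-Type", "text/html; charset=utf-8")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


def serve(host="127.0.0.1", port=8000):
    """Run the demo server until interrupted."""
    HTTPServer((host, port), SiteHandler).serve_forever()
```

Calling `serve()` starts the demo locally. A misconfiguration like ours corresponds to returning `200, ERROR_PAGE` for unknown paths, which is also what happens when a server answers a missing page by redirecting to an error page that itself returns 200.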

After the change, from the next day on every invalid request from the Baidu spider received a 404 status code. We expect Baidu to rescan all the files soon, gradually remove the missing ones from its database, index the site's normal pages, and gradually release the site from the sandbox.

  

Over the next few days the Baidu spider visited and crawled the site heavily. Most of the visits, however, came from the 123.125.68 segment, which webmasters regard as Baidu's demotion or low-weight spider, and most of what it crawled were error pages. Only occasionally was a normal page crawled.

Today a high-weight spider from the 220.181.108 segment finally crawled the home page. According to other webmasters' experience, pages are released within a few days of such a visit. We do not know whether this will undo the K, but we look forward to a recovery soon.
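To follow this trend over several days, spider visits can be tallied by IP segment. The sketch below uses Python's `ipaddress` module. The labels attached to the two segments reflect the webmaster folklore quoted in this article, not anything documented by Baidu, and `segment_of` and `tally` are hypothetical helper names.

```python
import ipaddress
from collections import Counter

# Labels follow the webmaster lore cited in this article; they are assumptions.
SEGMENTS = [
    (ipaddress.ip_network("123.125.68.0/24"), "low-weight (per webmaster lore)"),
    (ipaddress.ip_network("220.181.108.0/24"), "high-weight (per webmaster lore)"),
]


def segment_of(ip):
    """Map a visitor IP to the label of the segment that contains it."""
    address = ipaddress.ip_address(ip)
    for network, label in SEGMENTS:
        if address in network:
            return label
    return "other"


def tally(ips):
    """Count visits per segment label."""
    return Counter(segment_of(ip) for ip in ips)
```

For example, `tally(["123.125.68.7", "123.125.68.9", "220.181.108.80"])` counts two visits under the low-weight label and one under the high-weight label. The IPs would come from the spider lines of the access log.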

This article is an original work by Gold Novel Net (www.hjxs.com).

Finally, a reminder: when running a site, pay close attention to the server configuration, especially the status code returned by the 404 error page. A small oversight can have the serious consequence of being K'd by Baidu.
