On June 25 I published an article titled "Skillfully using 301 redirects to turn 404 errors into external links for your site." It discussed how a 301 redirect can turn a wrong URL that arrives from an external link into an accessible URL, so that the link passes weight.
Today on A5 I saw an article titled "On the harm of using 301 redirects to turn 404 pages into your own external links," which rebuts the view in my previous article. I think this is a good thing: the SEO industry should have a questioning spirit and the ability to think independently. After reading the article carefully, however, I found that its author had misunderstood what I meant. So I am writing another article to clarify my point of view and to explain what a 301 redirect does. First, let me restate the central idea of "Using 301 redirects to turn 404 errors into external links for your site":
The article is about taking a URL that returns a 404 error and 301-redirecting it back to the original URL. To be clear, nowhere in the article did I suggest redirecting to the home page or to any other page. The example in the article concerns a link from an external site (site B) to one's own site (site A). When that link is created, the URL may be misspelled, added incorrectly, or even written wrong on purpose. The article is not about 404 errors caused by site A itself.
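To make the idea concrete, here is a minimal sketch of how a wrong inbound URL could be matched back to the original page rather than to the home page. It is only an illustration: the path list is hypothetical, and fuzzy matching with Python's standard difflib module is my own choice for the sketch, not a method described in the earlier article.

```python
import difflib
from typing import Optional, Sequence

# Hypothetical list of paths that really exist on site A.
KNOWN_PATHS = ["/", "/about.html", "/rich-snippets.html"]


def guess_original_path(
    bad_path: str,
    known_paths: Sequence[str] = KNOWN_PATHS,
    cutoff: float = 0.8,
) -> Optional[str]:
    """Return the existing path most similar to bad_path, or None.

    A high cutoff keeps unrelated URLs from being matched, so they
    still get a plain 404 instead of a redirect to some other page.
    """
    matches = difflib.get_close_matches(bad_path, known_paths, n=1, cutoff=cutoff)
    return matches[0] if matches else None
```

With these assumed paths, a mangled link such as /rich-snippets.htmlGFQ resolves to /rich-snippets.html, while an unrelated path such as /zzz finds no match and is never sent to the home page.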
The original article is still available, and friends who have doubts can read it again more carefully. Below I respond to the friend's points (the text in blue is the friend's opinion; it is the first paragraph under each heading below):
Rebuttal to the first paragraph
The original author says that an external link pointing to a 404 error is caused by the external site. That is understandable, but whether a 404 error occurs is also determined by the site's own program, so there is no escaping it. For example, on A5 many 404 pages with suffixes of this kind can be produced simply by appending 1.html, 2.html to the end of a URL. But if someone deliberately links to such a page from outside, that still brings a link to the site; at most it produces a 404 page, and nothing more.
A 404 error on a site is not necessarily caused by the site's own program. A spider can reach your site through a link on an external site (for example, site B), and, just as with a link inside site A, a wrong URL leads to a nonexistent page and therefore a 404 error. Spiders do not care whether the link lives inside or outside your site: whenever they follow a URL and find that the page does not exist, they record a 404 error.
We can see this clearly under "Health" > "Crawl Errors" > "Not found" in Google Webmaster Tools. There, the source of a 404 error is one of two kinds: "In Sitemap" (internal cause) and "Domains linking to your site's pages" (external cause).
As the name suggests, "Domains linking to your site's pages" refers to URLs that point from site B to site A.
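The internal/external split can be sketched in a few lines. This is only an illustration of the distinction, not how Google classifies errors; the function name and the sample host names are hypothetical.

```python
from urllib.parse import urlparse


def classify_404_source(referrer_url: str, own_host: str) -> str:
    """Label the page that links to a missing URL as internal or external.

    referrer_url is the page containing the broken link; own_host is
    site A's bare domain (both are assumptions made for this sketch).
    """
    host = (urlparse(referrer_url).hostname or "").lower()
    own = own_host.lower()
    if not host:
        return "unknown"  # no usable referrer, so the source cannot be decided
    if host == own or host.endswith("." + own):
        return "internal"  # the broken link is on site A itself: fix or remove it
    return "external"  # the broken link is on site B: a 301 candidate
```

For example, a referrer of http://www.a-site.example/list.html with own_host a-site.example counts as internal, while http://b-site.example/links.html counts as external.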
Rebuttal to the second paragraph
As for how to hold on to weight, the original author wants to take the weight of the external link straight back rather than let it go. Here I have my own view. Take the external link "Http://www.xxxxx.com/rich-snippets.htmlGFQ": it points to a 404 page. If you 301 such pages away, the situation is the same as a site that has a large number of 404 pages and then 301-redirects them all to a single page. So if your site has 404 pages, will you 301 all of them to the home page to avoid losing that weight? This completely fails to meet the requirements of search engines. If you want to understand this clearly, search Baidu for "the harm of 301-redirecting 404 pages to the home page."
First, search engines clearly distinguish between a site's "own actions" and "external behavior." Take link building: internal links and external links carry different weight, as everybody knows. The core of the idea is that external links are beyond the webmaster's control, while internal links are set up by the webmaster. Admittedly, as search engines developed, the factor of "external links that the webmaster can control" appeared (the usual external link building). But whether a link is controllable or not, one thing is clear: nobody who is able to publish the correct URL on someone else's site would publish a wrong one instead, leaving users unable to reach their own site or facing a "page does not exist" message on it.
Second, as for 301 to the original page versus 301 to the home page, I will not repeat myself; everyone can understand what the original article meant. What I want to discuss here is how search engines identify the original source of a piece of content:
1. Where the search engine first sees the content
2. The trust level of the domain among the many copies of the same content
3. Where the most links point (internal links in the original text)
4. Copies link back to the original source (copyright link)
Because of the second signal, content that original authors publish often fails to rank well for them once it is also posted or reproduced on other sites. Many authors have complained about this. But we can use signals 1, 3, and 4 to correct the error.
Baidu does not do well in this respect, but Google can already identify the original source quickly and accurately, thanks to the other three signals above. The signal "copies link back to the original source" is also one of the purposes I stated in "Using 301 redirects to turn 404 errors into external links for your site." The other purpose, as we have seen, is to pass weight.
Finally, reasonably 301-redirecting a wrong URL that users cannot access to the correct URL also helps the user experience. We can see this in the "Crawl Errors" text of Google Webmaster Tools:
Googlebot could not crawl this URL because it points to a page that does not exist. Generally, 404s do not affect your site's ranking in search results, but you can use them to improve the user experience.
There are only two ways to deal with such a 404 error: block the URL through robots.txt, or use a 301 redirect. I do not think blocking improves the user experience. robots.txt can only improve the spider's experience, because a user who clicks the wrong URL still lands on a nonexistent page and sees a 404 error.
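The difference can be shown with a small sketch that uses Python's standard urllib.robotparser. The robots.txt content, the host www.example.com, and the set of existing paths are all assumptions made up for the illustration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for site A that blocks the wrong URL.
ROBOTS_TXT = """\
User-agent: *
Disallow: /rich-snippets.htmlGFQ
"""

# Hypothetical set of pages that really exist on site A.
EXISTING_PATHS = {"/rich-snippets.html"}

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def crawler_may_fetch(path: str) -> bool:
    """What a well-behaved spider is allowed to request."""
    return parser.can_fetch("Googlebot", "http://www.example.com" + path)


def status_for_visitor(path: str) -> int:
    """What a human visitor gets; robots.txt plays no part here."""
    return 200 if path in EXISTING_PATHS else 404
```

Under these assumptions the spider no longer requests /rich-snippets.htmlGFQ, but a visitor who clicks the same link on site B still receives a 404. Only a redirect changes what the visitor sees.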
Rebuttal to the third paragraph
To quote directly: "If you return a code other than 404 or 410 for a nonexistent page (or redirect users to another page, such as the home page, instead of returning 404), you may run into problems. First, this tells search engines that there is a real page at that URL. As a result, the URL may be crawled and its content indexed. Because Googlebot spends a lot of time on nonexistent pages, the URLs of your real pages may not be discovered as quickly or visited as frequently, which affects how much of your site's content is crawled (also, you certainly do not want your site to show up often for the search query [File not found])." That is the quotation about 404 pages. If you do not follow this requirement and keep redirecting error pages, your site may end up with a large number of pages that share the same title, the same description, and the same content: the story of different URLs with identical content. As for what happens after that, everyone can search Baidu or Google and find out.
Since the rebutting friend cites Google's webmaster help text, he should not forget to excerpt another passage:
Generally, 404 errors do not affect your site's ranking in Google, so you can safely ignore them. These errors are usually caused by typos, by misconfiguration (for example, links generated automatically by a content management system), or by Google's increased efforts to recognize and crawl links in embedded content such as JavaScript.
To see where an invalid link comes from, click the relevant URL. In the error dialog, click the "Linked from the following pages" tab. If the links come from your own site, fix or remove them. If they come from external sites, you can use this data to improve your site's user experience. For example, if someone meant to link to your site but typed the wrong address, a legitimate but misspelled URL will appear (such as www.example.com/awesome spelled as www.example.com/awsome). Instead of returning a 404 error, you can 301-redirect the misspelled URL to the correct URL and capture the traffic that link was meant to bring. You can also make sure that, once users have landed on your 404 page, you help them find what they were looking for rather than just showing "404 Not Found." However, we recommend taking this action only if the broken link generates significant traffic.
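The advice in this passage can be sketched as a small decision function: redirect a known misspelling to its correct URL only when the broken link brings enough traffic, and otherwise keep returning 404. The redirect table, the hit counts, and the threshold of 50 are hypothetical values chosen for the illustration; Google's text gives no number.

```python
from typing import Dict, Optional, Tuple

# Hypothetical mapping from known misspelled paths to the correct ones.
REDIRECTS: Dict[str, str] = {"/awsome": "/awesome"}

# Hypothetical set of pages that really exist.
EXISTING_PATHS = {"/awesome"}

# Assumed threshold for "significant traffic".
MIN_HITS = 50


def respond(path: str, hits_by_path: Dict[str, int]) -> Tuple[int, Optional[str]]:
    """Return (status code, redirect target or None) for a requested path."""
    if path in EXISTING_PATHS:
        return 200, None
    target = REDIRECTS.get(path)
    if target is not None and hits_by_path.get(path, 0) >= MIN_HITS:
        return 301, target  # send visitors and link weight to the intended page
    return 404, None  # unknown or low-traffic errors stay honest 404s
```

Note that the redirect target is always the page the link was meant for, never the home page, so the "different URLs, same content" problem described in the friend's quotation does not arise.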
Source link: https://support.google.com/webmasters/bin/answer.py?hl=zh-Hans&answer=2409439
Unfortunately, this friend saw only the first passage and not the second. In SEO work, official information is very important. Many details are hidden in it, and it takes a lot of time to read and understand it carefully.
In fact, many of the settings and explanatory texts in Google Webmaster Tools are there for a reason; it is just that some of us SEOs are unwilling to understand them. The separation of internal and external causes of 404 errors in the "Crawl Errors" section, for example, is there for a reason too, not something done idly.
Summary: As SEOs, we have to learn a great deal while developing our own ideas and our own ways of analyzing problems. But we need to make sure that what we learn is current, not stale. Otherwise our thinking is easily misled, with bad results.
Debating viewpoints is also a very important part of SEO work. No one can say for sure that their understanding is correct. We can only use officially disclosed information and the conclusions of our own data analysis to support our ideas and theories.
This article is original work by Yangfan at Yang SEO. When reprinting, please keep the link: http://www.seoyangs.com/404-301-original-page.html