What is URL normalization? SEO friends should be more clear, referring to the search engine to pick the most appropriate URL as a real (normalized) URL process.
First, why does the non-canonical URL appear?
Http://365daohang.com
Http://www.365daohang.com
Http://365daohang.com/index.html
Http://www.365daohang.com/index.html
The above URLs refer to the same file: Home
Technically speaking, these URLs are different URLs, the search engine also really treats him as a different URL, although these URLs return the same files, that is, the home page. Technically, however, the host can return different content to these URLs. Then, in addition to the figure with and without www caused, and whether the end with index.html suffix caused by the non-canonical URL, in fact, there are several reasons for the cause. For example:
①: The reason for the website program, many CMS systems often appear an article can be accessed through several different URLs.
②:url static settings There are errors, multiple static URLs in the same article can be accessed.
③:url static and dynamic URLs coexist, both have links and can also be accessed.
④: The directory behind the website with and without a slash. Different URLs, but actually a page.
Http://www.365daohang.com
http://www.365daohang.com/
⑤: Encrypted URLs: URLs exist at the same time, but can be accessed.
Http://www.365daohang.com
Https://www.365daohang.com
There are port numbers in the ⑥:url:
http://www.365daohang.com:80/
Http://www.365daohang.com
⑦: Tracking code. There are people who like to do network promotion, used to follow the URL followed by code:
http://www.365daohang.com/?affid=100
Second: Secondly, the appearance of the Web site does not standardize the website will bring any problems?
Web site has a number of non-canonical URLs will be indexed and ranked search engines to bring a lot of trouble, this is no doubt. However, there are a lot of webmasters on this site is not a very heavy standard. For example, to prospective customers to see the site, found that many of the URLs of the site did not do URL normalization. When asked why they do not do URL normalization? Customer Answer: What is the URL normalization, how to do, will be written in the diagnostic recommendations? On the spot on the drunk ... There is also a part of the SEO diagnostic customer, the process of the diagnosis found that the URL of the site is not standardized problem is very serious. Here, also hope that these parts of the webmaster can read this article after learning to standardize knowledge points. We went on to read .....
Why does the non-canonical URL bring a lot of trouble to search engines and rankings? In order to let the webmaster thoroughly understand, we give an example, such as: The site home is fixed and only one, but a lot of webmaster in the link back to the page when the URL is not unique, but a will connect to http:// Www.365daohang.com, one will connect to http://www.365daohang.com/index.html.
Generally speaking, although the user will not cause any trouble, but because these URLs are the same file, will be indirect to search engine confusion, search engine will think: In the end which is the real home page? Which URL should be returned as the home page? Here's the point: if different versions of URLs appear on the site at the same time, it will likely be indexed by search engines. After being included, the consequences can be imagined. At this time, the search engine in the calculation of the rankings must find the so-called standardized URLs, which is the search engine that the most appropriate version of the URL.
The problem caused by non-normalization is simply the following:
①: There are multiple URLs on the site, which will spread the page weights, not conducive to ranking.
②: The canonical URL of search engine judgment is not the website that Stationmaster wants. Webmaster want is not with suffix, search engine to bring the suffix to the included:
③: If the URL normalization problem is too serious, it may also affect ingest. Because: a weight is not very high domain name, can be included in the total number of pages and spider total crawl time is limited, and the search engine to spend resources in the collection of non-standard web site, to the unique content of the resources are becoming less.
④: Too many duplicate pages, the search engine may think there is suspicion of cheating.
⑤: For search engines, wasting resources and wasting broadband.
Third, finally, how to solve the problem of Web site normalization
about how to solve the problem of website normalization, perhaps this is the focus of the webmaster is also the core content. So there are a number of ways to resolve URL normalization issues, such as what we'll say next:
①: Now the Enterprise, personal webmaster with the most programs is CMS, then you have to determine whether you use this CMS system can only produce a standardized URL, whether or not static, such as Dede, Imperial cms.
②: All internal chains should be consistent, pointing to the normalized URLs. For example: Take with and do not take the WWW as an example, to determine a version of the normalized URL, the site's internal links to use this version, so that the search engine will understand which is the webmaster hope that the site normalization website. And from the user experience point of view: the user usually the first choice is to take the WWW version of the normalized URL.
③:301 steering. This is the most commonly used is the most common method, the webmaster can be 301 to turn the non-normalized URL all to the normalized URL.
④:canonical tags. At present is also a webmaster use more of a kind of, and Baidu is also full support this label.
⑤: Make an XML map, all using normalized URLs, and then submit to the search engine.
Although many methods, but many methods have limitations, such as: some sites because of the lack of technology or immature, resulting in 301 can not be achieved. Another example: Many CMS systems are often unable to be controlled by themselves and so on. Well, here are mainly for 301 and canonical to do the specific instructions, because these two standardized way is the most common means of webmaster, but also Baidu most agree. Let's continue reading .....
URL Normalization of the 301 turn:
Previously wrote a 301 turn to a comprehensive analysis of the article, the webmaster in the reading of this small paragraph, if you want to know more about the 301 turn, you can click the address: http://www.admin5.com/article/20131212/532417.shtml into read more. So, let's continue reading ....
①: What is a 301 turn?
301 Turn, also known as 301 redirect, 301 Jump, is a user or spider to the site server to make an access request, the server returns the HTTP data stream header information part of the status code, indicating that this site is permanently transferred to another address.
In addition, there are other methods of Web site steering, such as: 302 steering, JavaScript steering, PHP/ASP/CGI program steering, etc. Here is a key point: In addition to the 301 turn, the other methods are commonly used cheat, although the method itself is not wrong, but the number of cheaters used, search engines on the suspicious turn is very sensitive and so on. So, the other way is to use less for the better.
②:301 Steering Transfer weights
For example: Page A with 301 redirect to Page B, the search engine can be sure that page a permanently change the address, or actually does not exist, the search engine will take page B as the only effective target. And, more importantly: page a accumulates the weight of pages that will be passed to page B.
For example: http://www.365daohang.com/is the selected normalized URL, the following several URLs are done 301 to the selected normalized URL, so that the search engine knows it is a normalized URL, and will be the weight of the three URLs are transferred to the normalized URL.
Http://www.365daohang.com
Http://365daohang.com/index.html
Http://www.365daohang.com/indexl.html
There may be a webmaster will ask: 301 How long can the shift take effect? Generally speaking, in the Baidu Webmaster Tool Revision tool to make the rules submitted, about a week will be effective.
③: How to do 301 turn?
about how to do 301 turn, here is recommended to refer to this article: (http://www.admin5.com/article/20131212/532417.shtml) This article, there is a detailed 301 steering operation method, fully suitable for the personal webmaster and corporate webmaster. Because the text is too long, it is not written here.
URL Normalization of the canonical label (refer to the Baidu Webmaster platform to give the standard):
What is the role of ①:canonical tags?
For a set of identical or highly similar pages, by using the canonical tag you can tell the search engine which page is a canonical page, be able to standardize the URL and avoid multiple pages of the same or similar content in the search results, to help solve the problem of duplication of content, Avoid the site of the same content pages of repeated display and weight of the dispersion, improve the weight of the standard page, optimize the ranking of the page.
②: How to specify canonical URLs with canonical tags?
You can specify a canonical URL by adding a rel= "canonical" link in the section of each non-canonical version of the HTML page.
For example, to specify a canonical link to a Web page http://www.365daohang.com/product.php?id=15786, you need to create the element as follows:
<link rel= "canonical" Href= "http://www.365daohang.com/product.php?id=15786"/>
You can then copy the above link to the section of all non-canonical page versions of a page (for example, http://www.365daohang.com/product.php?id=15786&active=1) to complete the setup.
③: Examples of situations where you can set up a canonical Web page:
For example, a community post may cause a Web page with the same content as the top, highlight color, etc. to produce a different link, the search engine will only select one of the links to index, such as the following two links are different, the content of the page is identical:
http://www.365daohang.com/forum.php?mod=viewthread&tid=17868770&page=1#pid115642474
Http://www.365daohang.com/thread-17868770-1-1.html
For example, the list page of a commodity, sorted by price or preferential order, but the content of the page is very similar:
Http://mall.leho.com/pr-list?locid=75fb2a357d38397c5e1e75fa&cid=5e1e02f950a4101fb27571ee&order= Discount
Http://mall.leho.com/pr-list?order=price_asc&locid=75fb2a357d38397c5e1e75fa&cid= 5e1e02f950a4101fb27571ee
Example three, the site has a number of pages displayed for the same model of the product, but the color of each page product map, the other content is almost identical, at this time can also be set rel= "canonical", will be the most popular color of the product Web page set specifications page, recommended Baidu has priority to display in the search results.
④: Will Baidu fully abide by the rel= "canonical" Label?
The page added the label, on behalf of the webmaster to Baidu recommend a webpage as the most standard version of the Web page, Baidu will also be based on the recommendation of the label and the system algorithm to select the most appropriate page to display it in the search results. Baidu will be based on the actual content of the Web page to consider canonical tags recommended pages, but does not guarantee full compliance with the label. To guarantee the effect of the label, make sure that only one canonical label is on a page.
⑤: Is this link relative or absolute?
Rel= "canonical" can be used with relative or absolute links, but it is recommended that you use absolute links to minimize any confusion or problems that may arise.
Can the ⑥:rel= "canonical" label be used to suggest canonical URLs in different domain names?
If the site needs to replace the domain name, and the server used cannot create the server end multiplicity directed URL, you can use the rel= "canonical" link element to specify the site you want Baidu domain.
Written at the end:
Above by 365 navigation SEO original editor. Regarding the website normalization, this article gives the most detailed text explanation, hoped that the personal and the enterprise and so on the stationmaster in after reading this article, can more grasps the website site normalization Knowledge Point and the operation method. Of course, if you feel that this article has helped you, please do not hesitate to share it, spread it out.
Full-scale analysis of website normalization optimization