The effect of a GBK or UTF-8 charset on SEO

Source: Internet
Author: User
Keywords SEO


I saw someone ask how GBK and UTF-8 affect SEO, so here is my personal view.

If the site is aimed at Chinese readers, I recommend GBK, for the following reasons:

1. GBK represents a Chinese character in two bytes, while UTF-8 uses three. Counting bytes per Chinese character, GBK saves about a third of the space relative to UTF-8 (put the other way, UTF-8 needs 50% more space than GBK).

2. The GBK versions of current open source programs are relatively mature.

3. When a spider crawls a page, it reads the value of the charset attribute. If the value is GBK, the spider can essentially conclude that this is a Chinese-language site without examining the content that follows. If the value is UTF-8, it has to judge further, for example by checking how many characters in the full text fall within the range UTF-8 uses for Chinese characters.
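Points 1 and 3 can be illustrated with a minimal Python sketch. The function names `encoded_sizes` and `cjk_ratio` and the sample string are my own assumptions for illustration. The ratio check is only one plausible way a spider might estimate whether a UTF-8 page is Chinese, not a description of how any real search engine works.

```python
def encoded_sizes(text: str) -> dict:
    """Return the byte length of `text` under GBK and under UTF-8."""
    return {
        "gbk": len(text.encode("gbk")),
        "utf-8": len(text.encode("utf-8")),
    }


def cjk_ratio(text: str) -> float:
    """Fraction of characters in the CJK Unified Ideographs block (U+4E00..U+9FFF).

    Hypothetical heuristic: a high ratio suggests a Chinese-language page.
    """
    if not text:
        return 0.0
    cjk = sum(1 for ch in text if "\u4e00" <= ch <= "\u9fff")
    return cjk / len(text)


sample = "搜索引擎优化"  # six Chinese characters
print(encoded_sizes(sample))  # {'gbk': 12, 'utf-8': 18}
print(cjk_ratio(sample))      # 1.0
```

For this sample, GBK needs 12 bytes and UTF-8 needs 18, which is the one-third saving described in point 1. ASCII characters take one byte in both encodings, so the saving on a real page with markup is smaller.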

If the website is in a foreign language, go with UTF-8 without hesitation.

One more point to note: GBK and UTF-8 encode text differently. If you change the charset after the site has been indexed and the spider does not notice the change in time, it will judge the page content to be abnormal, and the page may be dropped from the index ("K'd").
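As a rough sketch of what "noticing the charset change" could involve, the following Python reads the charset declared in a page's meta tag and compares it with a previously stored value. The helper names `declared_charset` and `charset_changed`, the 1024-byte scan window, and the idea of a cached charset are all assumptions made for illustration. Real crawlers also consult HTTP headers and sniff the bytes.

```python
import re
from typing import Optional

# Matches both <meta charset="gbk"> and
# <meta http-equiv="Content-Type" content="text/html; charset=gbk">.
_META_CHARSET = re.compile(
    rb"<meta[^>]*?charset\s*=\s*[\"']?\s*([\w-]+)", re.IGNORECASE
)


def declared_charset(raw_html: bytes) -> Optional[str]:
    """Return the lowercased charset declared in a meta tag, or None."""
    match = _META_CHARSET.search(raw_html[:1024])  # only scan the start of the page
    return match.group(1).decode("ascii").lower() if match else None


def charset_changed(cached: Optional[str], raw_html: bytes) -> bool:
    """True if the page declares a charset that differs from the cached one."""
    current = declared_charset(raw_html)
    return current is not None and current != cached


page = b'<html><head><meta charset="GBK"></head><body></body></html>'
print(declared_charset(page))          # gbk
print(charset_changed("utf-8", page))  # True
```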

Take my own forum as an example (the example differs slightly from the real situation and is only meant to illustrate the general idea). As shown in Figure 1, the forum uses GBK encoding and the browser displays it normally.

Figure 1:

  

If you force the browser to interpret the page as UTF-8, it looks like Figure 2.

Figure 2:

  

The same applies in the other direction. Suppose a page used UTF-8 and was already indexed by the search engine, and you then switch it to GBK midway. If the spider does not notice the changed charset value in time, it will keep parsing the page with the old encoding. The result differs greatly from the previously normal page, which may lead to the page being dropped.
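The garbling in both directions can be reproduced with a short Python sketch. The sample string is an assumption chosen for illustration, and `errors="replace"` is used so that undecodable bytes become replacement characters instead of raising an exception.

```python
text = "论坛首页"  # hypothetical page text ("forum home page")

gbk_bytes = text.encode("gbk")     # what a GBK page sends
utf8_bytes = text.encode("utf-8")  # what a UTF-8 page sends

# Decoding with the matching charset recovers the original text.
assert gbk_bytes.decode("gbk") == text
assert utf8_bytes.decode("utf-8") == text

# A GBK page read as UTF-8 (the Figure 2 situation).
as_utf8 = gbk_bytes.decode("utf-8", errors="replace")

# A UTF-8 page read as GBK (stale charset after switching encodings).
as_gbk = utf8_bytes.decode("gbk", errors="replace")

print(repr(as_utf8))
print(repr(as_gbk))
```

In both cases the decoded string no longer matches the original, which is the kind of sudden content change described above.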

Theory needs to be tested in practice, so I ran this experiment on one of my own pages (page address http://sl.zoum5.com), which previously used UTF-8 and was then changed to GBK. On May 5, this page ranked first for the search keyword "included in the bulk query"; today it has vanished from Baidu without a trace. Whether changing the encoding midway affects the weight the page had accumulated still needs further observation.

As of May 13, the dropped page has been restored to normal. The drop lasted about five or six days.

Original address: http://www.zoum5.com/seo/119.html
