Converting Chinese character encodings and character sets

Source: Internet
Author: User
Tags: i18n
1) Let's talk about locale first:

Locale support was introduced to unify Linux's handling of different languages. A locale can be understood as a language environment defined according to the cultural conventions of a country or region. If no locale is defined, the system falls back to a default (typically the C/POSIX locale, which behaves like plain English), and you can usually still browse web pages in other languages as long as the corresponding character sets and fonts are installed. However, to enter Chinese characters or to mount Windows partitions, you must set the locale environment variables yourself. Run /usr/bin/locale to view the current locale environment variables.
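As a rough illustration of which environment variables are involved, here is a minimal Python sketch that picks the locale-related entries out of an environment. The helper name locale_variables is made up for this example, and it only reads variables; it does not replace the locale command.

```python
import os

def locale_variables(env=None):
    """Return the locale-related variables (LANG, LANGUAGE, LC_*) from env.

    env defaults to the current process environment; passing a dict makes
    the function easy to try out without touching the real environment.
    """
    env = os.environ if env is None else env
    return {
        name: value
        for name, value in sorted(env.items())
        if name in ("LANG", "LANGUAGE") or name.startswith("LC_")
    }

if __name__ == "__main__":
    # Print whatever locale variables are set in the current shell session.
    for name, value in locale_variables().items():
        print(f"{name}={value}")
```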

What is a character set? A character set can be thought of as the system's scheme for encoding characters, including non-English ones. Each character is indexed by its Unicode number (its code point).

Unicode assigns a number to every character, but how that number is stored as bytes depends on the encoding in use. UTF-8 is the most common encoding for data exchanged between programs and over the network; it represents a Latin letter in one byte and most Chinese characters in three bytes. GB2312, by contrast, represents each Chinese character in two bytes (ASCII characters still take one byte in its usual EUC-CN form). Unicode itself does not fix a storage size: UTF-32 stores every character in four bytes, while UTF-16, which Windows uses for file names, stores most characters in two. This matters when mounting Windows partitions, because file names must be converted to the encoding of your locale.
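To check the byte counts for yourself, the following minimal Python sketch encodes the same text with several codecs and converts GB2312 bytes to UTF-8. It assumes Python 3 and its built-in codecs; the helper name convert is made up for this example. Note that a common Chinese character such as 中 takes three bytes in UTF-8 and two in GB2312.

```python
def convert(data: bytes, source: str, target: str) -> bytes:
    """Re-encode bytes from one character encoding to another.

    The bytes are first decoded to Unicode text, then encoded again,
    so Unicode acts as the common intermediate form.
    """
    return data.decode(source).encode(target)

if __name__ == "__main__":
    text = "A中"  # one Latin letter and one Chinese character
    # Byte counts for this text: utf-8 -> 4, gb2312 -> 3,
    # utf-16-le -> 4, utf-32-le -> 8.
    for codec in ("utf-8", "gb2312", "utf-16-le", "utf-32-le"):
        encoded = text.encode(codec)
        print(codec, len(encoded), "bytes", encoded.hex())

    gb_bytes = "中文".encode("gb2312")            # 4 bytes
    utf8_bytes = convert(gb_bytes, "gb2312", "utf-8")  # 6 bytes
    print(len(gb_bytes), len(utf8_bytes), utf8_bytes.decode("utf-8"))
```

Characters that GB2312 does not cover raise UnicodeEncodeError when encoded to it, so real conversions usually need an error-handling policy.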

2) How to display Chinese characters in an English environment:

First, check the current values of LANG and LC_CTYPE with echo on the command line. Then export LANG="en_US.UTF-8" sets the environment (for the current shell and the programs started from it) to English, and export LC_CTYPE="zh_CN.UTF-8" makes character handling follow Chinese conventions within that English environment, which is enough to display Chinese. These two variables are stored in /etc/sysconfig/i18n. You can edit that file directly with vi, or copy the settings to ~/.i18n.
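The reason LC_CTYPE can differ from LANG is the lookup order POSIX describes for locale variables: LC_ALL overrides everything, then the category's own variable applies, then LANG. The Python sketch below models only that order; the function name effective_locale is made up for this example, and falling back to "C" when nothing is set is an assumption about the typical default.

```python
import os

def effective_locale(category: str, env=None) -> str:
    """Resolve the locale name used for one category, e.g. "LC_CTYPE".

    Lookup order: LC_ALL, then the category variable, then LANG.
    Returns "C" if none of them is set (assumed default).
    """
    env = os.environ if env is None else env
    for name in ("LC_ALL", category, "LANG"):
        value = env.get(name)
        if value:  # unset and empty values are both skipped
            return value
    return "C"

if __name__ == "__main__":
    # English environment with Chinese character handling.
    example_env = {"LANG": "en_US.UTF-8", "LC_CTYPE": "zh_CN.UTF-8"}
    print(effective_locale("LC_CTYPE", example_env))     # zh_CN.UTF-8
    print(effective_locale("LC_MESSAGES", example_env))  # en_US.UTF-8
```

With these settings, messages stay in English because LC_MESSAGES falls through to LANG, while character classification and display use the Chinese locale.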

3) Chinese input methods:

There are currently two main choices: scim and fcitx. Generally, the GNOME desktop uses scim and KDE uses fcitx. Fcim is installed in fc7 by default. It integrates scim and fcitx, which is possible because the scim platform is compatible with other input methods.
