When the localization process involves the transmission of source and destination files, the encoding becomes important. Chinese web pages and operating systems are usually ANSI-encoded, which is a character standard for Microsoft OS. For ANSI, different countries and regions have set different standards, resulting in GB2312 (Simplified Chinese), BIG5 (Traditional Chinese), JIS (Japanese) and other individual coding standards. However, different ANSI encoding is incompatible between different languages, so for the transfer of files between different operating systems, or under the same operating system, the source file language is different from the language file of the OS, it needs to be converted into UT8 format.
Specific differences:
a ansi:16384 character. This is the ANSI character standard. English one byte, Chinese two bytes
Unicode: encode almost all languages in the world using two bytes (0x0000-0xffff English, Chinese are two bytes) The characters expressed are unique, So different languages can coexist in the text, solve the problem of internationalization
Utf8 Unicode expressed as 0x0041 space, So the English compressed into 1 encoded, but Chinese characters in Utf8 cost This is the Chinese web page as Ansi encoding while the foreigner's webpage is commonly used Utf8
In the English OS, the control and panel settings-regional and language options,standards and formats are set to 中文版.
Ansii, Unicode, UTF8 differences and relationships