During the localization process, the transmission of source and target files is involved, and encoding is very important at this time. Chinese Web pages and operating systems generally adopt ANSI encoding, which is also a character standard of Microsoft OS. For ANSI, different countries and regions have different standards, resulting in gb2312 (Simplified Chinese), big5 (Traditional Chinese), JIS (Japanese) and Other encoding standards. However, different ANSI encodings are incompatible between different languages. Therefore, for file transfer between different operating systems, or in the same operating system, the source file language is different from the OS language file transmission,Convert to ut8 format.
Specific differences:
ANSI: 16384Characters. This isANSICharacter Standard. One English byte, two Chinese bytes
UNICODE: Uses two bytes to encode almost all languages in the world ( Zero x 0000 - 0 xFFFF ), 65536 Characters , For each languageCodeSegment not Same,Two bytes ( Both English and Chinese are two bytes. ) The characters are unique, so different languages can coexist in the text,Solve international problems
Utf8 Yes Unicode Compression Mode , English A In Unicode Is Zero X 0041 Foreigners think this storage method is too wasteful. 50% Space,Therefore, the English language is compressed 1 Bytes, Into Utf8 Encoding,However Utf8 Moderate percentage 3 Byte, apparently not as good as Chinese ANSI Cost-effective This is what Chinese Web pages are used ANSI Code, which is commonly used by foreigners. Utf8 .
In the English OS, set control and panel settings-Regional and language options, standards and formats to English.