1. commonly used Chinese characters include 3000 ~ 5000, cannot use one byte encoding.
2. The GB2312-80 of the general Chinese Character character set (basic set) and its exchange code standard published by China has collected a total of 7445 characters, two bytes are used to encode a character, the maximum bit of each byte is 0. GB2312-80 code for short.
3. The Chinese character internal code is the Information encoding used to store, process, and transmit Chinese characters in the computer. It must be compatible with ASCII codes but cannot conflict with each other. The inner code is obtained from the highest position "1" of the two bytecode.
4. Chinese character input codes, also known as external codes, are the codes used to input Chinese characters from the keyboard. There are three main types: digital encoding, Pinyin encoding, and font encoding.
Digit encoding uses a string of numbers to represent a Chinese character. The most commonly used is the location code .. It subtract 00100000 (20 h) from each byte of the Country Standard Code, and is subtracted from the control character. The high byte is the area code, and the low byte is the bit code. There is no duplicate code, which is hard to remember.
Pinyin encoding is based on Chinese pronunciation. It is simple, has a high bitrate, and has a low input speed.
The font encoding is based on the Chinese character shape. because the components that constitute Chinese characters are certain, the components are encoded and input in order to represent a Chinese character. five pen and table codes are common.
5. The font code is used to generate a font when outputting Chinese characters. There are two ways to display the font: vector character and bitmap character.
6. The Chinese character information processing system must contain a Chinese Character Library, which stores Chinese Character submodules for display and output. The font library can be used to fix Chinese cards on the read-only memory chip, or a soft font that is stored in a floppy disk or hard disk as a file.