Binary representation of Chinese characters in a computer

Source: Internet
Author: User

1. commonly used Chinese characters include 3000 ~ 5000, cannot use one byte encoding.

2. The GB2312-80 of the general Chinese Character character set (basic set) and its exchange code standard published by China has collected a total of 7445 characters, two bytes are used to encode a character, the maximum bit of each byte is 0. GB2312-80 code for short.

3. The Chinese character internal code is the Information encoding used to store, process, and transmit Chinese characters in the computer. It must be compatible with ASCII codes but cannot conflict with each other. The inner code is obtained from the highest position "1" of the two bytecode.

4. Chinese character input codes, also known as external codes, are the codes used to input Chinese characters from the keyboard. There are three main types: digital encoding, Pinyin encoding, and font encoding.

Digit encoding uses a string of numbers to represent a Chinese character. The most commonly used is the location code .. It subtract 00100000 (20 h) from each byte of the Country Standard Code, and is subtracted from the control character. The high byte is the area code, and the low byte is the bit code. There is no duplicate code, which is hard to remember.

Pinyin encoding is based on Chinese pronunciation. It is simple, has a high bitrate, and has a low input speed.

The font encoding is based on the Chinese character shape. because the components that constitute Chinese characters are certain, the components are encoded and input in order to represent a Chinese character. five pen and table codes are common.

5. The font code is used to generate a font when outputting Chinese characters. There are two ways to display the font: vector character and bitmap character.

6. The Chinese character information processing system must contain a Chinese Character Library, which stores Chinese Character submodules for display and output. The font library can be used to fix Chinese cards on the read-only memory chip, or a soft font that is stored in a floppy disk or hard disk as a file.

 

Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.