An encoding is a scheme. Every encoding corresponds to a character set, that is, the set of characters it is able to represent.
For example, the original ASCII code contains only 128 characters (codes 0 to 127), and was later extended to 256 characters (codes 0 to 255).
Then there is the GB2312 encoding, whose character set contains 6,763 Chinese characters plus several hundred other symbols.
GB2312 can be understood either as a character set (some people call this a "coded character set") or as an encoding scheme (some people call this a "character set encoding").
As a character set, it refers to the fixed set of several thousand characters that GB2312 contains.
As an encoding scheme, it maps each Chinese character to a binary value (a sequence of bytes). For example, GB2312 represents a Chinese character in two bytes, while UTF-8 represents a common Chinese character in three bytes.
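To make the byte counts concrete, here is a small Python sketch that encodes one Chinese character under both schemes. The character "中" and the variable names are just picked for illustration; any character in the GB2312 set should behave the same way.

```python
# Encode the same character with two different encoding schemes.
ch = "中"  # example character, chosen only for illustration

gb_bytes = ch.encode("gb2312")   # GB2312: two bytes per Chinese character
utf8_bytes = ch.encode("utf-8")  # UTF-8: three bytes for this character

print(gb_bytes.hex(), len(gb_bytes))      # d6d0 2
print(utf8_bytes.hex(), len(utf8_bytes))  # e4b8ad 3

# Decoding with the matching scheme recovers the original character.
assert gb_bytes.decode("gb2312") == ch
assert utf8_bytes.decode("utf-8") == ch
```

The character is the same in both cases; only the bytes that stand for it differ, which is the difference between a character set and an encoding scheme.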
This article draws on the following: http://www.cnblogs.com/anran_guojianjun/archive/2008/12/27/1363465.html
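In addition, it should be noted that Unicode is a bit special. Strictly speaking, Unicode is a character set that assigns every character a number (a code point). The "2 bytes per character" storage that people usually mean by "Unicode encoding" is UCS-2/UTF-16, where even a plain English letter takes 2 bytes instead of 1 (and characters outside the Basic Multilingual Plane take 4). It's a little different from ASCII. Haha.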
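A quick way to see this is to count the bytes a few characters take under different Unicode encodings. The sketch below is only illustrative: the helper name byte_lengths and the sample characters are my own choices, and the "-le" codec variants are used so that no byte order mark is added to the count.

```python
# Count how many bytes one character takes under several Unicode encodings.
def byte_lengths(ch):
    # "-le" variants are used so no byte order mark (BOM) is prepended.
    encodings = ("utf-8", "utf-16-le", "utf-32-le")
    return {enc: len(ch.encode(enc)) for enc in encodings}

print(byte_lengths("A"))   # {'utf-8': 1, 'utf-16-le': 2, 'utf-32-le': 4}
print(byte_lengths("中"))  # {'utf-8': 3, 'utf-16-le': 2, 'utf-32-le': 4}
print(byte_lengths("😀"))  # {'utf-8': 4, 'utf-16-le': 4, 'utf-32-le': 4}
```

The letter "A" is 1 byte in ASCII and UTF-8 but 2 bytes in UTF-16, which is the difference from ASCII mentioned above.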
In addition, I wrote an entertaining article about the history of character encoding. It is long, but you should come away with a deeper understanding after reading it.