Very detailed very good, turn around to study:Reprinted from: http://www.cnblogs.com/lidabo/archive/2013/11/27/3446518.html1, character encoding, internal code, incidentally introduced Chinese character codingCharacters must be encoded before they
Coding knowledge Summary
The earliest encoding is ASCII, which is only 1-127, expressed in one byte. And the first bit of this byte is 0.Later, many countries found that ASCII characters are too few. For example, Chinese characters cannot
The problem of character encoding seems to be very small, often overlooked by technical staff, but it can easily lead to some puzzling problems. Here is a summary of the character encoding of some of the popular knowledge, I hope to be helpful to
Unicode, ucs-2, ucs-4, UTF-16, utf-32, UTF-8
Unicode details
Copyright Notice: It can be reproduced at will, but the original author charlee and original link http://tech.idv2.com/2008/02/21/unicode-intro/must be indicated in a timely
In programming, these three character encoding formats are often converted to each other, so that many third-party libraries fail to call for unknown reasons, in fact, many are because third-party libraries support UTF-8 rather than Windows Default
Reference:Http://www.ruanyifeng.com/blog/2007/10/ascii_unicode_and_utf-8.htmlHttp://www.cnblogs.com/mjgforever/archive/2008/02/27/1083135.html1. ASCII codeWe know that inside the computer, all the information is ultimately represented as a binary
UTF-32 stores each character in 4 bytes to ensure that the UCS is fully represented. However, the number of characters in the UCS does not need to be represented by 32 bits at all, UTF-32 greatly wasted space. In addition, because of the combination
The default encoding of many files is ISO-8859-1, while the default encoding of the Chinese operating system is gb18030, the project code established in this workspace is gb18030. our commonly used encoding is UTF-8, this provides better
Character encoding: ASCII, Unicode, UTF-8, gb2312
1. ASCII code
We know that in a computer, all information is eventually represented as a binary string. Each binary bit has two states: 0 and 1. Therefore, eight binary bits can combine 256
Unicode
We know that the computer is actually stupid. It only knows strings like 0101. Of course, we certainly feel dizzy when we look at the 01 string, so many times we simply use decimal to describe it, hexadecimal format, which is octal. in fact,
The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion;
products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the
content of the page makes you feel confusing, please write us an email, we will handle the problem
within 5 days after receiving your email.
If you find any instances of plagiarism from the community, please send an email to:
info-contact@alibabacloud.com
and provide relevant evidence. A staff member will contact you within 5 working days.