2015/5/1 (ii) Character set and character encoding

Source: Internet
Author: User

First of all, we need to understand that the computer hardware is only read binary files, but for these complex 10110101101, the meaning of these things, even the Advanced program apes may not be able to understand, let alone those who are not the IT industry, but, with the popularization of computers, Naturally, someone came up to solve the problem of communication between the person and the computer. The first debut is the ASCII this thing, the full name American Standard code for information interchange, that is the United States Standards Information Interchange code below the drawings:

         simply, this is a standard, like language, This is the meaning of this, that the representative of what meaning, this standard contributed to the good situation of communication between people and computers, but, this thing is made by the Americans, the computer was not popularized, from the narrow view of history, the Americans certainly meet their own requirements, These 128 character encodings are enough for you.

  but then the computer became popular, which was problematic, like   " é "This French letter, there is no ASCII this character set inside, that can be said, because ASCII is to take a byte, but only takes up the last 7 bits of a byte, and the first 1-bit uniform is 0. So everyone has an idea, occupy this high, plus the previous 128 character code, can reach 256 characters encoding, basically meet the European coding needs, Europe is the solution, that Asia, and Africa, the world so many language symbols, they should go, so they are not to be outdone , on the basis of preserving the previous 128 character encodings, make code, such as China's GB2312, and so on.

< Span style= "FONT-SIZE:14PX; line-height:28.7999992370605px; Color:rgb (17, 17, 17); Font-family: Microsoft Jacob Black, ' Microsoft Yahei '; Letter-spacing: -0.119999997317791px; word-spacing:2.40000009536743px; Background-color:rgb (245, 245, 213); " > So there are many kinds of character sets and encoding in the world, at this time, the great OSI can not see, as a series of standards, it also developed a set of code called Unicode, known as the Unified Code, universal code, anyway is very diao. Of course, the later said Utf-8,utf-16 is a few of the Unicode implementation way! (I haven't read the details yet!) omitted)

It said some of the birth history what kind of, personally think very interesting, the following is said some definitions:

Character Set (charset) The so-called character set is a collection of all the abstract characters that a system supports, and the character is a generic term for all kinds of words and symbols, such as the ASCII character set, GB2312 character set!

Encoding (CharSet encoding) encoding is the process of converting information from one form or format to another, which means that the process of using a binary representation of a particular meaning information (such as numbers, text, symbols, etc.) is used on a computer. To mention the decoding is the inverse of the code, and complement it, I have to turn over the book, there is known to pay attention to the explanation.

The relationship between character sets and encodings is that encoding is how the character set is implemented on a computer!

Speaking of ANSI, this is also an American friend launched, double-byte encoding, for example, we use the window comes with a text editor to save the file will appear:

This ANSI is the default encoding, for the English file is ASCII encoding, and Chinese is GB2312, do not know you know, anyway, I did not know yesterday.

This character set and the code to say so much first, in fact, about each encoding, with a few bytes ah, what form is not too clear, and supplemental (2015/5/2)




2015/5/1 (ii) Character set and character encoding

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.