Help: character display problems

Source: Internet
Author: User
Tags ultraedit
Document directory
  • In summary, "?" It used two bytes: C6, 00, but after different processing, it is easy to become 3f, that is, the real English ?, Why?
  •  
  • Thank you for your understanding!

There is a batch file named "TXT", which is very simple. There is only one line of string "AB China? 12 ". Note:"?" It is neither a Chinese Question mark nor an English question mark. It is a non-standard character and actually occupies two bytes, as shown in: C6 32. Because there is no corresponding character, it cannot be properly displayed, it is displayed as "?". The following is the question mark (?) Some experiments.

 

A ultraedit

The following figure shows the results in the ultraedit editor.

1Source Text: "?" It corresponds to two bytes: C6 32, which is shown as follows:

 

2ANSIFormat: Use ultraedit to save it as an ANSI/ASCII format. It corresponds to two bytes: C6 32, which are not missing and are shown as follows:

 

3UFT-8Format: Use ultraedit to saveUFT-8Format, "?" Corresponding to 1 byte: 3f, which is an English question mark? The corresponding ASCII code is obviously one byte missing, that is, when it is savedUFT-8 CodingFormat, the C6 32 is changed to 3f. Shown as follows:

 

4UTF-16 (UNICODE)Format: Use ultraedit to saveUTF-16Unicode format. "?" 2 bytes: 3f 00, which is an English question mark? Unicode code. Normally, it should be C6 00 32 00, which is obviously also a loss of bytes, as shown below:

 

 

Notepad Editor

 

The following figure shows how to open the 1.txt file with the notepad editor notepadof Windows system. Do you want to save it as a file of several different encoding formats, and then view the file in hexadecimal mode as follows:

Source Text: "?" It corresponds to two bytes: C6 32, which is shown as follows:

 

ANSIFormat: "?" The corresponding value is 1 byte: 3f, which is obviously one byte missing, as shown below:

 

UTF-16 (UNICODE)Format: "?" Corresponding to 2 bytes: 3f 00, shown as follows:

 

UFT-8Format: "?" Corresponding to 1 byte: 3f, shown as follows:

 

 

 

After the three copies and paste the changes

Open the file 1.txt in ultraeditand paste it to the new file, "?" As shown in:

As shown in, after copying and pasting, C6 32 becomes 3f, "?" The corresponding byte is missing. Why?

In summary, "?" It used two bytes: C6, 00, but after different processing, it is easy to become 3f, that is, the real English ?, Why? Thank you for your understanding!

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.