Document directory
- In summary, "?" It used two bytes: C6, 00, but after different processing, it is easy to become 3f, that is, the real English ?, Why?
-
- Thank you for your understanding!
There is a batch file named "TXT", which is very simple. There is only one line of string "AB China? 12 ". Note:"?" It is neither a Chinese Question mark nor an English question mark. It is a non-standard character and actually occupies two bytes, as shown in: C6 32. Because there is no corresponding character, it cannot be properly displayed, it is displayed as "?". The following is the question mark (?) Some experiments.
A ultraedit
The following figure shows the results in the ultraedit editor.
1Source Text: "?" It corresponds to two bytes: C6 32, which is shown as follows:
2ANSIFormat: Use ultraedit to save it as an ANSI/ASCII format. It corresponds to two bytes: C6 32, which are not missing and are shown as follows:
3UFT-8Format: Use ultraedit to saveUFT-8Format, "?" Corresponding to 1 byte: 3f, which is an English question mark? The corresponding ASCII code is obviously one byte missing, that is, when it is savedUFT-8 CodingFormat, the C6 32 is changed to 3f. Shown as follows:
4UTF-16 (UNICODE)Format: Use ultraedit to saveUTF-16Unicode format. "?" 2 bytes: 3f 00, which is an English question mark? Unicode code. Normally, it should be C6 00 32 00, which is obviously also a loss of bytes, as shown below:
Notepad Editor
The following figure shows how to open the 1.txt file with the notepad editor notepadof Windows system. Do you want to save it as a file of several different encoding formats, and then view the file in hexadecimal mode as follows:
Source Text: "?" It corresponds to two bytes: C6 32, which is shown as follows:
ANSIFormat: "?" The corresponding value is 1 byte: 3f, which is obviously one byte missing, as shown below:
UTF-16 (UNICODE)Format: "?" Corresponding to 2 bytes: 3f 00, shown as follows:
UFT-8Format: "?" Corresponding to 1 byte: 3f, shown as follows:
After the three copies and paste the changes
Open the file 1.txt in ultraeditand paste it to the new file, "?" As shown in:
As shown in, after copying and pasting, C6 32 becomes 3f, "?" The corresponding byte is missing. Why?
In summary, "?" It used two bytes: C6, 00, but after different processing, it is easy to become 3f, that is, the real English ?, Why? Thank you for your understanding!