The problem of character encoding seems to be very small, often overlooked by technical staff, but it can easily lead to some puzzling problems. Here is a summary of the character encoding of some of the popular knowledge, I hope to be helpful to
Unicode in JavaScript, unicodejavascript
Unicode in JavaScript
By Jinya
[For more information, see http://blog.csdn.net/ei1_nino]
Glossary:
BMP :( BasicMultilingual Plane) It is also referred to as "Zero plane", Plane 0
UCS: Universal Character
I have nothing to worry about recently. I was overwhelmed by Unicode a while ago. So I want to see msdn! English looks really hard. In order to save some effort in the future, the translation has summarized it for future reference. The English level
The first function is the conversion function between wide characters and multi-byte characters. The function prototype is as follows:
Int widechartomultibyte (Uint codePage,DWORD dwflags,Lpcwstr lpwidecharstr,Int cchwidechar,Lpstr
Unicode in JavaScriptby Jinya"Reprint please indicate the source, Http://blog.csdn.net/EI__Nino"Noun Explanation:BMP: (basicmultilingual Plane) It is also referred to as "0th plane", Plane 0UCS: Universal Character Set (Universal Character set,
PHP achieves Unicode and Utf-8 mutual conversion
I. coding principles and implementation
Unicode encoding is the basis for UTF-8 and gb series encoding (gb2312, gbk, and gb18030) Conversion. although we can also directly create a UTF-8-to-these
I. Coding principle and implementationUnicode encoding is the basis for implementing Utf-8 and GB series encoding (GB2312, GBK, GB18030), although we can also directly do a utf-8 to these coded comparison table, but very few people will do so,
UTF code
The UTF-8 is to encode the UCS in 8-bit units. The encoding method from UCS-2 to UTF-8 is as follows:
UCS-2 encoding (16-in-system)
UTF-8 byte stream (binary)
0000-007f
0xxxxxxx
0080-07ff
110xxxxx 10xxxxxx
0800-ffff
1110xxxx 10xxxxxx
Transferred from:Http://www.ceapet.com/blog/index.php? Option = com_content & view = article & id = 162: javaunicode & catid = 38: Java & Itemid = 64
Use native2ascii on the command line interface1. Convert Chinese characters to
During Java Development, some garbled characters may occur, or files that cannot be correctly identified or read, such as Common Message Resources (properties) used for validator verification) the file must undergo Unicode re-encoding. The reason is
The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion;
products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the
content of the page makes you feel confusing, please write us an email, we will handle the problem
within 5 days after receiving your email.
If you find any instances of plagiarism from the community, please send an email to:
info-contact@alibabacloud.com
and provide relevant evidence. A staff member will contact you within 5 working days.