PHP and UTF-8 best practices in detail, UTF-8 Best Practices
The article "PHP string, encoding, UTF-8" covered background knowledge that was fairly dry. Now for something more useful: best practices for string processing in PHP. This article is "PHP,
PHP character encoding conversion class,
supporting mutual conversion between ANSI, Unicode, Unicode big endian, UTF-8, and UTF-8+BOM.
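A conversion class of this kind can be sketched with the mbstring extension. This is only an illustration, not the class the teaser refers to. The function name convert_text and both lookup tables are hypothetical, "ANSI" is assumed to mean Windows-1252, and "Unicode" is assumed to mean UTF-16LE with a BOM (the Windows Notepad convention).

```php
<?php
// Hypothetical sketch, not the original class. Requires the mbstring extension.
// Assumptions: "ANSI" = Windows-1252, "Unicode" = UTF-16LE with a BOM.

// BOM bytes for the encodings that carry one.
const BOMS = [
    'UTF-8+BOM'          => "\xEF\xBB\xBF",
    'Unicode'            => "\xFF\xFE",
    'Unicode big endian' => "\xFE\xFF",
];

// Mapping from the names used in the text to mbstring encoding names.
const MB_NAMES = [
    'ANSI'               => 'Windows-1252',
    'Unicode'            => 'UTF-16LE',
    'Unicode big endian' => 'UTF-16BE',
    'UTF-8'              => 'UTF-8',
    'UTF-8+BOM'          => 'UTF-8',
];

function convert_text(string $bytes, string $from, string $to): string
{
    // Drop the source BOM if the source encoding uses one and it is present.
    $bom = BOMS[$from] ?? '';
    if ($bom !== '' && strncmp($bytes, $bom, strlen($bom)) === 0) {
        $bytes = substr($bytes, strlen($bom));
    }

    $converted = mb_convert_encoding($bytes, MB_NAMES[$to], MB_NAMES[$from]);

    // Prepend the target BOM when the target encoding calls for one.
    return (BOMS[$to] ?? '') . $converted;
}
```

For example, convert_text("\xE9", 'ANSI', 'UTF-8') turns the single Windows-1252 byte for "é" into the two UTF-8 bytes C3 A9.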
Four common text file encoding methods
ANSI encoding:
No file header (file encoding at the beginning of the symbolic
Google's Sitemap service requires that all site maps published must be encoded in Unicode UTF-8. Google does not even allow other Unicode encodings (such as UTF-16), not to mention non-Unicode encodings such as ISO-8859-1. Technically, this means
This article introduces a PHP character conversion class that supports mutual conversion between ANSI, Unicode, Unicode big endian, UTF-8, and UTF-8+BOM. Readers interested in PHP tutorials may find it a useful reference. PHP character encoding conversion class,
This article is the second part of a series on PHP, strings, encodings, and UTF-8. First, the conclusion: use UTF-8 encoding in every part of a PHP application.
PHP does not support the Unicode character set at the language level, but most of the problems can be
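To illustrate what the missing language-level support means in practice, here is a minimal sketch comparing PHP's byte-oriented string functions with the mbstring extension. Reaching for mbstring is an assumption here; the teaser is cut off before it names a remedy. The variable names are illustrative only.

```php
<?php
// Requires the mbstring extension. The \u{...} escapes need PHP 7.0 or later.
$text = "\u{4E2D}\u{6587}"; // two Chinese characters, 3 bytes each in UTF-8

// strlen() counts bytes, not characters.
$byteCount = strlen($text);

// mb_strlen() counts characters when given the encoding explicitly.
$charCount = mb_strlen($text, 'UTF-8');

// substr() works on bytes and could cut a character in half;
// mb_substr() works on whole characters.
$firstChar = mb_substr($text, 0, 1, 'UTF-8');
```

Here $byteCount is 6 while $charCount is 2, which is why passing the encoding explicitly (or setting it once with mb_internal_encoding('UTF-8')) matters.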
The invisible character 65279 in PHP (the UTF-8 BOM). Today on CSDN I saw a question about PHP output containing a blank, invisible character 65279. A search online showed that this 65279 character
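Decimal 65279 is hexadecimal FEFF, and in UTF-8 that code point is encoded as the three bytes EF BB BF. A minimal sketch of removing it from the start of a string follows; the helper name strip_utf8_bom is hypothetical.

```php
<?php
// Hypothetical helper: remove a leading UTF-8 BOM (U+FEFF, decimal 65279).
function strip_utf8_bom(string $text): string
{
    $bom = "\xEF\xBB\xBF"; // UTF-8 encoding of U+FEFF

    // Compare only the first three bytes; leave the text alone otherwise.
    if (strncmp($text, $bom, 3) === 0) {
        return substr($text, 3);
    }
    return $text;
}
```

When the BOM sits at the top of a PHP source or include file rather than in data, it is emitted as output before any code runs, so the usual remedy is to re-save the file as UTF-8 without BOM.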
This article introduces the differences between UTF-8 and UTF-8 without BOM. Readers interested in PHP tutorials may find it a useful reference. BOM stands for Byte Order Mark.
There is a character named "ZERO WIDTH NO-BREAK SPACE"
How to match Chinese characters with a UTF-8 regular expression. To use a UTF-8 regular expression to determine whether the input contains illegal characters, please refer
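One possible way to match Chinese characters in UTF-8 text is PCRE's Unicode script property together with the /u modifier. This is a sketch, not necessarily the pattern the referenced article uses, and both function names are hypothetical.

```php
<?php
// Hypothetical helpers. The /u modifier makes PCRE treat the pattern and
// the subject as UTF-8; \p{Han} matches any character in the Han script.

// True if $text contains at least one Han character.
function contains_chinese(string $text): bool
{
    return preg_match('/\p{Han}/u', $text) === 1;
}

// True if $text is non-empty and consists only of Han characters.
function is_all_chinese(string $text): bool
{
    return preg_match('/\A\p{Han}+\z/u', $text) === 1;
}
```

With /u, preg_match() returns false for a subject that is not valid UTF-8, so both helpers report false for malformed input instead of matching stray bytes.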
But I like to get to the bottom of things and understand whatever I care about, so I asked in one QQ group after another, and nobody responded. Alas, how frustrating. I had to Google it and teach myself. The following is a detailed
BOM stands for byte order mark.
UCS contains a character called "ZERO WIDTH NO-BREAK SPACE", whose code point is FEFF. FFFE, by contrast, is not a character in UCS, so it should never appear in actual transmission. The UCS
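Because FEFF is a valid character and FFFE is not, the order of the first two bytes reveals the byte order of a UTF-16 stream. A minimal detection sketch follows; the function name detect_bom is hypothetical.

```php
<?php
// Hypothetical helper: report which BOM, if any, starts a byte string.
// Limitation: UTF-32 is not handled, so a UTF-32LE BOM (FF FE 00 00)
// would be reported as UTF-16LE.
function detect_bom(string $bytes): ?string
{
    if (strncmp($bytes, "\xEF\xBB\xBF", 3) === 0) {
        return 'UTF-8';    // U+FEFF encoded as UTF-8
    }
    if (strncmp($bytes, "\xFE\xFF", 2) === 0) {
        return 'UTF-16BE'; // bytes arrive as FE FF: big endian
    }
    if (strncmp($bytes, "\xFF\xFE", 2) === 0) {
        return 'UTF-16LE'; // bytes arrive as FF FE: little endian
    }
    return null;           // no BOM found
}
```

A null result only means that no BOM is present; the encoding then has to be determined by other means.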