In php, we can use the mb_detect_encoding function to check string encoding or file encoding. The mb_detect_encoding function is a built-in function of php. The following is a brief introduction. In php, we can use the mb_detect_encoding function to check string encoding or file encoding. The mb_detect_encoding function is a built-in function of php. The following is a brief introduction.
Script ec (2); script
For the file encoding detection, Baidu regards it all, but it does not work,
Many people suggest mb_detect_encoding () detection, but I don't know why I didn't succeed, and I didn't output anything,
I saw someone wrote an enhanced version, and I decided to ignore it with BOM. This is totally unreliable,
Finally, according to the example below the mb_detect_encoding function in the PHP manual, I wrote a detection function,
It also includes functions that automatically detect the encoding and read the file according to the instruction code,
The source code is presented.
I tried to write the online method without using it. Maybe the environment is different.
So if it's useless, don't spray me, I just want to share ideas ,,
The php manual explains this as follows:
Mb_detect_encoding-encoding of detected characters, string mb_detect_encoding (string $ str [, mixed $ encoding_list = mb_detect_order () [, bool $ strict = false])
This function has three parameters:
1. str: the string www.111cn.net to be checked.
2. encoding_list: encoding_list is a character encoding list. The encoding sequence can be specified by list strings separated by arrays or commas.
If encoding_list is omitted, detect_order is used.
3. strict: strict specifies whether the encoding is strictly checked. The default value is FALSE.
The following is an example:
The Code is as follows: |
|
$ Encode = mb_detect_encoding ($ keytitle, array ('ascii ', 'gb2312', 'gbk', 'utf-8 ')); |
The three parameters are: the input variable to be detected. The encoding method check sequence (if the encoding method is true, it is automatically ignored later ).
The strict mode adjusts the encoding detection sequence to minimize the possibility of conversion errors.
In general, the first line of gb2312, when there is GBK and UTF-8, You need to commonly used to sort to the front
Complete instance
The Code is as follows: |
|
/** * Check the file encoding. * @ Param string $ file path * @ Return string | null returns the encoding name or null. */ Function detect_encoding ($ file ){ $ List = array ('gbk', 'utf-8', 'utf-16le', 'utf-16BE ', 'iso-8859-1 '); $ Str = file_get_contents ($ file ); Foreach ($ list as $ item ){ $ Tmp = mb_convert_encoding ($ str, $ item, $ item ); If (md5 ($ tmp) = md5 ($ str )){ Return $ item; } } Return null; } /** * Automatically parse the encoding to read the file * @ Param string $ file path * @ Param string $ charset read Encoding * @ Return string returns the read content */ Function auto_read ($ file, $ charset = 'utf-8 '){ $ List = array ('gbk', 'utf-8', 'utf-16le', 'utf-16BE ', 'iso-8859-1 '); $ Str = file_get_contents ($ file ); Foreach ($ list as $ item ){ $ Tmp = mb_convert_encoding ($ str, $ item, $ item ); If (md5 ($ tmp) = md5 ($ str )){ Return mb_convert_encoding ($ str, $ charset, $ item ); } } Return ""; } |
Finally, we recommend a php detection character encoding mb_detect_encoding () function.