Php Chinese character string truncation question mark

Php Chinese character string truncation question mark _ PHP Tutorial

Last Update:2017-05-13 Source: Internet

Author: User

Developer on Alibaba Coud: Build your first app with APIs, SDKs, and tutorials on the Alibaba Cloud. Read more ＞

Question mark of the Chinese character string in php. When using php built-in functions to intercept Chinese characters, you may encounter question marks. Below are some examples of Chinese characters accurately intercept. Php has no Chinese characters when dealing with string operations. when using php built-in functions to intercept Chinese characters, you may encounter question marks. below we have sorted out several very good examples of Chinese character precise intercept.

The time for php to operate on strings is nothing more than two problems:
1. determine whether the string encoding is gbk or unicode.
2. extract the corresponding encoding.

In general, we may encounter garbled characters when using substr to intercept Chinese characters. Because the Chinese character is dubyte, when a byte is intercepted, the Chinese character cannot be displayed and is out of order.

In fact, the solution is very simple. See the following screenshot function:

The code is as follows:
// Truncate an extra long string Function curtStr ($ str, $ len = 30 ){ If (strlen ($ str)> $ len ){ $ Str = substr ($ str, 0, $ len ); $ Str. = chr (0 )."... "; Return $ str; }

The above chr (0) is not null
Null is nothing, while chr (0) is 0. The hexadecimal value is 0 × 00, and the binary value is 00000000.
Although chr (0) does not display anything, it is a character.
When a Chinese character is truncated, according to the encoding rules, he always needs to pull the other characters behind it as an explanation of the Chinese character. this is the cause of garbled characters. The combination of 0x81 to 0xff and 0x00 is always displayed as "null"
Based on this feature, add a chr (0) after the substr result to prevent garbled characters.

The following are some functions that can be used to precisely intercept Chinese strings:

Truncates UTF-8 encoded multi-byte strings.

The code is as follows:
// Truncate the utf8 string Function utf8Substr ($ str, $ from, $ len) { Return preg_replace ('# ^ (? : [X00-x7F] \| [xC0-xFF] [x80-xBF] +) {0, '. $ from .'}'. '((? : [X00-x7F] \| [xC0-xFF] [x80-xBF] +) {0, '. $ len.'}). * # s ', '$ 1', $ str ); } ?>

Chinese character truncation functions supported by UTF-8 and GB2312

The code is as follows:

/*
Chinese character truncation functions supported by Utf-8 and gb2312
Cut_str (string, truncation length, start length, encoding );
The default encoding format is UTF-8.
The default start length is 0.
*/

Function cut_str ($ string, $ sublen, $ start = 0, $ code = 'utf-8 ')
{
If ($ code = 'utf-8 ')
{
$ Pa = "/[x01-x7f] | [xc2-xdf] [x80-xbf] | xe0 [xa0-xbf] [x80-xbf] | [xe1-xef] [x80-xbf] [x80-xbf] | xf0 [x90-xbf] [x80-xbf]] [x80-xbf] | [xf1-xf7] [x80-xbf] [x80-xbf] [x80-xbf]/";
Preg_match_all ($ pa, $ string, $ t_string );

If (count ($ t_string [0])-$ start> $ sublen) return join ('', array_slice ($ t_string [0], $ start, $ sublen )). "... ";
Return join ('', array_slice ($ t_string [0], $ start, $ sublen ));
}
Else
{
$ Start = $ start * 2;
$ Sublen = $ sublen * 2;
$ Strlen = strlen ($ string );
$ Tmpstr = '';

For ($ I = 0; $ I <$ strlen; $ I ++)
{
If ($ I >=$ start & $ I <($ start + $ sublen ))
{
If (ord (substr ($ string, $ I, 1)> 129)
{
$ Tmpstr. = substr ($ string, $ I, 2 );
}
Else
{
$ Tmpstr. = substr ($ string, $ I, 1 );
}
}
If (ord (substr ($ string, $ I, 1)> 129) $ I ++;
}
If (strlen ($ tmpstr) <$ strlen) $ tmpstr. = "...";
Return $ tmpstr;
}
}

$ Str = "the string to be intercepted by abcd ";
Echo cut_str ($ str, 8, 0, 'gb2312 ');
?>

Bytes. Php has no time to operate on strings...

This article is an English version of an article which is originally in the Chinese language on aliyun.com and is provided for information purposes only. This website makes no representation or warranty of any kind, either expressed or implied, as to the accuracy, completeness ownership or reliability of the article or any translations thereof. If you have any concerns or complaints relating to the article, please send an email, providing a detailed description of the concern or complaint, to info-contact@alibabacloud.com. A staff member will contact you within 5 working days. Once verified, infringing content will be removed immediately.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

Get Started for Free

Sales Support

1 on 1 presale consultation

Chat Contact Sales
After-Sales Support

24/7 Technical Support 6 Free Tickets per Quarter Faster Response

Open a Ticket
Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.

Learn More

Php Chinese character string truncation question mark _ PHP Tutorial

Contact Us

What's Trending

Top 10 Tags

Top 10 Keywords

A Free Trial That Lets You Build Big!

Sales Support

After-Sales Support

Php Chinese character string truncation question mark _ PHP Tutorial

Contact Us

What's Trending

Top 10 Tags

Top 10 Keywords

Trending Topic

A Free Trial That Lets You Build Big!

Sales Support

After-Sales Support