PHP iconv function Use method, Phpiconv function _php Tutorial

Source: Internet
Author: User
Tags translit

How to use Iconv function in PHP, Phpiconv function


Iconv function Library can complete the conversion between various character sets, which is an indispensable base Function library in PHP programming.
1, download Libiconv function library http://ftp.gnu.org/pub/gnu/libiconv/libiconv-1.9.2.tar.gz;
2, decompression TAR-ZXVF libiconv-1.9.2.tar.gz;
3, Installation Libiconv
#configure--prefix=/usr/local/iconv
#make
#make Install
4. Recompile php add compiler parameter--with-iconv=/usr/local/iconv

Windows under

Recently in a thief program, need to use the ICONV function to fetch the UTF-8 encoded page into gb2312, found that only with the ICONV function to fetch the data of a transcoding data will be less for no reason. Let me depressed for a while, go online a check information to know this is a iconv function of a bug. Iconv error when converting character "-" to gb2312
The workaround is simply to add "//ignore" to the second parameter of the ICONV function after the code that needs to be converted.

The following is the referenced content:

Copy CodeThe code is as follows:
Iconv ("UTF-8", "Gb2312//ignore", $data)


Ignore means ignoring errors at the time of conversion, and if there is no ignore argument, all strings after that character cannot be saved.

Copy CodeThe code is as follows:
echo $str = ' Hello, here is coffee! ';
Echo '
';
echo iconv (' GB2312 ', ' UTF-8 ', $str); To transfer the encoding of a string from GB2312 to UTF-8
Echo '
';
Echo Iconv_substr ($STR, 1, 1, ' UTF-8 '); Intercept by number of characters rather than bytes
Print_r (Iconv_get_encoding ()); Get the current page encoding information
Echo Iconv_strlen ($str, ' UTF-8 '); Gets the string length of the set encoding
That's what it's used for.
$content = Iconv ("UTF-8", "Gbk//translit", $content);
?>


Iconv is not the default function for PHP, it is also the default installed module. Need to be installed to use.
If it is windows2000+php, you can modify the php.ini file, will extension=php_iconv.dll before the ";" Remove, while you copy your original PHP installation file under the Iconv.dll to your winnt/system32 (if your DLL is pointing to this directory)
In the Linux environment, the static installation of the way, in the configure when adding a--with-iconv on it, phpinfo see iconv items. (linux7.3+apache4.06+php4.3.2),

Download: ftp://ftp.gnu.org/pub/gnu/libiconv/libiconv-1.8.tar.gz
Installation:
#cp LIBICONV-1.8.TAR.GZ/USR/LOCAL/SRC
#tar ZXVF lib*
#./configure--prefix=/usr/local/libiconv
#make
#make Install
Compiling PHP
#./configure--prefix=/usr/local/php4.3.2--with-iconv=/usr/local/libiconv/
simple examples of use:

Echo iconv ("gb2312", "iso-8859-1", "we");
?>

Introduction to Mb_convert_encoding and ICONV functions in PHP

Mb_convert_encoding This function is used to convert the encoding. It turns out that the concept of program coding is not understood, but it seems to be a bit enlightened now.
However, there is generally no coding problem in English, only the Chinese data will have this problem. For example, when you write a program with Zend Studio or editplus, using GBK encoding, if the data needs to enter the database, and the database encoding is UTF8, then the data will be encoded conversion, or into the database will become garbled.

Mb_convert_encoding's usage is in the official view:
http://cn.php.net/manual/zh/function.mb-convert-encoding.php

Make a GBK to UTF-8
< PHP
Header ("content-type:text/html; Charset=utf-8 ");
Echo mb_convert_encoding ("You Are my Friend", "UTF-8", "GBK");
?>

One more GB2312 to Big5.
< PHP
Header ("content-type:text/html; Charset=big5 ");
Echo mb_convert_encoding ("You Are my Friend", "Big5", "GB2312");
?>
However, to use the above function requires installation but first enable the mbstring extension library.

Another function in PHP, Iconv, is also used to convert string encodings, similar to functions on the upper function.

Here are a few more examples:
Iconv-convert string to requested character encoding
(PHP 4 >= 4.0.5, PHP 5)
Mb_convert_encoding-convert character encoding
(PHP 4 >= 4.0.6, PHP 5)

Usage:
String mb_convert_encoding (String str, string to_encoding [, mixed from_encoding])
Need to enable Mbstring expansion Library, in php.ini; Extension=php_mbstring.dll in front of; Remove
Mb_convert_encoding can specify a variety of input encoding, it will automatically identify according to the content, but the execution efficiency is much worse than the iconv;


String Iconv (String in_charset, String out_charset, String str)
Note: The second parameter, in addition to specifying the encoding to be converted to, can also add two suffixes://translit and//ignore, where//translit automatically converts characters that cannot be converted directly into one or more approximate characters,//ignore Characters that cannot be converted are ignored, and the default effect is truncated from the first illegal character.
Returns the converted string or FALSE on failure.


Use:

The iconv is found to have an error converting the character "-" to gb2312, and if there is no ignore parameter, all strings after that character cannot be saved. In any case, this "-" can not be converted successfully, unable to output. Another mb_convert_encoding does not have this bug.

In general, with Iconv, only use the Mb_convert_encoding function if you encounter an inability to determine what encoding the original encoding is, or if the Iconv conversion fails to display properly.

From_encoding is specified by character code name before conversion. It can be array or STRING-COMMA separated enumerated list. If It is not specified, the internal encoding would be used.
/* Auto detect encoding from JIS, Eucjp-win, Sjis-win, then convert str to Ucs-2le */
$str = mb_convert_encoding ($str, "Ucs-2le", "JIS, Eucjp-win, Sjis-win");
/* "Auto" is expanded to "ascii,jis,utf-8,euc-jp,sjis" */
$str = mb_convert_encoding ($str, "EUC-JP", "Auto");

Example:
$content = Iconv ("GBK", "Utf-8″, $content);
$content = mb_convert_encoding ($content, "Utf-8″," GBK ");

parameters that are easy to ignore when using the Iconv function in PHP
Today in the processing of crawling content, when the use of ICONV encoding conversion, found that the results will be interrupted, guess is the character set problem, consider how to skip the target character set does not exist characters, check the manual found that the function of Iconv only three parameters, as if not, and then check online that some people say yes, But it is very strange how to achieve, finally found that the English description can be added to the target code behind: "Translit", very depressed how to add it? The original is to add "//", really depressed, unexpectedly have such a design
Prototype: $txtContent = Iconv ("Utf-8", ' GBK ', $txtContent);

Special parameters: Iconv ("UTF-8", "Gb2312//ignore", $data)


Two optional auxiliary parameters: Translit and ignore (where ignore means to skip if it encounters an inability to convert). Description

String Iconv (String in_charset, String out_charset, String str)

Performs a character set conversion on the string str from In_charset to Out_charset. Returns the converted string or FALSE on failure.

If you append the string//translit to Out_charset transliteration is activated. This means if a character can ' t is represented in the target charset, it can be approximated through one or several Similarly looking characters. If you append the string//ignore, characters that cannot is represented in the target charset is silently discarded. Otherwise, str is cut from the first illegal character.

http://www.bkjia.com/PHPjc/1048772.html www.bkjia.com true http://www.bkjia.com/PHPjc/1048772.html techarticle php iconv function use method, Phpiconv function Iconv function library can complete the conversion between various character sets, is an indispensable base Function library in PHP programming. 1, download Libiconv function library h ...

  • Contact Us

    The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

    If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

    A Free Trial That Lets You Build Big!

    Start building with 50+ products and up to 12 months usage for Elastic Compute Service

    • Sales Support

      1 on 1 presale consultation

    • After-Sales Support

      24/7 Technical Support 6 Free Tickets per Quarter Faster Response

    • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.