php的漢字轉換: Unicode(UTF8)->GBK

來源:互聯網
上載者:User
漢字|轉換

秋水無恨 GBK Unicode UTF8 漢字 轉換

php的漢字轉換一直是比較麻煩的事

該類內建了四個過濾"&#[dec];","&#x[hex];","%u[hex]","utf8轉換"
方便使用者的使用,同時也可自訂過濾進行自己喜歡的操作


qswhU.php 從這裡下載
http://www.blueidea.com/user/qswh/qswhU.zip

class qswhU{
 var $qswhData;
 function qswhU($filename="qswhU.php"){
  $this->qswhData=file($filename);
 }
 
 function decode($str,$pattern=0){
  $arr=array("/&#(\w+);/iU","/((%\w\w)+)/i","/%u(\w{4,5})/iU");
  if(is_integer($pattern)){
    if($pattern>=count($arr))die("Invalid Function");
    $pattern=$arr[$pattern];
  }
  return preg_replace_callback($pattern,array($this,"u2gb"),$str);
 }
 
 function u2gb($arr){
  /******(qiushuiwuhen 2002-8-15)******/
  $ret="";$str=$arr[1];
  if(preg_match_all("/%\w{2}/",$str,$matches)){
   for($i=0;$i<count($matches[0]);$i++){
    $chr1=hexdec(substr($matches[0][$i],1));
    $arr=array("f0","e0","c0","0");
    for($j=0;$j<count($arr);$j++)if($chr1>hexdec($arr[$j]))break;
    $chr=hexdec(substr($matches[0][$i],1))-hexdec($arr[$j]);
    while(++$j<count($arr))$chr=$chr*0x40+(hexdec(substr($matches[0][++$i],1))-0x80);
    $str=dechex($chr);
    if(strlen($str)==4){
     $p=hexdec(substr($str,0,2))-0x4d;
     $q=hexdec(substr($str,2))*4;
     $ret.=chr(hexdec(substr($this->qswhData[$p],$q,2)));
     $ret.=chr(hexdec(substr($this->qswhData[$p],$q+2,2)));
    }else
     $ret.=chr(hexdec($str));
   }
  }
  else{
   if(strtolower($str[0])=="x")
    $str=substr($str,1);
   else
    if(strlen($str)!=4)$str=dechex($str);
   
   if(strlen($str)==4){
    $p=hexdec(substr($str,0,2))-0x4d;
    $q=hexdec(substr($str,2))*4;
    $ret.=chr(hexdec(substr($this->qswhData[$p],$q,2)));
    $ret.=chr(hexdec(substr($this->qswhData[$p],$q+2,2)));
   }else
    $ret.=chr(hexdec($str));
  }
  return $ret;
 }
 
 
}


使用範例

$qswh=new qswhU("qswhU.php");//如果檔案名稱是qswhU.php,可省參數
echo "<xmp>不帶參數(預設過濾為:&#[num];):";
echo "\n".$qswh->decode("中文Abc");
echo "\n".$qswh->decode("中文Abc");
echo "\n調用內建過濾(UTF轉碼):".$qswh->decode("%E4%B8%AD%E6%96%87%20!%22%23%24%25%26'()*%2B%2C%2F%3A%3B%3C%3D%3E%3F%40%5B%5D%5E%60%7B%7C%7D~%25Abc",1);
echo "\n調用內建過濾unescape(%u[num]):".$qswh->decode("%u4E2D%u6587Abc",2);
echo "\n自訂過濾([x+num]):".$qswh->decode("[x4E2D][x6587][x41][x62][x63]","/\[(\w+)\]/");

效果如下:
不帶參數(預設過濾為:&#[num];):
中文Abc
中文Abc
調用內建過濾(UTF轉碼):中文 !"#$%&'()*+,/:;<=>?@[]^`{|}~%Abc
調用內建過濾unescape(%u[num]):中文Abc
自訂過濾([x+num]):中文Abc

 



相關文章

Beyond APAC's No.1 Cloud

19.6% IaaS Market Share in Asia Pacific - Gartner IT Service report, 2018

Learn more >

Apsara Conference 2019

The Rise of Data Intelligence, September 25th - 27th, Hangzhou, China

Learn more >

Alibaba Cloud Free Trial

Learn and experience the power of Alibaba Cloud with a free trial worth $300-1200 USD

Learn more >

聯繫我們

該頁面正文內容均來源於網絡整理,並不代表阿里雲官方的觀點,該頁面所提到的產品和服務也與阿里云無關,如果該頁面內容對您造成了困擾,歡迎寫郵件給我們,收到郵件我們將在5個工作日內處理。

如果您發現本社區中有涉嫌抄襲的內容,歡迎發送郵件至: info-contact@alibabacloud.com 進行舉報並提供相關證據,工作人員會在 5 個工作天內聯絡您,一經查實,本站將立刻刪除涉嫌侵權內容。