Looking for a PHP "word segmentation" program that supports UTF-8
I want to implement search now, which means I need to run "word segmentation" on the input text. Does anyone have a good source program for search-oriented word segmentation that supports UTF-8? I would be very grateful if you could share it!
------ Solution --------------------
If you are on a virtual host (shared hosting), you can use the Zhimeng (DedeCMS) word segmentation algorithm V1.0.
http://www.dedecms.com/html/chanpinxiazai/20061229/3.html
If you have your own server, you can install the PHP extension SCWS (Simple Chinese Word Segmentation).
http://www.hightman.cn/index.php?scws
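For the SCWS route, here is a rough sketch of what the call sequence might look like once the extension is installed. The helper name segment_with_scws is made up for illustration. It assumes the extension's object-style interface (scws_new, set_charset, send_text, get_result) and that the dictionary and rule paths are already set in php.ini. Check the SCWS documentation for your version before relying on it.

```php
<?php
// Hypothetical helper (not part of SCWS): returns a list of words,
// or null when the scws extension is not loaded.
function segment_with_scws(string $utf8Text): ?array
{
    if (!extension_loaded('scws')) {
        return null; // extension missing, e.g. on shared hosting
    }

    $so = scws_new();           // create a segmenter handle
    $so->set_charset('utf8');   // input and output are UTF-8
    $so->send_text($utf8Text);  // queue the text to be segmented

    $words = [];
    // get_result() returns batches of tokens until it returns false.
    while ($batch = $so->get_result()) {
        foreach ($batch as $token) {
            $words[] = $token['word'];
        }
    }
    $so->close();

    return $words;
}
```

With the extension loaded, segment_with_scws("我要做搜索") should return an array of words; the exact split depends on the dictionary in use.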
------ Solution --------------------
Word segmentation is not really done on UTF-8 as such: UTF-8 is a universal encoding covering every language, so a single general-purpose segmenter for it is not practical. If your content is mainly Chinese, use a Chinese word segmenter such as the Zhimeng one: convert the text from UTF-8 to GBK before segmenting, then convert the result back to UTF-8. The conversion time is negligible compared with the segmentation itself.
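The convert, segment, convert-back idea can be sketched roughly like this. It assumes the mbstring extension is available. Both function names are hypothetical, and naive_gbk_split is only a stand-in that cuts GBK text into single characters; it is NOT the Zhimeng algorithm. Swap in whatever GBK segmenter you actually use.

```php
<?php
// Stand-in for a GBK-only segmenter (hypothetical, for illustration):
// splits GBK bytes into single characters.
function naive_gbk_split(string $gbk): array
{
    $tokens = [];
    $len = strlen($gbk);
    for ($i = 0; $i < $len; ) {
        $byte = ord($gbk[$i]);
        // In GBK, a lead byte in 0x81..0xFE starts a two-byte character.
        $width = ($byte >= 0x81 && $byte <= 0xFE && $i + 1 < $len) ? 2 : 1;
        $tokens[] = substr($gbk, $i, $width);
        $i += $width;
    }
    return $tokens;
}

// Wrap any GBK segmenter so that callers only ever see UTF-8.
function segment_utf8_via_gbk(string $utf8, callable $gbkSegmenter): array
{
    // 1. UTF-8 -> GBK before segmenting
    $gbk = mb_convert_encoding($utf8, 'GBK', 'UTF-8');
    // 2. run the GBK-only segmenter
    $tokens = $gbkSegmenter($gbk);
    // 3. GBK -> UTF-8 for every token
    return array_map(
        function (string $t): string {
            return mb_convert_encoding($t, 'UTF-8', 'GBK');
        },
        $tokens
    );
}
```

For example, segment_utf8_via_gbk("中文a", 'naive_gbk_split') gives ["中", "文", "a"]. Note that characters outside GBK will not survive the round trip, which is fine for mainly Chinese text but not for arbitrary UTF-8 input.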