PHP supports the UTF8 "word segmentation" program, solution
Source: Internet
Author: User
Ask PHP to support the UTF8 "word segmentation" program ...... I want to perform a search now, but I need to perform "word segmentation" on the input text. do you have a good source program (UTF-8 is supported) for searching and word segmentation ), we are very grateful to you for providing it !, If it is a VM, you can use the Zhimeng word segmentation algorithm V1.0http: // www. dedecm to find that PHP supports the UTF8 word segmentation program ......
I want to perform a search now, but I need to perform "word segmentation" on the input text. do you have a good source program (UTF-8 is supported) for searching and word segmentation ), we are very grateful to you for providing it!
------ Solution --------------------
For a VM, you can use the Zhimeng word splitting algorithm V1.0.
Http://www.dedecms.com/html/chanpinxiazai/20061229/3.html
If you have your own server, you can install the PHP extension SCWS-simple Chinese word segmentation system.
Http://www.hightman.cn/index.php? Scws
------ Solution --------------------
UTF8 does not actually perform word segmentation, because it is a universal code, so it is impossible. if you are mainly Chinese, you will use Chinese word segmentation, such as the word segmentation of Zhimeng, convert UTF8 to gbk before word splitting, and then convert it back to utf-8. the conversion time is slight relative to the word splitting.
The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion;
products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the
content of the page makes you feel confusing, please write us an email, we will handle the problem
within 5 days after receiving your email.
If you find any instances of plagiarism from the community, please send an email to:
info-contact@alibabacloud.com
and provide relevant evidence. A staff member will contact you within 5 working days.