Sphsf-does not support Chinese word segmentation, and many Chinese Word Segmentation components have been written in China. This article describes how to install libmmseg. It is a Chinese Word Segmentation software package designed for the sphsf-full-text search engine by coreseek.com, the Chinese Word Segmentation Method released under the GPL protocol adopts the mmseg of Chih-hao Tsai.Algorithm.
First download the installation package to libmmseg from the http://www.coreseek.cn/news/7/99/, as shown below:
CD / USR / Local / SRC /
Wget http: // Www.coreseek.cn / Uploads / CSFT / 3.2 / Coreseek - 3.2 . 13 .Tar.gz - C
Decompress:
Tar - Zxv - F coreseek - 3.2 . 13 .Tar.gz
Go to the mmseg folder and compile:
CD coreseek - 3.2 . 13 / Mmseg - 3.2 . 13 /
. / Configure -- Prefix =/ USR / Local / Mmseg
During the compilation process, an error of config. Status: Error: cannot find input file: src/makefile. In is reported. Then, run the following command to re-compile and pass the compilation:
Aclocal
Libtoolize -- Force
Automake -- Add - Missing
Autoconf
Autoheader
Make clean
Then compile and install:
. / Configure -- Prefix =/ USR / Local / Mmseg
Make && Make install
Add the mmseg command to the environment variable and run the mmseg command to enter the successful installation information:
Ln - S / USR / Local / Mmseg / Bin / Mmseg / Bin / Mmseg
Mmseg
Coreseek cos (TM) mm segment 1.0
Copyright by coreseek.com all right reserved.
Usage: mmseg < Option > < File >
- U < Unidict > Unigram dictionary
- R combine - U, used a plain text build unigram dictionary, default off
- B < Synonyms > Synonyms dictionary
- T < Thesaurus > Thesaurus dictionary
- H Print This help And Exit