UbuntuCoreseek installation Configuration

Source: Internet
Author: User
Everything is installed according to the official instructions, but there is an error in the old configuration when it is the best configuration. In the end, you can only google it, and the configuration below will be done. Just playing coreseek, I am not very familiar with some parameters, but I want to configure it. I didn't take a good look at the official instructions. To prevent compilation errors, first install the following program yum-yinstallmysqlmysql-develphp-mysqlqt4-mysqlpythonpython-devgcc-c ++

Everything is installed according to the official instructions, but there is an error in the old configuration when it is the best configuration. In the end, you can only google it, and the configuration below will be done. Just playing coreseek, I am not very familiar with some parameters, but I want to configure it. I didn't take a good look at the official instructions.

To prevent compilation errors, first install the following program
Yum-y install mysql-devel php-mysql qt4-mysql python-dev gcc-c ++ gtk + libtool automake autoconf glibc-common expat-devel

1. Installation
Wget http://www.coreseek.cn/uploads/csft/3.1/Source/csft-3.1.tar.gz #### coreseek source file
Wget http://www.coreseek.cn/uploads/csft/3.1/Source/mmseg-3.1.tar.gz #### dictionary used by coreseek
Tar zxvf csft-3.1.tar.gz
Tar zxvf mmseg-3.1.tar.gz

##### Install mmseg before installing coreseek
Cd mmseg-3.1
./Configure-prefix =/usr/local/mmseg
Make
Make install

######## Install coreseek ########
# The python data source is not used here. If you need it, add-with-python, and specify the corresponding path on mmseg.
. /Configure-prefix =/usr/local/coreseek-with-mmseg-supported des =/usr/local/mmseg/include/mmseg-with-mmseg-libs =/usr/local/ mmseg/lib-without-iconv

Specifying the-enable-id64 option opens the branch of the 64-bit Document ID and Word ID

Make
Make install

If no problem exists, the coreseek directory and its files are generated under/usr/local/after installation.

The mmseg dictionary and configuration file will be generated:
Cd/usr/loca/mmseg
/Usr/local/mmseg/bin/mmseg-u/usr/local/src/mmseg-3.1/data/unigram.txt contains invalid unigram.txtis the corresponding dictionary file, unigram.txt. uni will be generated
Cd ../coreseek
Mkdir dict ### create a dictionary directory
Cp/usr/local/src/mmseg-3.1/data/unigram.txt. uni dict/uni. lib # copy the created dictionary to dict
Vim dict/mmseg. ini #### create the mmseg configuration file. This file is included in coreseek's windows version!

Mmseg. ini:
[Mmseg]
Merge_number_and_ascii = 1;
Number_and_ascii_joint = -;
Compress_space = 0;
Seperate_number_ascii = 1;
Now, mmseg configuration is complete! Next, configure csft. conf -- coreseek's configuration file.

Source article
{
Type = mysql
SQL _host = localhost
SQL _user = root
SQL _pass = jiaxian
SQL _db = test
SQL _port = 3306 # optional, default is 3306

SQL _query_pre = SET NAMES utf8
# SQL _query_pre = set session query_cache_type = OFF # disable the SQL query cache.
# SQL _query = SELECT id, classid, checked, title, newstime, newstext FROM article
SQL _query_range = SELECT MIN (id), MAX (id) FROM article
SQL _range_step = 1000
SQL _query = SELECT id, classid, checked, title, newstime, newstext FROM article WHERE id >=$ start AND id <= $ end

SQL _attr_uint = classid
SQL _attr_uint = checked
SQL _attr_uint = newstime
SQL _query_info = select * from article where id = $ id

}

Index article
{
Source = article
Path =/usr/local/coreseek/var/data/article
Docinfo = extern
Charset_type = zh_cn.utf-8 ### specify coreseek Encoding
Charset_dictpath =/usr/local/coreseek/dict ##### coreseek dictionary file

Min_prefix_len = 0
Min_infix_len = 0
Min_word_len = 2
Ngram_len = 1
Ngram_chars = U + 4E00 .. U + 9FBF, U + 3400 .. U + 4DBF, U + 20000 .. U + 2A6DF, U + f900.. U + FAFF ,\
U + 2F800 .. U + 2FA1F, U + 2E80 .. U + 2EFF, U + 2F00 .. U + 2FDF, U + 3100 .. U + 312F, U + 31A0 .. U + 31BF ,\
U + 3040 .. U + 309F, U + 30A0 .. U + 30FF, U + 31F0 .. U + 31FF, U + ac00 .. U + D7AF, U + 1100 .. U + 11FF ,\
U + 3130 .. U + 318F, U + a000.. U + A48F, U + a490.. U + A4CF
Html_strip = 0
}

Indexer
{
Mem_limit = 256 M
}
Searchd
{
# Address = 0.0.0.0
Log =/usr/local/coreseek/var/log/searchd. log
Query_log =/usr/local/coreseek/var/log/query. log
Read_timeout = 5
Max_children = 30
Pid_file =/usr/local/coreseek/var/log/searchd. pid
Max_matches = 1000
Seamless_rotate = 1
}

Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.