1, Nlpir Introduction
NLPIR Chinese Word segmentation system (aka ICTCLAS2013), the main functions include Chinese word segmentation, POS tagging, named entity recognition, user dictionary function, support GBK encoding, UTF8 encoding, BIG5 encoding. Added micro-bo participle, new word discovery and keyword extraction; Dr. Zhang Huaping has made more than 10 years, the kernel upgrade 10 times.
: Http://ictclas.nlpir.org/downloads
2. Introduction of Catalogue
ReadMe: Introduction
Data: System Core Thesaurus
Doc: Documentation, Introduction to Usage
Include: System header File
Sample:nlpir Sample Program
LIB: Support Library in different environments
Test: Testing Data
bin:ictclas2014-----------> Participle system
ictclas-tools.exe--> participle support tool, can be used for testing, the main use to do user dictionary import
Importuserdict.bat--> user dictionaries can be automatically imported into the system
Support dynamic link library under Nlpir.dll-->win32, Library of other environment, can access library file of Lib corresponding environment.
NLPIR.lib
Demo program under Nlpir_windemo.exe-->win32, compiled under WIN8 32 bit, may not be supported in some environments, or display exceptions
userdic.txt--> user dictionary, users can edit their own
3. Java Code
3.1 New Project Fenci import Lib library file (Sample\jnatest_nlpir\lib\jna-4.0.0.jar)
3.2 Copy the files from the Sample\jnatest_nlpir\src\ directory to the project SRC directly using
3.3 Modifying the code
Java implementation Nlpir (Ictclas) participle