Changes to the release of the .NET version of Chinese word segmentation
These days, almost every day a friend writes to me asking for the implementation of the Chinese word segmentation. I have implemented it in both Java and C#. Writing the same algorithm logic twice, once in Java and once in C#, is not very interesting. So naturally I thought of the way Lucene is developed and then implemented in multiple languages.
Therefore, from now on, updates to the .NET version of the Chinese word segmentation algorithm will mainly come from conversion: the .NET version is generated from the Java classes.
A long time ago, I wrote a blog post about IKVM.NET, a Java Virtual Machine implemented on .NET. So today I tried it, and the whole process went quite smoothly. The following is my conversion process:
X:\ikvmbin-0.14.0.1\ikvm\bin> ikvmc -target:library X:\XXXX\chinese_sentence_splitter.jar
Note: output file is "chinese_sentence_splitter.dll"
Note: automatically adding reference to "E:\programming\Java &.Net\ikvmbin-0.14.0.1\ikvm\bin\IKVM.GNU.Classpath.dll"
The preceding command converts a Java JAR file into a .NET DLL file of the same name.
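For anyone who wants to script this step, here is a minimal sketch in Java of how the ikvmc call above could be assembled. The class name IkvmcCommand and its two methods are hypothetical helpers written for this illustration; they are not part of IKVM.NET. The only things taken from the session above are the -target:library option and the fact that the output DLL keeps the JAR's base name.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

/** Hypothetical helper (not part of IKVM.NET) that mirrors the ikvmc call shown above. */
final class IkvmcCommand {

    /** Returns the DLL name expected for a JAR: same base name, ".dll" extension. */
    static String dllNameFor(String jarFileName) {
        boolean isJar = jarFileName.toLowerCase(Locale.ROOT).endsWith(".jar");
        String base = isJar
                ? jarFileName.substring(0, jarFileName.length() - ".jar".length())
                : jarFileName;
        return base + ".dll";
    }

    /** Builds the argument list: executable, -target:library, then the JAR path. */
    static List<String> build(String ikvmcExecutable, String jarPath) {
        return Arrays.asList(ikvmcExecutable, "-target:library", jarPath);
    }

    public static void main(String[] args) {
        String jar = "chinese_sentence_splitter.jar";
        // Only prints the command; it does not run ikvmc.
        System.out.println(String.join(" ", build("ikvmc", jar)));
        System.out.println("expected output file: " + dllNameFor(jar));
    }
}
```

To actually run the conversion, the list returned by build could be passed to java.lang.ProcessBuilder, assuming ikvmc is on the PATH.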
The test results in Java are as follows:
However, the test results in .NET are incorrect:
Evidently IKVM.NET ran into a problem during the conversion.
This is my first time using IKVM.NET, so the problem still needs to be solved later. I hope that readers with relevant experience can offer some guidance.
Related links:
How happy
Xiao Dingdong Chinese Word Segmentation