Chinese SOLR Synonyms

Source: Internet
Author: User
Tags: solr
Symptom:

After Chinese synonyms are added to synonyms.txt, Tomcat reports an error on startup and the file cannot be read.

Cause:

The encoding of the synonyms.txt file that Solr loads must be the same as the encoding used by the rest of your system.
For example, if your entire system uses UTF-8, synonyms.txt must also be saved as UTF-8. If the encodings differ, the synonyms looked up for a word come back garbled, and the garbled strings do not exist in the index. As a result, Chinese synonyms cannot be matched.
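The mismatch described above can be reproduced in a few lines. This is only a sketch: it assumes the file was saved in GBK (a common default on Chinese Windows systems, but not stated in the original) and is then read as UTF-8. The term 电脑 and the helper name `decode_as` are illustrative.

```python
def decode_as(raw: bytes, encoding: str) -> str:
    """Decode bytes with the given encoding, replacing undecodable bytes."""
    return raw.decode(encoding, errors="replace")


# A term as it would appear in a UTF-8 index (illustrative example).
indexed_term = "电脑"

# Assumption: synonyms.txt was saved in GBK rather than UTF-8.
raw_from_file = indexed_term.encode("gbk")

# Reading those bytes as UTF-8 yields garbled text, not the original term.
read_back = decode_as(raw_from_file, "utf-8")
matches_index = read_back == indexed_term

# With a consistent encoding, the round trip preserves the term.
consistent = decode_as(indexed_term.encode("utf-8"), "utf-8")

print(matches_index)
print(consistent == indexed_term)
```

The garbled string can never equal the indexed term, which is why the synonym lookup finds nothing.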

Solution:

In Eclipse, convert the encoding of synonyms.txt to UTF-8 (the system default here is UTF-8), then enter the Chinese synonyms, save the file, and restart Tomcat. This resolves the problem.
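If Eclipse is not at hand, the same conversion can be scripted. The following is a minimal sketch, not part of the original fix: the function names are hypothetical, and the source encoding (`gbk` by default here) is an assumption that you must replace with whatever encoding the file was actually saved in.

```python
from pathlib import Path


def convert_to_utf8(path: str, source_encoding: str = "gbk") -> None:
    """Rewrite the file at `path` as UTF-8.

    `source_encoding` is an assumption; a wrong value raises
    UnicodeDecodeError or silently produces wrong characters.
    """
    file_path = Path(path)
    text = file_path.read_bytes().decode(source_encoding)
    file_path.write_bytes(text.encode("utf-8"))


def is_valid_utf8(path: str) -> bool:
    """Return True if the whole file decodes cleanly as UTF-8."""
    try:
        Path(path).read_bytes().decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# Example usage (the path is illustrative):
# convert_to_utf8("conf/synonyms.txt", source_encoding="gbk")
# print(is_valid_utf8("conf/synonyms.txt"))
```

Run the check before restarting Tomcat, and keep a backup of the original file, since the conversion overwrites it in place.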

Summary:

When Chinese synonyms or Chinese word segmentation do not work in Solr, the core issue is that the encoding is not unified, which leads to garbled characters.
