Java implementation and NLP application of the longest common substring and the longest common subsequence

Source: Internet
Author: User

Preface before HANLP use "shortest editing distance" to do the recommender, the effect needs to be improved, the main disadvantage is that according to the pinyin sequence of the editing distance recommended, the same word interleaved is very common, and the editing distance is not so large. I was looking for a complementary scoring algorithm to judge how similar the two sentences were to this dimension of pinyin. The difference between the longest common substring (longest Common Substring) refers to the longest common substring in two strings, which requires that the substring must be contiguous. The longest common subsequence (longest Common Substring) refers to the longest common substring in a two string, and does not require continuous substring. The solution of the two is the same as the editing distance, are dynamic planning, with space exchange ...

Continue reading : yards Farm» Java implementation of longest common substring, longest common subsequence and NLP application

original link : http://www.hankcs.com/program/algorithm/ Implementation-and-application-of-nlp-longest-common-subsequence-longest-common-subsequence-of-java.html

Java implementation and NLP application of the longest common substring and the longest common subsequence

Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.