Lucene分詞報錯:”TokenStream contract violation: close() call missing”

來源:互聯網
上載者:User

標籤:core   api   als   final   lte   lang   result   開始   setting   

Lucene使用IKAnalyzer分詞時報錯:”TokenStream contract violation: close() call missing”  解決辦法是每次完成後必須調用關閉方法。

如果報錯:java.lang.illegalstateexception: tokenstream contract violation: reset()/close() call missing,則要在tokenStream.incrementToken(),原因是lucene從4.6.0開始tokenstream使用方法更改的問題,在使用incrementtoken方法前必須調用reset方法,詳見api http://lucene.apache.org/core/4_6_0/core/index.html 。

以下正確範例程式碼(第10行和22行調用reset()和close()方法):

 
public Set<String> slicing(String text){    Set<String> result = new HashSet<>();    StringReader reader = null;    TokenStream tokenStream = null;    try {        reader = new StringReader(text);        tokenStream = analyzer.tokenStream("", reader);          CharTermAttribute charTermAttribute  = tokenStream.getAttribute(CharTermAttribute.class);         OffsetAttribute offsetAttribute = tokenStream.addAttribute(OffsetAttribute.class);          tokenStream.reset();            while (tokenStream.incrementToken()) {                  int startOffset = offsetAttribute.startOffset();                  int endOffset   = offsetAttribute.endOffset();                if((endOffset - startOffset) > 1){                    String term = charTermAttribute.toString();                     result.add(term);                }            }      } catch (IOException e) {        e.printStackTrace();    } finally{        IOs.close(tokenStream, reader);    }    return result;}

 

http://www.lizi.pw/archives/56

 

org.wltea.analyzer.lucene.IKAnalyzer

Exception in thread "main" java.lang.IllegalStateException: 詞典尚未初始化,請先調用initial方法at org.wltea.analyzer.dic.Dictionary.getSingleton(Dictionary.java:137)at org.wltea.analyzer.core.CJKSegmenter.analyze(CJKSegmenter.java:80)at org.wltea.analyzer.core.IKSegmenter.next(IKSegmenter.java:116)at org.wltea.analyzer.lucene.IKTokenizer.incrementToken(IKTokenizer.java:88)

 

Lucene分詞報錯:”TokenStream contract violation: close() call missing”

聯繫我們

該頁面正文內容均來源於網絡整理,並不代表阿里雲官方的觀點,該頁面所提到的產品和服務也與阿里云無關,如果該頁面內容對您造成了困擾,歡迎寫郵件給我們,收到郵件我們將在5個工作日內處理。

如果您發現本社區中有涉嫌抄襲的內容,歡迎發送郵件至: info-contact@alibabacloud.com 進行舉報並提供相關證據,工作人員會在 5 個工作天內聯絡您,一經查實,本站將立刻刪除涉嫌侵權內容。

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.