Lucene: three simple highlighting examples using Highlighter

Last Update:2016-11-29 Source: Internet

Author: User

Lucene provides two implementations for highlighting: Highlighter and FastVectorHighlighter.

All three examples here use Highlighter.

Example code:

package com.tan.code;

import java.io.File;
import java.io.IOException;
import java.io.StringReader;

import org.apache.lucene.analysis.TokenStream;
import org.apache.lucene.analysis.core.SimpleAnalyzer;
import org.apache.lucene.document.Document;
import org.apache.lucene.index.DirectoryReader;
import org.apache.lucene.index.IndexReader;
import org.apache.lucene.index.Term;
import org.apache.lucene.queryparser.classic.ParseException;
import org.apache.lucene.queryparser.classic.QueryParser;
import org.apache.lucene.search.IndexSearcher;
import org.apache.lucene.search.Query;
import org.apache.lucene.search.ScoreDoc;
import org.apache.lucene.search.TermQuery;
import org.apache.lucene.search.TopDocs;
import org.apache.lucene.search.highlight.Highlighter;
import org.apache.lucene.search.highlight.InvalidTokenOffsetsException;
import org.apache.lucene.search.highlight.QueryScorer;
import org.apache.lucene.search.highlight.SimpleHTMLFormatter;
import org.apache.lucene.search.highlight.SimpleSpanFragmenter;
import org.apache.lucene.search.highlight.TokenSources;
import org.apache.lucene.store.Directory;
import org.apache.lucene.store.SimpleFSDirectory;
import org.apache.lucene.util.Version;
import org.wltea.analyzer.lucene.IKAnalyzer;

public class HighlighterTest {
    // Text to highlight (the content is purely fictitious)
    private String text = "China has lots of people,most of them is very poor. China is very big. China become strong now,but The poor people are also poor than other controry ";

    // Basic highlighting with the default formatter (<B>...</B>)
    public void highlighter() throws IOException, InvalidTokenOffsetsException {
        // A TermQuery is not analyzed, and SimpleAnalyzer lowercases its tokens,
        // so the term must be lowercase to match.
        TermQuery termQuery = new TermQuery(new Term("field", "china"));
        TokenStream tokenStream = new SimpleAnalyzer(Version.LUCENE_43)
                .tokenStream("field", new StringReader(text));
        QueryScorer queryScorer = new QueryScorer(termQuery);
        Highlighter highlighter = new Highlighter(queryScorer);
        highlighter.setTextFragmenter(new SimpleSpanFragmenter(queryScorer));
        System.out.println(highlighter.getBestFragment(tokenStream, text));
    }

    // Highlighting with a custom CSS-styled tag
    public void highlighter_css(String searchText) throws ParseException,
            IOException, InvalidTokenOffsetsException {
        // Build the query
        QueryParser queryParser = new QueryParser(Version.LUCENE_43, "field",
                new SimpleAnalyzer(Version.LUCENE_43));
        Query query = queryParser.parse(searchText);
        // Custom tags wrapped around each highlighted term
        SimpleHTMLFormatter htmlFormatter = new SimpleHTMLFormatter(
                "<span style=\"background:red\">", "</span>");
        // Token stream over the text to highlight
        TokenStream tokenStream = new SimpleAnalyzer(Version.LUCENE_43)
                .tokenStream("field", new StringReader(text));
        // Create the QueryScorer
        QueryScorer queryScorer = new QueryScorer(query, "field");
        Highlighter highlighter = new Highlighter(htmlFormatter, queryScorer);
        highlighter.setTextFragmenter(new SimpleSpanFragmenter(queryScorer));
        // Up to 4 best fragments, joined by "..."
        System.out.println(highlighter.getBestFragments(tokenStream, text, 4,
                "..."));
    }

    // Highlighting search results
    public void highlighter_sr(String field, String searchText)
            throws IOException, ParseException, InvalidTokenOffsetsException {
        // For convenience, this example reuses the index built in a previous experiment
        Directory directory = new SimpleFSDirectory(new File("E://myindex"));
        IndexReader reader = DirectoryReader.open(directory); // Open the index directory
        IndexSearcher search = new IndexSearcher(reader); // Initialize the search component
        QueryParser parser = new QueryParser(Version.LUCENE_43, field,
                new IKAnalyzer(true));
        Query query = parser.parse(searchText);
        TopDocs td = search.search(query, 10000); // Get the top matching documents
        ScoreDoc[] sd = td.scoreDocs; // All hits (doc id plus score)
        System.out.println("Hit data: " + sd.length);
        QueryScorer scorer = new QueryScorer(query, "content");
        Highlighter highlighter = new Highlighter(scorer);
        highlighter.setTextFragmenter(new SimpleSpanFragmenter(scorer));
        for (ScoreDoc scoreDoc : sd) {
            Document document = search.doc(scoreDoc.doc);
            String content = document.get("content");
            // Uses stored term vectors if available, otherwise re-analyzes the stored field
            TokenStream tokenStream = TokenSources.getAnyTokenStream(
                    search.getIndexReader(), scoreDoc.doc, "content", document,
                    new IKAnalyzer(true));
            System.out.println(highlighter
                    .getBestFragment(tokenStream, content));
        }
        reader.close();
    }
}
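
To make the role of the formatter more concrete, here is a minimal sketch in plain Java of what wrapping matched terms in custom tags amounts to. It is not how Lucene implements Highlighter: the class NaiveHighlighter and its highlight method are hypothetical names made up for this illustration, and the letter-run tokenization with lowercasing only roughly imitates SimpleAnalyzer.

```java
import java.util.Locale;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration only; not part of Lucene.
class NaiveHighlighter {

    // Splits the text into runs of letters, compares each token to the term
    // case-insensitively, and wraps every match in preTag/postTag.
    static String highlight(String text, String term, String preTag, String postTag) {
        String wanted = term.toLowerCase(Locale.ROOT);
        Matcher letters = Pattern.compile("\\p{L}+").matcher(text);
        StringBuilder out = new StringBuilder();
        int last = 0;
        while (letters.find()) {
            // Copy the non-letter characters between tokens unchanged
            out.append(text, last, letters.start());
            String token = letters.group();
            if (token.toLowerCase(Locale.ROOT).equals(wanted)) {
                out.append(preTag).append(token).append(postTag);
            } else {
                out.append(token);
            }
            last = letters.end();
        }
        out.append(text.substring(last));
        return out.toString();
    }

    public static void main(String[] args) {
        String text = "China is very big. China become strong now";
        // prints: <B>China</B> is very big. <B>China</B> become strong now
        System.out.println(highlight(text, "china", "<B>", "</B>"));
        System.out.println(highlight(text, "big", "<span style=\"background:red\">", "</span>"));
    }
}
```

The original casing of the text is preserved in the output; only the comparison is done on lowercased tokens, which is why a lowercase term still highlights "China".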

Test code:

@Test
public void test() throws IOException, InvalidTokenOffsetsException,
        ParseException {
    // fail("Not yet implemented");
    HighlighterTest highlighterTest = new HighlighterTest();
    highlighterTest.highlighter();
    highlighterTest.highlighter_css("China");
    highlighterTest.highlighter_css("poor");
    highlighterTest.highlighter_sr("content", "moon Light Before Bed");
}

Test results:

<B>China</B> has lots of people,most of them is very poor. <B>China</B> is very big. <B>China</B> become strong now,but The poor people are also poor than other controry
<span style="background:red">China</span> has lots of people,most of them is very poor. <span style="background:red">China</span> is very big. <span style="background:red">China</span> become strong now,but The poor people are also poor than other controry
China has lots of people,most of them is very <span style="background:red">poor</span>. China is very big. China become strong now,but The <span style="background:red">poor</span> people are also <span style="background:red">poor</span> than other controry
Hit data: 1
<B>Bed Pre Bright Moon light</B>, suspect is ground frost
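
The sample text is short enough that each call returns it as a single fragment. On longer text, getBestFragments with a maximum count and a "..." separator returns several fragments joined together. The following plain-Java sketch illustrates that idea under loud assumptions: fragments are fixed-size character chunks, the score is a simple case-insensitive hit count, and the best-scoring fragment comes first. NaiveFragments, score, and bestFragments are hypothetical names, and Lucene's own fragmenters and scorers are considerably more elaborate.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.Locale;
import java.util.stream.Collectors;

// Hypothetical illustration only; not Lucene's fragmenting or scoring logic.
class NaiveFragments {

    // Counts non-overlapping, case-insensitive occurrences of term in fragment.
    static int score(String fragment, String term) {
        String haystack = fragment.toLowerCase(Locale.ROOT);
        String needle = term.toLowerCase(Locale.ROOT);
        if (needle.isEmpty()) {
            return 0;
        }
        int count = 0;
        for (int i = haystack.indexOf(needle); i >= 0;
                i = haystack.indexOf(needle, i + needle.length())) {
            count++;
        }
        return count;
    }

    // Cuts the text into fixed-size chunks, drops chunks without a hit, and
    // joins the highest-scoring ones (best first) with the separator.
    static String bestFragments(String text, String term, int fragmentSize,
            int maxFragments, String separator) {
        List<String> fragments = new ArrayList<>();
        for (int start = 0; start < text.length(); start += fragmentSize) {
            fragments.add(text.substring(start, Math.min(text.length(), start + fragmentSize)));
        }
        return fragments.stream()
                .filter(f -> score(f, term) > 0)
                .sorted(Comparator.comparingInt((String f) -> score(f, term)).reversed())
                .limit(maxFragments)
                .collect(Collectors.joining(separator));
    }

    public static void main(String[] args) {
        String text = "poor man. rich man. poor, poor";
        // Chunks of 10 characters score 1, 0 and 2 hits for "poor"
        System.out.println(bestFragments(text, "poor", 10, 2, "..."));
    }
}
```

Cutting on character counts can split a word in half, which is one reason a real fragmenter works on token boundaries instead.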

This article is an English version of an article originally written in Chinese on aliyun.com.
