Nine Java-based search engine frameworks

Source: Internet
Author: User

In the crowded information age of the Internet, we have learned to rely on search engines to find the information we want. For example, you might search Google around Valentine's Day hoping to find a girlfriend, or look for a reputable cosmetic medical institution (although a large share of the ads are scams). If your own website needs to let users search for important information and present the results in a structured way, the following nine Java search engine frameworks may help.

1, Java full-text search engine framework Lucene

Lucene is arguably the most popular Java full-text search framework. More precisely, it is a full-text search engine architecture that provides a complete query engine and indexing engine, plus part of a text analysis engine. Lucene gives developers a fairly complete toolkit for implementing powerful full-text search. Several of the search engine frameworks below are themselves built on Lucene.

Official website: http://lucene.apache.org/
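
To make the split between text analysis, indexing and querying more concrete, here is a minimal sketch of an inverted index, the general data structure behind full-text search. It deliberately does not use Lucene's API: the class name InvertedIndexSketch and its methods are assumptions made up for this illustration, and a real Lucene index does far more (scoring, phrase queries, on-disk segments).

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeSet;

// Toy inverted index. Hypothetical illustration only, NOT Lucene's API.
class InvertedIndexSketch {
    // term -> sorted ids of the documents that contain the term
    private final Map<String, TreeSet<Integer>> postings = new HashMap<>();

    // "Text analysis": lowercase the text and split on non-alphanumerics.
    static List<String> analyze(String text) {
        List<String> terms = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!t.isEmpty()) {
                terms.add(t);
            }
        }
        return terms;
    }

    // "Index engine": record which document each term occurs in.
    void add(int docId, String text) {
        for (String term : analyze(text)) {
            postings.computeIfAbsent(term, k -> new TreeSet<>()).add(docId);
        }
    }

    // "Query engine": ids of the documents that contain every query term.
    List<Integer> search(String query) {
        TreeSet<Integer> result = null;
        for (String term : analyze(query)) {
            TreeSet<Integer> docs = postings.get(term);
            if (docs == null) {
                return Collections.emptyList(); // one term matches nothing
            }
            if (result == null) {
                result = new TreeSet<>(docs);
            } else {
                result.retainAll(docs); // intersect with earlier terms
            }
        }
        if (result == null) {
            return Collections.emptyList(); // empty query
        }
        return new ArrayList<>(result);
    }

    public static void main(String[] args) {
        InvertedIndexSketch index = new InvertedIndexSketch();
        index.add(1, "Lucene is a Java full-text search library");
        index.add(2, "Nutch is a web crawler built on Lucene");
        index.add(3, "Cassandra is a distributed database");
        System.out.println(index.search("lucene"));      // prints [1, 2]
        System.out.println(index.search("java lucene")); // prints [1]
        System.out.println(index.search("solr"));        // prints []
    }
}
```

The sketch answers a query by intersecting posting lists, which is why lookups stay fast even when the document collection grows: only the lists for the query terms are touched, not every document.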

2, open source Java search engine Nutch

Nutch is an open source search engine implemented in Java. It provides all the tools needed to run your own search engine, including full-text search and a web crawler.

With Nutch, you can do the following:

  • Fetch several billion pages per month
  • Maintain an index of these pages
  • Search that index thousands of times per second
  • Deliver high-quality search results at minimal cost

Official website: http://nutch.apache.org/

3, distributed search engine ElasticSearch

ElasticSearch is a distributed search engine built on Lucene, and one of the few search engines with a JSON-based interface. It is particularly well suited to cloud computing platforms.

Official Website: http://www.elasticsearch.org/
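
As a rough illustration of what a JSON-based interface means in practice, the sketch below builds the body of a "match" query and the URL of the _search endpoint it would be posted to. The host localhost:9200, the index name articles and the field name title are assumptions for the example, and the class EsQuerySketch is hypothetical; the code only constructs the request and does not send it.

```java
import java.net.URI;

// Builds, but does not send, a JSON search request. Hypothetical helper class.
class EsQuerySketch {
    // Escape characters that would break a JSON string literal.
    static String jsonEscape(String s) {
        StringBuilder sb = new StringBuilder();
        for (char c : s.toCharArray()) {
            switch (c) {
                case '"':  sb.append("\\\""); break;
                case '\\': sb.append("\\\\"); break;
                case '\n': sb.append("\\n");  break;
                case '\r': sb.append("\\r");  break;
                case '\t': sb.append("\\t");  break;
                default:
                    if (c < 0x20) {
                        sb.append(String.format("\\u%04x", (int) c));
                    } else {
                        sb.append(c);
                    }
            }
        }
        return sb.toString();
    }

    // Body of a "match" query against a single field.
    static String matchQuery(String field, String text) {
        return "{\"query\":{\"match\":{\"" + jsonEscape(field) + "\":\""
                + jsonEscape(text) + "\"}}}";
    }

    // URL of the search endpoint for one index (names are assumptions).
    static URI searchUri(String baseUrl, String index) {
        return URI.create(baseUrl + "/" + index + "/_search");
    }

    public static void main(String[] args) {
        // prints http://localhost:9200/articles/_search
        System.out.println(searchUri("http://localhost:9200", "articles"));
        // prints {"query":{"match":{"title":"java search"}}}
        System.out.println(matchQuery("title", "java search"));
    }
}
```

In a real application you would normally use an official client library or a JSON library rather than concatenating strings; the manual escaping here only keeps the example free of dependencies.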

4, real-time distributed search engine Solandra

Solandra is a real-time, distributed search engine built on Apache Solr and Apache Cassandra.

Its characteristics are as follows:

  • Supports most of Solr's default features (search, faceting, highlights)
  • Data replication, sharding, caching, and compression are all handled by Cassandra
  • Multi-master (any node can read and write)
  • Near-real-time reads: writes quickly become available for search
  • Easily add new SolrCores without restarting, across the cluster
  • Easily add and restart nodes

Official site: https://github.com/tjake/Solandra

5, IndexTank

IndexTank is a Java-based real-time full-text indexing and search engine. It has the following characteristics:

  • Index updates take effect in real time
  • Location-based search
  • Client libraries for multiple languages: Ruby, Rails, Python, Java, PHP, .NET & more
  • Flexible sorting and scoring control
  • Auto-complete
  • Faceted search
  • Highlighting of matches
  • Scales to large data volumes (from a personal blog to hundreds of millions of documents)
  • Support for dynamic data

Official website: https://github.com/linkedin/indextank-engine

6, search engine Compass

Compass is a powerful, transactional, high-performance Java framework for object/search engine mapping (OSEM). It includes:

  • A search engine abstraction layer (implemented with Lucene)
  • OSEM (Object/Search Engine Mapping) support
  • Transaction management
  • A simple, Google-like keyword query language
  • An extensible and modular framework
  • A simple API

Official Website: http://www.compass-project.org/
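
The idea behind object/search engine mapping can be sketched with plain reflection: annotated fields of a domain object are copied into a flat, searchable document. The @Indexed annotation, the Article class and the toDocument method below are all hypothetical names invented for this sketch; they are not Compass's own annotations or API.

```java
import java.lang.annotation.ElementType;
import java.lang.annotation.Retention;
import java.lang.annotation.RetentionPolicy;
import java.lang.annotation.Target;
import java.lang.reflect.Field;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical object-to-document mapper. Not Compass's API.
class OsemSketch {
    // Hypothetical marker for fields that should become searchable.
    @Retention(RetentionPolicy.RUNTIME)
    @Target(ElementType.FIELD)
    @interface Indexed {}

    // Example domain object.
    static class Article {
        @Indexed private final String title;
        @Indexed private final String body;
        private final String internalNote; // not annotated, so not indexed

        Article(String title, String body, String internalNote) {
            this.title = title;
            this.body = body;
            this.internalNote = internalNote;
        }
    }

    // Copy every @Indexed field into a name -> value document.
    static Map<String, String> toDocument(Object entity) {
        Map<String, String> doc = new TreeMap<>(); // sorted for stable output
        for (Field field : entity.getClass().getDeclaredFields()) {
            if (!field.isAnnotationPresent(Indexed.class)) {
                continue;
            }
            field.setAccessible(true); // the example fields are private
            try {
                doc.put(field.getName(), String.valueOf(field.get(entity)));
            } catch (IllegalAccessException e) {
                throw new IllegalStateException(e);
            }
        }
        return doc;
    }

    public static void main(String[] args) {
        Article a = new Article("Search frameworks", "Lucene and friends", "draft");
        // prints {body=Lucene and friends, title=Search frameworks}
        System.out.println(toDocument(a));
    }
}
```

A mapping layer of this kind lets application code work with ordinary objects while the search engine only ever sees field/value documents.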

7, Java full-text search server Solr

Solr is a Java search server that is also built on Lucene. Its key features include efficient and flexible caching, vertical search, and highlighting of search results. Solr also provides a web interface for managing indexed data.

Official website: http://lucene.apache.org/solr/
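
Because Solr runs as a server, queries are typically sent to it over HTTP. The sketch below builds a request URL for Solr's select handler with highlighting switched on, using the standard q, hl, hl.fl and wt parameters. The host localhost:8983, the core name articles and the field name title are assumptions for the example, and the class SolrUrlSketch is hypothetical; the code only builds the URL and does not contact a server.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;

// Builds a Solr select URL with highlighting enabled. Hypothetical helper.
class SolrUrlSketch {
    // URL-encode a parameter value as UTF-8.
    static String enc(String value) {
        try {
            return URLEncoder.encode(value, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            throw new IllegalStateException(e); // UTF-8 is always available
        }
    }

    static String selectUrl(String baseUrl, String core,
                            String query, String highlightField) {
        return baseUrl + "/" + core + "/select"
                + "?q=" + enc(query)             // the query itself
                + "&hl=true"                     // turn highlighting on
                + "&hl.fl=" + enc(highlightField) // field to highlight
                + "&wt=json";                    // ask for a JSON response
    }

    public static void main(String[] args) {
        // prints http://localhost:8983/solr/articles/select?q=title%3Ajava&hl=true&hl.fl=title&wt=json
        System.out.println(
                selectUrl("http://localhost:8983/solr", "articles", "title:java", "title"));
    }
}
```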

8, Lucene picture search LIRE

LIRE is a Java image search framework built on Lucene. It builds a Lucene index of image features to provide content-based image retrieval (CBIR), that is, searching for visually similar images.

Official Website: http://www.semanticmetadata.net/lire/
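
To give a feel for content-based image retrieval, the sketch below reduces each image to a coarse RGB color histogram and compares histograms with an L1 distance, so that a smaller distance means more similar colors. This is an assumption chosen for brevity, not necessarily the features LIRE computes, and the class HistogramSketch and its methods are hypothetical. The test images are generated in memory so the example needs no files.

```java
import java.awt.image.BufferedImage;

// Toy color-histogram comparison. Hypothetical illustration, not LIRE's API.
class HistogramSketch {
    static final int BINS = 4; // bins per channel, 4 * 4 * 4 = 64 in total

    // Normalized RGB histogram: fraction of pixels falling in each bin.
    static double[] histogram(BufferedImage img) {
        double[] h = new double[BINS * BINS * BINS];
        for (int y = 0; y < img.getHeight(); y++) {
            for (int x = 0; x < img.getWidth(); x++) {
                int rgb = img.getRGB(x, y);
                int r = ((rgb >> 16) & 0xFF) * BINS / 256;
                int g = ((rgb >> 8) & 0xFF) * BINS / 256;
                int b = (rgb & 0xFF) * BINS / 256;
                h[(r * BINS + g) * BINS + b]++;
            }
        }
        double pixels = (double) img.getWidth() * img.getHeight();
        for (int i = 0; i < h.length; i++) {
            h[i] /= pixels;
        }
        return h;
    }

    // L1 distance: 0.0 for identical histograms, 2.0 for disjoint ones.
    static double distance(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            sum += Math.abs(a[i] - b[i]);
        }
        return sum;
    }

    // Synthetic image: left half one color, right half another.
    static BufferedImage twoTone(int width, int height, int leftRgb, int rightRgb) {
        BufferedImage img = new BufferedImage(width, height, BufferedImage.TYPE_INT_RGB);
        for (int y = 0; y < height; y++) {
            for (int x = 0; x < width; x++) {
                img.setRGB(x, y, x < width / 2 ? leftRgb : rightRgb);
            }
        }
        return img;
    }

    public static void main(String[] args) {
        double[] red = histogram(twoTone(4, 4, 0xFF0000, 0xFF0000));
        double[] mixed = histogram(twoTone(4, 4, 0xFF0000, 0x0000FF));
        double[] blue = histogram(twoTone(4, 4, 0x0000FF, 0x0000FF));
        System.out.println(distance(red, red));   // prints 0.0
        System.out.println(distance(red, mixed)); // prints 1.0
        System.out.println(distance(red, blue));  // prints 2.0
    }
}
```

A retrieval system would store such feature vectors in an index and return the images whose vectors are closest to that of the query image.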

9, full-text search engine Egothor

Egothor is an efficient open source full-text search engine written in Java. Because Java is cross-platform, Egothor can run in any environment, either as a standalone search engine or embedded in your application to provide full-text search.

Official Website: http://www.egothor.org/cms/

Original link: http://www.codeceo.com/article/8-java-search-engine.html
