Feature implementation:One: After the project starts, automatically monitors all data models, the data that is queried. Create an indexTwo: dynamic automatic updating of incremental Data Index and maintenance index.This is a project that builds an index based on the data model, the coupling degree is low, the expansibility is high. Different from the general full-text search project with business nature. For example: Common e-commerce business, Consum
Lucene is a subproject of the Apache Software Foundation 4 Jakarta Project group, an open source full-Text Search engine toolkit, which is not a full-text search engine, but a full-text search
Apache SOLR 1.1 is the first SOLR release since joining the Apache incubator.
SOLR is a high performance full-text search Server Based on Lucene, written in java5, and easily extensible through plugins written in Java. events a
Open source search engine Toolkit
1. Lucene
Lucene is currently the most popular open-source full-text search engine toolkit. It is affiliated to the Apache Foundation and initiated by Doug Cutting, a senior full-text indexing/retrieval expert, take the name of the project a
Part of the content is transferred from: http://blog.csdn.net/hguisu/article/details/8024799First, open source project1.Lucene full-text retrieval systemHttp://lucene.apache.org and http://www.lucene.com.cn/Lucene is a subproject of the Apache Software Foundation 4 Jakarta Project group, an open source full-Text Search engine toolkit, which is not a full-text
Chinese characters, and you need to consider numbers and special characters.
Need to maintain pinyin, abbreviated two trie trees.
Scenario Two SOLR comes with suggest smart tipsSOLR, as a widely used search engine system, has built-in smart hints, called suggest modules. The module can choose to do smart hints based on the text of the cue word, and also
://lucene.apache.org/2. Open source Java search engine NutchNutch is an open source Java-implemented search engine. It provides all the tools we need to run our own search engine. Includes full-text
://lucene.apache.org/ 2. Open source Java search engine NutchNutch is an open source Java-implemented search engine. It provides all the tools we need to run our own search engine. Includes full-text
In addition, as the content of the Internet with an alarming rate of growth has become more and more prominent the importance of search engines, if the site wants to be better indexed by search engines, site design In addition to user-friendly (users friendly), search engine friendly (searching
: http://nutch.apache.org/3. Distributed Search engine ElasticSearchElasticsearch is a distributed search engine based on the Lucene framework, and is one of the few search engines that are indexed based on JSON. The Elasticsearch is particularly suitable for use on cloud co
is a distributed search engine based on the Lucene framework, and is one of the few search engines that are indexed based on JSON. The Elasticsearch is particularly suitable for use on cloud computing platforms.Official website: http://www.elasticsearch.org/4. Real-time distributed search
IBM WebSphere Commerce uses Apache SOLR Search as a search engine solution that provides a full range of packages, mainly in the following areas:
SOLR Multi-core creation
Preprocess Indexbuild processed by DIH (Data
magnitude. Therefore, maintain a K (10) Size of the small Gan, and then traverse 3 million of the query, respectively, and the root element to compare. So, our final time complexity is: O (n) + N ' * O (LOGK), (n is 10 million, N ' is 3 million).The problem with this scenario is:
When you build indexes and queries, you convert Chinese characters to pinyin, and after the query is finished, you have to convert the pinyin into Chinese characters, and you need to consider numbers and speci
Lucene is a subproject of the Jakarta Project Team of the Apache Software Foundation. It is an openSource codeIs not a complete full-text search engine, but a full-text search engine architecture, provides a complete query engine
simplifying the complexity of SOLR, users can perform related data manipulation through simple SQL statements. Tngoudb can completely throw away the Lucene knowledge associated with SOLR and can be implemented with common SQL statements.DocumentDocument Address: HTTP://WWW.TNGOU.NET/DOC/TNDB supports complete installation, configuration, and use of documentation.Use caseNow TNGOUDB is the internal test ver
complexity of SOLR, users can perform related data manipulation through simple SQL statements. Tngoudb can completely throw away the Lucene knowledge associated with SOLR and can be implemented with common SQL statements.DocumentDocument Address: HTTP://WWW.TNGOU.NET/DOC/TNDB supports complete installation, configuration, and use of documentation.Use caseNow TNGOUDB is the internal test version, please do
Introduction to the framework and implementation of Word segmentation system---This article is suitable for readers with good concept of search engine (original)keywords : Search engine, participle, LuceneThe domestic vertical field of e-commerce or information sharing applications are in a high-speed development perio
http://www.lucene.com.cn/Lucene is a sub-project of the Jakarta Project Team of the Apache Software Foundation. It is an open-source full-text search engine toolkit [written in Java], that is, it is not a complete full-text search engine, it is a full-text
Directory (?) [+]Open Source Search engine evaluation: Lucene Sphinx elasticsearch Open Source Search engine program has 3 major categories
Lucene System, Java development, including SOLR and Elasticsearch
Sphinx, C + + development, simple and high performance
://www.nutch.org/Chinese site http://www.nutchchina.com/Latest Version: nutch 0.7.2 released
Nutch is a search engine implemented by open-source Java. It provides all the tools we need to run our own search engine. you can create your own search
The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion;
products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the
content of the page makes you feel confusing, please write us an email, we will handle the problem
within 5 days after receiving your email.
If you find any instances of plagiarism from the community, please send an email to:
info-contact@alibabacloud.com
and provide relevant evidence. A staff member will contact you within 5 working days.