SOLR
1. Solr server setup
Java environment setup
Download JDK 6 for Linux from this website:
http://java.sun.com/javase/downloads/index.jsp
After installing the JDK, edit /etc/profile and add the following lines to the end of the file:
JAVA_HOME=/usr/java/jdk1.6.0_16
PATH=$JAVA_HOME/bin:$PATH
CLASSPATH=.:$JAVA_HOME/lib/dt.jar:$JAVA_HOME/lib/tools.jar
export JAVA_HOME
export PATH
export CLASSPATH
/usr/java/jdk1.
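After reloading the profile (for example with `source /etc/profile`), it can help to confirm that the variables actually took effect. The following is a minimal sketch, not part of the original instructions; the function name `check_java_env` is hypothetical.

```shell
#!/bin/bash
# Hypothetical helper: reports whether JAVA_HOME is set and whether
# $JAVA_HOME/bin appears as an entry in PATH.
check_java_env() {
    if [ -z "$JAVA_HOME" ]; then
        echo "JAVA_HOME is not set"
        return 1
    fi
    # Wrap PATH in colons so that the first and last entries match too.
    case ":$PATH:" in
        *":$JAVA_HOME/bin:"*)
            echo "JAVA_HOME/bin is on PATH"
            ;;
        *)
            echo "JAVA_HOME/bin is missing from PATH"
            return 1
            ;;
    esac
}
```

Calling `check_java_env` in a fresh login shell shows whether the /etc/profile changes were picked up; running `java -version` afterwards confirms which JDK is actually found.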
1. Why I am writing the "Lucene/Solr Search Engine Development Series" blog: I graduated in 2011. For the three years from 2011 to 2014 I worked at a top-50 enterprise in Shenzhen on machine vision for industrial control, mainly using C++. I now work at an e-commerce company owned by a large state-owned enterprise, mainly using Java,
folder and copy the E:\solr-4.9.1\example\resources\log4j.properties file to the classes folder, then edit log4j.properties and change the value of "log4j.appender.file.File" to "./logs/solr.log";
Figure 1-12 Solr running successfully in Tomcat
11. Delete the solr.war file under the E:\apache-tomcat-8.0.9\webapps folder, restart Tomcat, enter the http:
lot of study to understand how it works, and using Lucene skillfully is really quite complex.
Solr: Apache Solr is an open-source search platform built on a Java library called Lucene. It provides Apache Lucene's search functionality in a user-friendly manner. As an industry participant for nearly a decade, it
" useSmart="false"/>"query"> class="org.wltea.analyzer.lucene.IKAnalyzerSolrFactory" useSmart="true"/>
solrconfig.xml: If the schema defines Solr's data model, then solrconfig.xml is Solr's configuration. It defines how Solr handles requests such as indexing, highlighting, and searching, and it also specifies the cache policy. Its more commonly used elements include:
Specify the index data path
Built on the Lucene search engine and open-sourced under the Apache Software License. Solr is (according to the Lucene site) "an open-source enterprise search server based on the Lucene Java search library, with XML/HTTP and JS
The nodes form a peer-to-peer network, and when some nodes fail, other nodes are automatically assigned to take over their work.
Disadvantages
There is only one developer (the current Elasticsearch GitHub organization has more than that and already has fairly active maintainers)
Not automatic enough (not suitable for the current new index warmup API)
About Solr: Solr (read as "s
Full-text search engine Solr
Solr is a standalone enterprise search application server that exposes a web-service-like API. You can submit an XML file in a specified format to the
Why do you need a search engine? Isn't a simple database enough? If you are only building a small website, a database is enough. But when you build a medium-sized or large application, a search engine is the smarter choice. Of course, small sites can also use Solr to obta
About Solr: Solr (read as "solar") is an open-source enterprise search platform for the
Why do you need a search engine? Isn't a simple database enough? If you are only building a small site, a database is enough. But when you build medium-sized or large applications, a search engine is the smarter choice. Of course, a small website can also use Solr to get highly correlated
I. Mind map of this chapter
II. Installation and deployment of Solr
1. What are Lucene and Solr?
Lucene is a subproject of the Apache Software Foundation's Jakarta project group. It is an open-source full-text search engine toolkit, which is not a full-text
, the IKAnalyzer configuration file is a good solution to this problem.
1. Add industry words
Open IKAnalyzer.cfg.xml and you will see that the configuration file is written very clearly: create a dictionary file in the same format as stopword.dic under a name of your choice, such as xxx.dic, place it in the same directory, and reference it in the IKAnalyzer.cfg.xml configuration file. (Note that the dictionary file must be encoded as UTF-8 without a BOM.) For example
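The "UTF-8 without BOM" requirement is easy to violate when a dictionary is saved from some editors. The following is a minimal sketch for checking and fixing a dictionary file before deploying it; the function names `has_bom` and `strip_bom` are hypothetical helpers, not part of IKAnalyzer.

```shell
#!/bin/bash
# Returns 0 if the file starts with the UTF-8 byte order mark (EF BB BF).
has_bom() {
    [ "$(head -c 3 "$1" | od -An -tx1 | tr -d ' \n')" = "efbbbf" ]
}

# Writes a BOM-free copy of file $1 to file $2.
# Both file names are supplied by the caller.
strip_bom() {
    if has_bom "$1"; then
        # Skip the first three bytes (the BOM) and keep the rest.
        tail -c +4 "$1" > "$2"
    else
        cp "$1" "$2"
    fi
}
```

For example, `strip_bom xxx.dic xxx.clean.dic` would produce a copy that is safe to reference from the configuration file, where both file names are placeholders.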
files to see what is inside; the content of solr.xml is:
<field name="id">SOLR1000</field>
<field name="name">Solr, the Enterprise Search Server</field>
<field name="manu">Apache Software Foundation</field>
<field name="cat">software</field>
<field name="cat">search</field>
<field name="features">Advanced Full-Text Search Capabilities using Lucene</field>
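A document like this is typically sent to Solr's XML update handler over HTTP. The sketch below shows one way that could look with curl. The URL (default example port 8983, no core name in the path) is an assumption that fits a default Solr 4.x setup and will differ on other installations, and the function names are hypothetical.

```shell
#!/bin/bash
# Assumed update URL: adjust host, port, and core name for your installation.
SOLR_UPDATE_URL="${SOLR_UPDATE_URL:-http://localhost:8983/solr/update}"

# Write a small <add> message to file $1, using two of the fields shown above.
write_example_doc() {
    cat > "$1" <<'EOF'
<add>
  <doc>
    <field name="id">SOLR1000</field>
    <field name="name">Solr, the Enterprise Search Server</field>
  </doc>
</add>
EOF
}

# Post file $1 to the update handler and commit in the same request.
post_doc() {
    curl -s "$SOLR_UPDATE_URL?commit=true" \
         -H 'Content-Type: text/xml' \
         --data-binary @"$1"
}
```

With a Solr instance running, `write_example_doc /tmp/doc.xml && post_doc /tmp/doc.xml` would index the document. The Solr 4.x example distribution also ships a post.jar tool in exampledocs for the same purpose.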
For some analysis of the above process, see:
Integrating with Nutch/HBase/Solr to build a search engine 2: Content Analysis
http://blog.csdn.net/jediael_lu/article/details/37738569
The following error occurs when you use crontab to schedule a routine Nutch task:
JAVA_HOME is not set.
Therefore, a script is created to run the crawl task:
#!/bin/bash
export J
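The original script is cut off here. As a rough sketch of the idea only, and not the author's actual script: cron starts jobs with a minimal environment, so a wrapper can export JAVA_HOME itself before starting the job. The default JDK path and the `run_job` helper below are assumptions for illustration.

```shell
#!/bin/bash
# Cron does not read /etc/profile, so set JAVA_HOME explicitly.
# The default path is a placeholder; use the JDK location on your machine.
export JAVA_HOME="${JAVA_HOME:-/usr/java/jdk1.6.0_16}"
export PATH="$JAVA_HOME/bin:$PATH"

# Hypothetical helper: run the given command only if JAVA_HOME exists.
run_job() {
    if [ ! -d "$JAVA_HOME" ]; then
        echo "JAVA_HOME does not exist: $JAVA_HOME" >&2
        return 1
    fi
    "$@"
}

# Example call (placeholder command, replace with the real crawl command):
# run_job /path/to/crawl-command
```

The crontab entry then points at this wrapper script instead of calling the crawl command directly, so the job no longer depends on the environment of an interactive shell.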
Tags: mongodb, solr, search engine, ubuntu, JSON
Environment: Ubuntu 12.04, Solr 5.1.0, MongoDB db version v2.0.4
1. Solr configuration and MongoDB installation
Installing and configuring Solr is now very simple; refer to the official documentation: http://lucene.apache.org/
At present, Java projects that need a search engine generally choose Solr and use SolrJ for the underlying interaction; Solr itself is built on Lucene. During implementation I found that many of the online documents on integrating Solr with Java are based
Enter keywords in the search box to search MongoDB for related keyword records. The user selects one of the related keywords as the final keyword used to query data from Solr.
1. Create the index: read all records from the SQL Goodsinfo table, then call solr.AddRange() and solr.Commit() to create the index.
2. The client searches in SQL t