The revelation of Lucene application development
The Training network address: http://www.hztraining.com/bbs/showtopic-1954.aspx
Chinapub Address: Http://product.china-pub.com/3502099&ref=xiliegoumai
Note: Since I was the first to record this kind of video, the imperfect place please forgive me. This series of courses mainly covers three major parts, one is the principle of search engine, the second is the in-depth analysis of lucene and advanced characteristics, the third is the real-time search of the framework of distributed search and code implementation of the search engine More theoretical explanations, some parts of the content is less, the need for integrity, or as a separate section, may be short video time, good video is a section of sales, I have marked the video of each section of the length, you can selectively purchase for Lucene in-depth analysis and advanced characteristics, basic in the " Lucene principles and Code Analysis of the series, described in the blog, if you do not want to buy a friend can download http://download.csdn.net/detail/forfuture1978/2452992, do not integrate For real-time search of distributed search framework analysis and code implementation, I think the most valuable part is the principle of the framework as well as the code, and some video courses in the video playback process in the field to write code, so the video may be longer, I take is to write the code in advance, in the video just to explain to you, So the time may be shorter.
Course Description:
This course is not limited to how to use Lucene, but with Lucene as an example, through the in-depth analysis of the various modules of search engine design principles and implementation of various modules, and then can jump out of lucene see Search engine system architecture and design ideas. Finally, this course introduces some of the advanced features of Lucene and personalized modifications and interventions that may be involved in building a search engine system using Lucene, and finally gives an example of a real time search system based on Lucene.
Course Catalogue:
1. First talk about search engine panorama
1.1. The history of the search engine system
1.2. The industry status of search engine system
1.3. The overall structure of the search engine system
1.4. First knowledge of Lucene
Detailed information: PPT 12 pages, video 65 minutes, not attached code
2. Second, the basic principles of Full-text Search
2.1. What is Full-text search
2.2. What is stored in the index
2.3. How to create an index
2.4. How to find the relevant articles
2.5. Write a simple Lucene yourself
Detailed information: PPT 17 pages, video 46 minutes, not attached code
3. Third, the realization of a simple lucene
3.1. Review the important process of Full-text search
3.2. Design your index
3.3. How to create an index
3.4. How to find the relevant articles
Detailed information: PPT 17 pages, video 67 minutes, with code
4. Go to the world of search engines-Design your index format
4.1. How to store the dictionary
Order List
Pointer list
Front-end encoding
hash table and minimum perfect hash
Trie Tree and Trie tree with double group
M Road Search Tree
4.2. How to store the inverted table
Fixed-length coding
Difference Code
parameter-free and variable-length coding
Jump table
Detailed information: PPT 39 page, the video points up and down two sets, the previous 54 minutes, the next 28 minutes, does not have the code (embarrassed that the content configuration of the context is not enough average)
5. The-lucene index format for entering the world of search engines
5.1. Basic Concepts
5.2. Basic data types
5.3. Some basic rules
Prefix encoding Rules
Difference Rule
Probable following rule
Jump Table Rule
5.4. Important Forward Information
5.5. Important Reverse Information
5.6. Other information
Detailed information: PPT 29 page, the video points up and down two sets, the previous 57 minutes, the next 42 minutes, does not attach the code
6. Sixth talk about entering the world of search engines-creation of indexes
6.1. The main process of index creation
6.2. Memory-based index creation
6.3. Hard disk based index creation
6.4. Dynamic Index Creation
6.5. Integration of indexes
Detailed information: PPT 12 pages, video 22 minutes, not attached code
7. Seventh talk about the creation of the index of the world-lucene into the search engine
7.1. The overall structure of the indexing process
7.2.Lucene Multi-threaded indexing
7.3.Lucene Multi-domain Index
7.4. Key steps for index creation
7.5.Lucene of document Deletion
Cache Management for 7.6.Lucene
Detailed information: PPT 40 page, the video points up and down two sets, the previous 39 minutes, the next 45 minutes, does not attach the code
8. The-lucene index merging into the search engine world
A summary of the merging of 8.1.Lucene segments
8.2. Paragraph consolidation Strategy
8.3. Merging of reverse information
8.4. Key steps in the consolidation of segments
Detailed information: PPT 28 pages, video 29 minutes, not attached code
9. Ninth talk into the search engine world-the search process
9.1. The main process of search
9.2. Query preprocessing
Stemming
Spell check
Synonyms
Query extension
9.3. Information Retrieval Model
Based on set theory: Boolean model
Based on algebra theory: vector space model
Based on probability theory: Language model
Detailed information: PPT 23 pages, video 41 minutes, not attached code
10. Tenth talk about the search process of the world-lucene into the search engine
10.1. The overall structure of the search process
10.2.Lucene query syntax
Scoring mechanism for 10.3.Lucene
Key steps in 10.4.Lucene search
Detailed information: PPT 51 page, the video points up and down two sets, the previous 45 minutes, the next 35 minutes, does not attach the code
11.11th. Several important inquiries into the world-lucene of search engines
11.1. Querying the hierarchy of objects
11.2. Several advanced query objects
Rangequery, Spanquery, Filterquery
Detailed information: PPT 20 pages, video 36 minutes, with code
12.12th Talk about entering the world of search engines-participle
12.1. Commonly used participle method
Dictionary-based Machine segmentation algorithm: Maximum matching method and shortest path method
Statistics-based segmentation algorithm: n-ary syntax
The word breaker in 12.2.Lucene
12.3. Customize your own word breaker
The encapsulation of Chinese Academy of Sciences participle
Detailed information: PPT 22 pages, video 60 minutes, with code
13.13th. One of the advanced features of Lucene
The principle and application of 13.1.Payload
13.2. Interval query and use payload to implement interval query
13.3. Sort by domain field and using payload to sort by fields
13.4.Lucene of Transactional
Detailed information: PPT 26 pages, video 30 minutes, with code
14.14th. The advanced characteristics of Lucene
14.1.Lucene Performance Bottleneck
14.2. Four ways to influence the grading process of Lucene
Detailed information: PPT 20 pages, video 51 minutes, with code
15.15th. The principle and implementation of real-time search based on Lucene
15.1. The principle of real-time search
15.2. Real-time search implementation and code
Detailed information: PPT 12 pages, video 33 minutes, with code
16.16th. Real-time search for Chinese file update based on Lucene
16.1. Real-Time Search Chinese document update principle
16.2. Real-Time Search Chinese file update implementation and code
Detailed information: PPT 17 pages, video 38 minutes, with code
17.17th Lecture on real-time search system Framework Zoie
17.1. Overall structure
17.2. Configure a Zoie
17.3.Zoie realization of real-time search principle
Indexing process for 17.4.Zoie
17.5.Zoie Search Process
Detailed information: PPT 26 pages, video 40 minutes, not attached code
18. The 18th Lecture is based on Zoie to create a real-time search system
18.1. A demo to run Zoie
18.2. Overall structure
18.3. Implement your own components
Index Component Code Implementation
Search Component Code implementation
Real-Time Search system Demo
Detailed information: PPT 6 pages, video 41 minutes, with code
19.19th Speaking distributed search framework Sensei
19.1. Overall structure
19.2. Detailed framework
19.3. Configure the Sensei server side
19.4. Configure the Sensei client
The important process of 19.5.Sensei
Client startup process
Server-side startup process
Client search Process
Server-Side Search process
Detailed information: PPT 13 pages, video 44 minutes, not attached code
20. The 20th lecture is based on Sensei to create a distributed search system
20.1. Overall structure
20.2. Implement your own components
Server-side Code implementation
Client code Implementation
Distributed Search System Demo
Detailed information: PPT 6 pages, video 57 minutes, with code