Lucene Application Development Secrets
Hua Zhang Training Network Address: http://www.hztraining.com/bbs/showtopic-1954.aspx
ChinaPub address: http://product.china-pub.com/3502099&ref=xiliegoumai
Note:
- This is my first time recording this kind of video, so please forgive its imperfections.
- The course series covers three parts: first, the principles of search engines; second, in-depth analysis and advanced features of Lucene; third, framework analysis and code implementation of distributed real-time search.
- The part on search engine principles is largely theoretical. Some topics have little content but are needed for completeness, so they still get their own sections, and those videos may be short. Fortunately, the videos are sold by section. I have marked the video duration of each section so that you can choose which ones to buy.
- The in-depth analysis and advanced features of Lucene are also described in the "Lucene principles and code analysis" series. If you do not want to buy the videos, you can download it from http://download.csdn.net/detail/forfuture1978/2452992 without points.
- I think the most valuable part of the framework analysis and code implementation of distributed real-time search is the framework principles and the code. Some video courses write code live during the recording, so their videos can be relatively long. I wrote the code in advance and only explain it in the video, so the videos may be shorter.
Course description:
This course is not limited to how to use Lucene. Instead, it takes Lucene as an example and, through in-depth analysis of each module, introduces the design principles and implementation of the main search engine modules. We then step outside Lucene to look at the overall architecture and design ideas of a search engine system.
Finally, the course introduces some advanced Lucene features that you may need when building a search engine system with Lucene, along with ways to customize and intervene in its behavior, and closes with an example of a Lucene-based real-time search system.
Course directory:
1. Lecture 1: The overall picture of search engines
1.1. History of search engine systems
1.2. Industry status of search engine systems
1.3. Overall architecture of a search engine system
1.4. A first look at Lucene
Details: 12 ppt pages, 65 minutes of video, no code
2. Lecture 2: The basic principles of full-text search
2.1. What is full-text search?
2.2. What is stored in the index?
2.3. How to create an index
2.4. How to find related articles
2.5. Writing a simple Lucene
Details: 17 ppt pages, 46 minutes of video, no code
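As a rough illustration of the questions listed for Lecture 2 (what the index stores, how it is created, and how matching articles are found), here is a minimal sketch of an inverted index. It is not taken from the course material. The function names `tokenize`, `build_index`, and `search` and the sample documents are hypothetical, the tokenizer is a plain whitespace split, and the search is a boolean AND with no ranking.

```python
from collections import defaultdict


def tokenize(text):
    """Lowercase and split on whitespace; a stand-in for a real analyzer."""
    return text.lower().split()


def build_index(docs):
    """Map each term to the sorted list of ids of the documents containing it."""
    postings = defaultdict(set)
    for doc_id, text in enumerate(docs):
        for term in tokenize(text):
            postings[term].add(doc_id)
    return {term: sorted(ids) for term, ids in postings.items()}


def search(index, query):
    """Return ids of documents that contain every query term (boolean AND)."""
    terms = tokenize(query)
    if not terms:
        return []
    result = set(index.get(terms[0], []))
    for term in terms[1:]:
        result &= set(index.get(term, []))
    return sorted(result)


# Hypothetical sample documents; document ids are their list positions.
docs = [
    "Lucene is a full-text search library",
    "Full-text search uses an inverted index",
    "An index maps terms to documents",
]
index = build_index(docs)
```

Calling `search(index, "full-text search")` intersects the posting lists of the two terms. A real engine would also store term frequencies and positions and would score the matches instead of returning them unranked.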
3. Lecture 3: Implementing a simple Lucene
3.1. Review of the key steps of full-text search
3.2. Design your index
3.3. How to create an index
3.4. How to find related articles
Details: 17 ppt pages, 67 minutes of video, with code
4. Lecture 4: Entering the search engine world - designing your index format
4.1. Storage of dictionaries
- Sequential list
- Pointer list
- Front coding
- Hash table and minimal perfect hash
- Trie and double-array trie
- M-way search tree
4.2. Storage of inverted lists
- Fixed-length encoding
- Delta (difference) encoding
- Parameterless variable-length encoding
- Skip list
Details: 39 ppt pages. The video is split into two parts: 54 minutes for the first part and 28 minutes for the second. No code. (Sorry, the content is not evenly divided between the two parts.)
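To make the delta encoding and variable-length encoding topics from Lecture 4 concrete, here is a small sketch that stores a sorted posting list as gaps and then packs each gap into as few bytes as possible. It is an illustration under assumptions, not the course's code. The function names and the sample `postings` list are hypothetical, and the byte layout (seven data bits per byte, low-order bits first, high bit set when more bytes follow) is one common variable-byte scheme.

```python
def delta_encode(doc_ids):
    """Replace each id in a sorted list by its gap to the previous id."""
    gaps, prev = [], 0
    for doc_id in doc_ids:
        gaps.append(doc_id - prev)
        prev = doc_id
    return gaps


def delta_decode(gaps):
    """Rebuild the original ids by accumulating the gaps."""
    ids, total = [], 0
    for gap in gaps:
        total += gap
        ids.append(total)
    return ids


def vbyte_encode(numbers):
    """Pack non-negative integers, 7 bits per byte, low-order bits first."""
    out = bytearray()
    for n in numbers:
        while n >= 0x80:
            out.append((n & 0x7F) | 0x80)  # high bit set: more bytes follow
            n >>= 7
        out.append(n)                      # last byte has the high bit clear
    return bytes(out)


def vbyte_decode(data):
    """Inverse of vbyte_encode."""
    numbers, value, shift = [], 0, 0
    for byte in data:
        value |= (byte & 0x7F) << shift
        if byte & 0x80:
            shift += 7
        else:
            numbers.append(value)
            value, shift = 0, 0
    return numbers


# Hypothetical posting list (sorted document ids).
postings = [3, 7, 130, 131, 1000]
encoded = vbyte_encode(delta_encode(postings))
decoded = delta_decode(vbyte_decode(encoded))
```

Because the gaps between sorted ids are usually much smaller than the ids themselves, most of them fit in a single byte, which is why the two techniques are typically combined.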
5. Lecture 5: Entering the search engine world - the Lucene index format
5.1. Basic concepts
5.2. Basic data types
5.3. Some basic rules
- Prefix encoding rule
- Delta (difference) rule
- Optional-follow rule
- Skip list rule
5.4. Important forward information
5.5. Important inverted information
5.6. Other information
Details: 29 ppt pages. The video is split into two parts: 57 minutes for the first part and 42 minutes for the second. No code.
6. Lecture 6: Entering the search engine world - index creation
6.1. Index creation process
6.2. Memory-based index creation
6.3. Disk-based index creation
6.4. Creating a dynamic index
6.5. Merging indexes
Details: 12 ppt pages, 22 minutes of video, no code
7. Lecture 7: Entering the search engine world - Lucene index creation
7.1. Overall architecture of the indexing process
7.2. Lucene multi-threaded indexing
7.3. Lucene multi-field indexing
7.4. Key steps of index creation
7.5. Lucene document deletion
7.6. Lucene cache management
Details: 40 ppt pages. The video is split into two parts: 39 minutes for the first part and 45 minutes for the second. No code.
8. Lecture 8: Entering the search engine world - Lucene index merging
8.1. Overview of Lucene merging
8.2. Merging segments
8.3. Merging inverted information
8.4. Key steps of segment merging
Details: 28 ppt pages, 29 minutes of video, no code
9. Lecture 9: Entering the search engine world - the search process
9.1. Main search process
9.2. Query preprocessing
- Stemming
- Spell checking
- Synonyms
- Query expansion
9.3. Information retrieval models
- Set-theoretic: Boolean model
- Algebraic: vector space model
- Probabilistic: language model
Details: 23 ppt pages, 41 minutes of video, no code
10. Lecture 10: Entering the search engine world - Lucene's search process
10.1. Overall architecture of the search process
10.2. Lucene query syntax
10.3. Lucene scoring mechanism
10.4. Key steps of Lucene search
Details: 51 ppt pages. The video is split into two parts: 45 minutes for the first part and 35 minutes for the second. No code.
11. Lecture 11: Entering the search engine world - several important Lucene queries
11.1. The query object hierarchy
11.2. Several advanced query objects
- RangeQuery, SpanQuery, FilterQuery
Details: 20 ppt pages, 36 minutes of video, with code
12. Lecture 12: Entering the search engine world - word segmentation
12.1. Common word segmentation methods
- Dictionary-based mechanical segmentation algorithms: maximum matching and shortest path
- Statistics-based segmentation algorithms: N-gram model
12.2. Word segmenters in Lucene
12.3. Customizing your own word segmenter
- Wrapping the Chinese Academy of Sciences word segmenter
Details: 22 ppt pages, 60 minutes of video, with code
13. Lecture 13: Lucene's advanced features, part 1
13.1. Principles and applications of payloads
13.2. Range queries and using payloads for range queries
13.3. Sorting by field and using payloads to sort by field
13.4. Lucene transactions
Details: 26 ppt pages, 30 minutes of video, with code
14. Lecture 14: Lucene's advanced features, part 2
14.1. Lucene performance bottlenecks
14.2. Four ways to influence Lucene's scoring process
Details: 20 ppt pages, 51 minutes of video, with code
15. Lecture 15: Principles and implementation of Lucene-based real-time search
15.1. Principles of real-time search
15.2. Implementation and code of real-time search
Details: 12 ppt pages, 33 minutes of video, with code
16. Lecture 16: Document updates in Lucene-based real-time search
16.1. How documents are updated in real-time search
16.2. Implementation and code of document updates in real-time search
Details: 17 ppt pages, 38 minutes of video, with code
17. Lecture 17: The real-time search framework Zoie
17.1. Overall architecture
17.2. Configuring Zoie
17.3. How Zoie implements real-time search
17.4. Zoie indexing process
17.5. Zoie search process
Details: 26 ppt pages, 40 minutes of video, no code
18. Lecture 18: Building a real-time search system based on Zoie
18.1. Running a Zoie demo
18.2. Overall architecture
18.3. Implementing your own components
- Code implementation of the indexing component
- Code implementation of the search component
- Real-time search system demo
Details: 6 ppt pages, 41 minutes of video, with code
19. Lecture 19: The distributed search framework Sensei
19.1. Overall architecture
19.2. Detailed architecture
19.3. Configuring the Sensei server
19.4. Configuring the Sensei client
19.5. Important Sensei processes
- Client startup process
- Server startup process
- Client-side search process
- Server-side search process
Details: 13 ppt pages, 44 minutes of video, no code
20. Lecture 20: Building a distributed search system based on Sensei
20.1. Overall architecture
20.2. Implementing your own components
- Server code implementation
- Client code implementation
- Distributed search system demo
Details: 6 ppt pages, 57 minutes of video, with code