| Test function |
Test item |
Effect description |
Complete situation |
| Manage pages |
Handler Startup |
Click the "Start" button to have the handler start processing |
|
| Open Crawler Management page |
Click "Source Configuration" to eject the Crawler management page |
|
| Open the Output Destination Configuration page |
Click "Output Configuration" to eject the output configuration page |
|
| Crawler functions |
Crawler crawl Information seed increased |
You can manually increase the crawl information source site on the Crawler Administration page |
|
| Crawler keyword Filter |
You can add keywords to filter content when crawling information |
|
| Crawler Multi-threaded boot |
You can customize the startup of several crawl threads, and you can see how each thread is running |
|
| Crawler Information Display |
You can see the information about the crawler running, the number of files crawled |
|
| Crawler Crawl Site number limit |
You can customize the number of crawler crawl sites, and if default, crawl down consistently |
|
| Crawler File Information Preservation |
Information crawled from the Web can be stored in the database in a format that can be updated from the database |
|
| Data processing functions |
Data processing Start Control |
You can manage the start and pause of the current data processing thread on a Web site |
|
| Doc data Text information acquisition |
Extract text information from doc file |
|
| Doc Key Information extraction |
Extract key information from the doc file and save |
|
| HTML Data Text acquisition |
Extracting the de-noising text from HTML |
|
| HTML Key Information Extraction |
Extracting key information from an HTML file |
|
| PDF Data Text acquisition |
Extracting text information from a PDF file |
|
| PDF Key Information Extraction |
Extracting keywords from PDF files |
|
| Questions and answers on site information extraction |
Extract questions and quality answers from quiz sites |
|
| Expansion capabilities |
Configure a linked SOLR account |
You can manually configure the SOLR database that needs to be linked |
|
| Custom Uploads |
Allow users to SOLR index deletion and rebuild options |
|
| Provide modify keyword interface |
Provides a way to modify the keyword interface and access |
|
| Login account |
Provide login interface, use fixed account to login to admin interface |
|