1. Use DIH to upload structured data
Many search applications index structured data, such as relational databases. DIH provides a mechanism for storing and indexing structured data. apart from relational databases, SOLR can index content from HTTP, based on data sources such as RSS and atom feeds, e-mail libraries, and structured XML (fields can be generated using XPath)
For more information, see https://wiki.apache.org/solr/DataImportHandler.
1.1 concepts and terminology
Concepts and terms
The description of data import handler uses several similar terms, such as entity and processor.
Terms |
Definition |
Datasource |
For a database, it is a DNS, and for an HTTP data source, it is a basic URL. |
Entity |
Conceptually, an object generates a group of documents. For an RDBMS data source, an object is a view or table. |
Processor |
An entity processor is used to extract content from the data source, convert the content, and add it to the index. A custom entity processor can inherit or replace the entity processor it supports. |
Transformer |
Each set of fields obtained by the object can be converted. this conversion process can be to modify a field, create a new field, or generate multiple lines/documents from one line. DIH has several built-in converters that can modify dates and filter HTML tags. you can also use public available interfaces to customize the converter. |
1.2 configuration 1.2.1 sorting ing solrconfig. xml
1.6.4 uploading structured data store data with the data import Handler