Lucene 5.X version Index file format

Source: Internet
Author: User

Original link: https://my.oschina.net/rickylau/blog/527602

  

Name File Expansion Name Describe
Segment file Segments_n Saves the number of segments that the index contains, and how many documents each segment contains.
Duan Yuancan Villager Data , s+ Saved metadata information for an index segment
Lock file Write.lock Prevents multiple IndexWriter from being written to an index file at the same time.
Compound index File . CFS,. CFE Stores all index information in a composite index file.
Domain Information for index segments . FNM Save the domain that this segment contains, as well as the name of the domain and the index type of the domain.
Document information for an index segment . FDX,. FDT

Save the documents contained in this section, the fields contained in each document, and the information for each domain.

Index segment Term Information . Tim,. Tip

The. Tim file stores the statistics for the term in each domain and holds a pointer to the. doc,. Pos, and. Pay index files.

The. Tip file holds the index information for the term dictionary and can support random access.

Term word frequency and skip table information in documents . doc Save the term frequency information for each document in this paragraph.
Position information for term in the document . Pos Save the term location information for each document in this paragraph.
Payload and partial location information for a document . Pay The location information (offsets) of the valid carrier (payload) and term for each document in this paragraph is saved. One part of the term location information is stored in the. pos file.
Index field weighting factor . NVD,. NVM

. nvm files Save metadata for index field weighting factor

. nvd File Save index field weighted data

Index document weighting Factor . DVD,. DVM

. dvm files Save metadata for index document weighting factor

. dvd File Save index document weighted data

Index vector data . TVX,. TVD,. TVF

. TVD stores information such as term, term frequency, location information, payload, and so on for this document.

The. TVX index file that is used to load specific documents into memory.

. TVF Save the vector information for the indexed field.

Valid documents . Liv Save index file information for a valid document

Lucene 5.X version Index file format

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.