Lucene index files use the following basic types to store information:
1. Byte: the most basic type, 8 bits long.
2. UInt32: consists of 4 bytes.
3. UInt64: consists of 8 bytes.
4. VInt:
A variable-length integer type that may span multiple bytes. In each 8-bit byte, the low 7 bits hold part of the value, and the highest bit indicates whether another byte follows: 0 means no, 1 means yes.
Earlier bytes hold the low-order bits of the value, and later bytes hold the high-order bits.
For example, 130 in binary is 1000 0010, 8 bits in total. Since only 7 bits of each byte carry the value, one byte cannot represent it and two bytes are required. The first byte holds the low 7 bits, and its highest bit is set to 1 because another byte follows, giving (1) 0000010. The second byte holds the 8th bit, and its highest bit is 0 because no byte follows, giving (0) 0000001.
5. Chars: a sequence of UTF-8 encoded bytes.
6. String: a string starts with a VInt giving the number of characters in the string, followed by the UTF-8 encoded character sequence Chars.
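
The VInt and String layouts described above can be sketched in Java as follows. This is a minimal illustration rather than Lucene's own code: the class name VIntSketch and the methods encodeVInt, decodeVInt, and encodeString are hypothetical names chosen for this example. It also assumes that "number of characters" can be taken from String.length(), which counts UTF-16 code units and matches the character count only for text in the Basic Multilingual Plane.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class for illustration; not part of Lucene.
final class VIntSketch {

    // Encode an int as a VInt: 7 value bits per byte, low-order group first.
    // The highest bit of each byte is 1 if another byte follows, 0 otherwise.
    static byte[] encodeVInt(int value) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        while ((value & ~0x7F) != 0) {
            out.write((value & 0x7F) | 0x80); // low 7 bits plus continuation flag
            value >>>= 7;
        }
        out.write(value); // last byte, highest bit is 0
        return out.toByteArray();
    }

    // Decode a VInt starting at the given offset.
    static int decodeVInt(byte[] bytes, int offset) {
        int value = 0;
        int shift = 0;
        int pos = offset;
        while (true) {
            byte b = bytes[pos++];
            value |= (b & 0x7F) << shift; // later bytes carry higher-order bits
            if ((b & 0x80) == 0) {
                return value;
            }
            shift += 7;
        }
    }

    // Encode a String as a VInt length followed by its UTF-8 bytes (Chars).
    // Assumption: the length is taken from String.length().
    static byte[] encodeString(String s) {
        byte[] length = encodeVInt(s.length());
        byte[] chars = s.getBytes(StandardCharsets.UTF_8);
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        out.write(length, 0, length.length);
        out.write(chars, 0, chars.length);
        return out.toByteArray();
    }

    public static void main(String[] args) {
        StringBuilder hex = new StringBuilder();
        for (byte b : encodeVInt(130)) {
            hex.append(String.format("%02x ", b & 0xFF));
        }
        System.out.println(hex.toString().trim()); // prints "82 01"
    }
}
```

The two bytes printed for 130 are 0x82 (binary 1000 0010) and 0x01 (binary 0000 0001), which matches the worked example: the first byte carries the low 7 bits with the continuation bit set, and the second carries the remaining bit.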