Managing gigabytes for Java 4.0 publishes a Java search engine is a highly 17813.html "> customizable, high-performance, Full-text, large document collection of Java search engines." It provides State-of-the-art functions (such as bm25/bm25f) and new research algorithms.
Although mg4j (managing gigabytes for Java) is not an information retrieval library like Lucene, Egothor, and Xapian, we believe every software engineer who is reading this book should Know about it, Because it provides low-level support for building a Java information Retrieval library.
Mg4j is another search engine. The main difference with Lucene is that it provides cluster functionality with a more OO design approach. MG4J allows you to build a compressed Full-text index for a large collection of document collections by making the interpolation code (interpolative job) technology.
Managing gigabytes for Java 4.0 This is part of a parallel release fastutil version of the DSi utilities,sux4j,mg4j,webgraph and so on.
Support in the "big" version more than 2^31 in the array (analog), the list of elements, terminology, files, nodes, etc. Several improved semantics, as well as some subtle, long-term bug fixes.
Official website: http://mg4j.dsi.unimi.it/