Compressing index files in information retrieval
First Claim
1. A method for compressing an index file in an information retrieval system that retrieves information from a plurality of documents, each of the plurality of documents having features occurring therein, the method comprising the step of:
- representing occurrence frequencies of the features in the plurality of documents in a compressed format in the index file.
5 Assignments
0 Petitions
Accused Products
Abstract
There is provided a method for compressing an index file in an information retrieval system that retrieves information from a plurality of documents. Each of the plurality of documents has features occurring therein. Each of the features has parameters corresponding thereto. Parameter values corresponding to the parameters of the features are mapped into a plurality of bins. Bin identifiers are stored in the index file. Each of the bin identifiers identifies a bin to which is assigned at least one individual parameter value corresponding to at least one individual parameter.
-
Citations
33 Claims
-
1. A method for compressing an index file in an information retrieval system that retrieves information from a plurality of documents, each of the plurality of documents having features occurring therein, the method comprising the step of:
representing occurrence frequencies of the features in the plurality of documents in a compressed format in the index file. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
13. An apparatus for compressing an index file in an information retrieval system that retrieves information from a plurality of documents, each of the plurality of documents having features occurring therein, the apparatus comprising:
a compression device for representing occurrence frequencies of the features in the plurality of documents in a compressed format in the index file. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22)
-
23. A method for compressing an index file in an information retrieval system that retrieves information from a plurality of documents, each of the plurality of documents having features occurring therein, each of the features having parameters corresponding thereto, the method comprising the step of:
-
mapping parameter values corresponding to the parameters of the features into a plurality of bins; and
storing bin identifiers in the index file, each of the bin identifiers identifying a bin to which is assigned at least one individual parameter value corresponding to at least one individual parameter. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30, 31, 32, 33)
-
Specification