Method and apparatus using run length encoding to evaluate a database
Abstract
The present invention provides a method and apparatus for generating a database search result. The search result is created by representing the subdocument lists of an inverted database as encoded bit strings. The encoded bit strings are a space-efficient way of storing the correspondence between terms in the database and their occurrence in subdocuments. Logical combinations of these bit strings are then obtained by identifying the intersection, union, and/or inversion of a plurality of the bit strings. Since keywords for a database search can be identified by selecting the terms of the inverted database, the logical combinations of bit strings represent search results over the database. This technique for generating a search result is computationally efficient because computers combine bit strings very efficiently. The search elements of the present invention are not limited to keywords; they also include types of fields (e.g., date or integer fields) or other extracted entities.
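The idea in the abstract can be illustrated with a minimal sketch. It assumes subdocuments are fixed-size word chunks and that each term's bit string is held as a Python integer whose bit i is set when the term occurs in subdocument i. The function names, the chunk size, and the sample text are hypothetical and are not taken from the patent; the bit strings here are also left unencoded for clarity.

```python
from collections import defaultdict

def split_into_subdocuments(words, size):
    """Cut a word list into consecutive chunks of `size` words (the last may be shorter)."""
    return [words[i:i + size] for i in range(0, len(words), size)]

def build_bit_strings(subdocs):
    """Map each term to an int whose bit i is set when the term occurs in subdocument i."""
    bits = defaultdict(int)
    for i, subdoc in enumerate(subdocs):
        for term in set(subdoc):
            bits[term] |= 1 << i
    return dict(bits)

def invert(bit_string, n_subdocs):
    """Flip every bit, keeping only the n_subdocs low-order positions."""
    mask = (1 << n_subdocs) - 1
    return ~bit_string & mask

def matching_subdocs(bit_string):
    """List the subdocument numbers whose bit is set."""
    return [i for i in range(bit_string.bit_length()) if (bit_string >> i) & 1]

# Hypothetical sample database: 12 words cut into 3 subdocuments of 4 words.
text = "red fox runs fast blue fox sleeps late red bird sings loud".split()
subdocs = split_into_subdocuments(text, 4)
index = build_bit_strings(subdocs)

both = index["red"] & index["fox"]                               # intersection
either = index["red"] | index["fox"]                             # union
red_not_fox = index["red"] & invert(index["fox"], len(subdocs))  # inversion

print(matching_subdocs(both))         # [0]
print(matching_subdocs(either))       # [0, 1, 2]
print(matching_subdocs(red_not_fox))  # [2]
```

Each query is a single integer operation regardless of how many terms a subdocument contains, which is the efficiency argument the abstract makes.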
11 Claims
1. A method of storing data, comprising:
creating a plurality of subdocuments of approximately equal length from a database;
representing the occurrence of a plurality of terms in each of said subdocuments by an encoded bit string;
combining a plurality of said bit strings, wherein said combination represents a search result from said database.
Dependent claims: 2, 3, 4
5. An apparatus for storing data, comprising:
a computer coupled to a disk storage unit, said disk storage unit stores a database, said computer creates a plurality of subdocuments of approximately equal length from said database;
said computer represents the occurrence of a plurality of terms in each of said subdocuments by an encoded bit string;
said computer combines a plurality of said encoded bit strings; and
said computer stores said plurality of said encoded bit strings.
Dependent claims: 6, 7, 8
9. A method of retrieving data from a database, comprising the steps of:
creating a plurality of subdocuments from a database;
representing the occurrence of at least one term in said subdocuments by an encoded bit string;
identifying said subdocuments containing said bit string; and
retrieving said subdocuments containing said bit string.
Dependent claims: 10, 11
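The title refers to run length encoding, and claim 9 refers to identifying and retrieving subdocuments from an encoded bit string. The sketch below shows one possible way to do both. The run layout (alternating run lengths that start with a run of zeros, possibly of length 0) and all names are assumptions made for illustration, not the encoding defined in the specification.

```python
def rle_encode(bits):
    """Encode a list of 0/1 values as alternating run lengths, starting with zeros."""
    runs = []
    current, count = 0, 0
    for b in bits:
        if b == current:
            count += 1
        else:
            runs.append(count)      # close the run that just ended
            current, count = b, 1
    runs.append(count)              # close the final run
    return runs

def rle_decode(runs):
    """Expand alternating run lengths back into a list of 0/1 values."""
    bits = []
    value = 0
    for length in runs:
        bits.extend([value] * length)
        value ^= 1                  # runs alternate between 0 and 1
    return bits

def retrieve(subdocs, runs):
    """Return the subdocuments whose bit is set in the encoded bit string."""
    return [doc for doc, bit in zip(subdocs, rle_decode(runs)) if bit]

# Hypothetical term occurring in subdocuments 3, 4 and 9 out of 10.
term_bits = [0, 0, 0, 1, 1, 0, 0, 0, 0, 1]
encoded = rle_encode(term_bits)
subdocs = [f"subdoc-{i}" for i in range(10)]

print(encoded)                     # [3, 2, 4, 1]
print(retrieve(subdocs, encoded))  # ['subdoc-3', 'subdoc-4', 'subdoc-9']
```

Run lengths pay off when a term is rare, because long stretches of zeros collapse into a single number.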
Specification