Sort system for text retrieval
First Claim
1. A method for retrieving documents from a database, comprising the steps of:
- creating a plurality of subdocuments from a plurality of documents stored in a database;
scoring the subdocuments;
sorting the subdocument scores via a modified heap sort, wherein said modified heap sort identifies the highest ranking subdocument from a portion of said subdocuments prior to a complete sort of said subdocuments; and
upon completing said modified heap sort, displaying text from a document corresponding to said highest ranking subdocument.
3 Assignments
0 Petitions
Accused Products
Abstract
The present invention is a method for operating a computer system to retrieve information from a computer database. This method decomposes documents from the database into subdocuments and then inverts the database. Also, a query for retrieving documents from a database is decomposed into terms. The subdocuments from the inverted database and the terms from the query are then used to compute a score that indicates a quantitative relation between terms in the query and the subdocuments. The resulting list of the subdocuments and their scores is then reorganized into a heap form. The highest ranking subdocument is then selected by the computer and the text associated with this subdocument is displayed. The remainder of the subdocument score list is concurrently reheapified.
28 Citations
9 Claims
-
1. A method for retrieving documents from a database, comprising the steps of:
-
creating a plurality of subdocuments from a plurality of documents stored in a database;
scoring the subdocuments;
sorting the subdocument scores via a modified heap sort, wherein said modified heap sort identifies the highest ranking subdocument from a portion of said subdocuments prior to a complete sort of said subdocuments; and
upon completing said modified heap sort, displaying text from a document corresponding to said highest ranking subdocument. - View Dependent Claims (2, 3)
-
-
4. An apparatus for retrieving documents from a database, comprising:
-
a computer coupled to a disk storage unit and to a display unit, said disk storage unit stores a database in at least one file;
said computer retrieves data from said database file to create a plurality of subdocuments from a plurality of documents stored in a database file;
said computer scores said subdocuments;
said computer sorts the subdocument scores via a modified heap sort, wherein said modified heap sort identifies the highest ranking subdocument from a portion of said subdocuments prior to a complete sort of said subdocuments; and
upon completing said modified heap sort, said computer displays on said display unit text retrieved from a document corresponding to said highest ranking subdocument. - View Dependent Claims (5, 6)
-
-
7. A computer readable media bearing sequences of computer executable instructions for retrieving documents from a database, comprising the computer-implemented steps of:
-
creating a plurality of subdocuments from a plurality of documents stored in a database;
scoring the subdocuments;
sorting the subdocument scores via a modified heap sort, wherein said modified heap sort identifies the highest ranking subdocument from a portion of said subdocuments prior to a complete sort of said subdocuments; and
upon completing said modified heap sort, displaying text from a document corresponding to said highest ranking subdocument. - View Dependent Claims (8, 9)
-
Specification