Information retrieval system for archiving multiple document versions
First Claim
1. A system comprising:
- a phrase-based index that includes a plurality of posting lists, each of the posting lists including:
  - a phrase, and
  - a list of documents associated with the phrase;
- at least one processor; and
- memory storing instructions that, when executed by the at least one processor, cause the system to perform operations including:
  - receiving, on a first date, a current version of a previously indexed version of a document, the previously indexed version being associated with a closed date of a date range that represents current validity,
  - determining that the current version differs from the previously indexed version,
  - generating a document identifier for the current version that is based on the first date,
  - determining at least one phrase associated with the current version,
  - updating the closed date of the date range for the previously indexed version based on the first date,
  - setting a closed date of a date range for the current version to a status that represents current validity, and
  - updating a posting list for the at least one phrase to include the document identifier.
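The operations recited in the first claim can be sketched as a small in-memory routine. This is only an illustration under stated assumptions, not the patented implementation: the names `Instance`, `PhraseIndex`, `extract_phrases` and `index_version` are hypothetical, a closed date of `None` stands in for the "status that represents current validity", change detection is a plain content comparison, and the document identifier is simply the URL joined with the receipt date.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import Dict, List, Optional

CURRENT = None  # assumption: a closed date of None means "currently valid"


@dataclass
class Instance:
    doc_id: str
    url: str
    content: str
    open_date: date
    closed_date: Optional[date] = CURRENT


@dataclass
class PhraseIndex:
    postings: Dict[str, List[str]] = field(default_factory=dict)  # phrase -> doc ids
    instances: Dict[str, Instance] = field(default_factory=dict)  # doc id -> instance
    latest: Dict[str, str] = field(default_factory=dict)          # url -> current doc id


def extract_phrases(text: str, known_phrases: List[str]) -> List[str]:
    """Toy phrase detection: keep the known phrases that occur in the text."""
    lowered = text.lower()
    return [p for p in known_phrases if p in lowered]


def index_version(index: PhraseIndex, url: str, content: str,
                  first_date: date, known_phrases: List[str]) -> Optional[str]:
    """Index a newly received version; return its id, or None if unchanged."""
    prev_id = index.latest.get(url)
    prev = index.instances[prev_id] if prev_id is not None else None

    # Determine that the current version differs from the indexed one.
    if prev is not None and prev.content == content:
        return None

    # Generate a document identifier based on the first date.
    # Simplification: two changes on the same day would collide.
    doc_id = f"{url}@{first_date.isoformat()}"

    # Determine the phrases associated with the current version.
    phrases = extract_phrases(content, known_phrases)

    # Close the previous version's date range based on the first date.
    if prev is not None:
        prev.closed_date = first_date

    # The new version's closed date is set to the "currently valid" status.
    index.instances[doc_id] = Instance(doc_id, url, content, first_date, CURRENT)
    index.latest[url] = doc_id

    # Update the posting list of each phrase to include the new identifier.
    for phrase in phrases:
        index.postings.setdefault(phrase, []).append(doc_id)
    return doc_id
```

Because the identifier encodes the date, each posting list ends up holding one entry per version rather than one per document, which is what lets older versions stay retrievable.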
Abstract
An information retrieval system uses phrases to index, retrieve, organize and describe documents. Phrases are identified that predict the presence of other phrases in documents. Documents are then indexed according to their included phrases. Index data for multiple versions or instances of documents is also maintained. Each document instance is associated with a date range and relevance data derived from the document for the date range.
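One way to picture how per-instance date ranges might be used at query time is a date-restricted lookup over a posting list. The sketch below is hypothetical: `Instance`, `valid_on` and `search` are illustrative names, a single `relevance` float stands in for the relevance data the abstract mentions, `None` marks a currently valid instance, and the closed date is treated as inclusive.

```python
from datetime import date
from typing import Dict, List, NamedTuple, Optional


class Instance(NamedTuple):
    doc_id: str
    open_date: date
    closed_date: Optional[date]  # assumption: None means "currently valid"
    relevance: float             # stand-in for per-date-range relevance data


def valid_on(inst: Instance, day: date) -> bool:
    """True if the instance's date range covers the given day (inclusive)."""
    return inst.open_date <= day and (
        inst.closed_date is None or day <= inst.closed_date
    )


def search(postings: Dict[str, List[Instance]], phrase: str, day: date) -> List[str]:
    """Return ids of instances containing the phrase that were valid on `day`,
    ordered by descending relevance."""
    hits = [inst for inst in postings.get(phrase, []) if valid_on(inst, day)]
    hits.sort(key=lambda inst: inst.relevance, reverse=True)
    return [inst.doc_id for inst in hits]
```

A query for a past date then returns the versions that were valid at that time, while a query for today returns only instances whose range is still open.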
19 Claims
1. A system comprising:
- a phrase-based index that includes a plurality of posting lists, each of the posting lists including:
  - a phrase, and
  - a list of documents associated with the phrase;
- at least one processor; and
- memory storing instructions that, when executed by the at least one processor, cause the system to perform operations including:
  - receiving, on a first date, a current version of a previously indexed version of a document, the previously indexed version being associated with a closed date of a date range that represents current validity,
  - determining that the current version differs from the previously indexed version,
  - generating a document identifier for the current version that is based on the first date,
  - determining at least one phrase associated with the current version,
  - updating the closed date of the date range for the previously indexed version based on the first date,
  - setting a closed date of a date range for the current version to a status that represents current validity, and
  - updating a posting list for the at least one phrase to include the document identifier.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. A method comprising:
- receiving, on a first date, a current version of a previously indexed version of a document, the previously indexed version being associated with a closed date of a date range that represents current validity;
- determining, using at least one processor, that the current version differs from the previously indexed version;
- generating, using the at least one processor, a document identifier for the current version that is based on the first date;
- determining at least one phrase associated with the current version;
- updating the closed date of the date range for the previously indexed version based on the first date;
- setting a closed date of a date range for the current version to a status that represents current validity; and
- updating a posting list for the at least one phrase to include the document identifier, the posting list being included in a phrase-based index.
Dependent claims: 10, 11, 12, 13, 14
15. A system comprising:
- at least one processor; and
- a memory storing:
  - date specific instances of a document, each instance having an associated date range, the instances including a first version of the document with a first date range having a closed date that indicates the first version is currently valid,
  - an index relating phrases to instances of documents, and
  - instructions that, when executed by the at least one processor, cause the system to perform operations comprising:
    - receiving, on a first date, a second version of the document,
    - determining that the second version differs from the first version,
    - updating, in response to the determining, the closed date of the first date range from a status indicating current validity to a date prior to the first date,
    - associating the second version of the document with a date range having the first date as an open date and a closed date that represents current validity, and
    - updating the index to relate some of the phrases in the index to the second version of the document.
Dependent claims: 16, 17, 18, 19
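The date-range handling in claim 15 differs slightly from claim 1: the first version's range is closed at "a date prior to the first date", and the second version's range opens on the first date. A minimal sketch of that rollover follows, assuming the prior date is the day immediately before (the claim does not fix this) and that `None` marks current validity. `DateRange` and `roll_over` are hypothetical names.

```python
from datetime import date, timedelta
from typing import List, Optional, Tuple

# (open date, closed date); a closed date of None is an assumed marker
# for "currently valid".
DateRange = Tuple[date, Optional[date]]


def roll_over(ranges: List[DateRange], first_date: date) -> List[DateRange]:
    """Close the currently valid range before `first_date` and open a new one.

    Assumption: "a date prior to the first date" is taken to be the previous
    calendar day, so consecutive ranges neither overlap nor leave a gap.
    """
    if not ranges or ranges[-1][1] is not None:
        raise ValueError("expected the last range to be currently valid")
    open_date, _ = ranges[-1]
    closed = first_date - timedelta(days=1)
    return ranges[:-1] + [(open_date, closed), (first_date, None)]
```

For example, a version valid since 1 January 2024 that is superseded on 1 February 2024 ends up with the range 1 January to 31 January, and the new version's range runs from 1 February with an open end.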
Specification