INTEGRATING EXTERNAL RELATED PHRASE INFORMATION INTO A PHRASE-BASED INDEXING INFORMATION RETRIEVAL SYSTEM
First Claim
1. A method for updating phrases associated with a limited document collection, comprising:
- determining a list of top phrases for the limited document collection, at least in part based on presence of related phrases of the top phrases;
receiving a replacement phrase for at least one of the top phrases; and
updating related phrase data for the replacement phrase from the related phrase data of the top phrase that is being replaced.
2 Assignments
0 Petitions
Accused Products
Abstract
An information retrieval system uses phrases to index, retrieve, organize and describe documents, analyzing documents and storing the results of the analysis as phrase data. Phrases are identified that predict the presence of other phrases in documents. Documents are the indexed according to their included phrases. Related phrases and phrase extensions are also identified. Changes to existing phrase data about a document collection submitted by a user is captured and analyzed, and the existing phrase data is updated to reflect the additional knowledge gained through the analysis.
146 Citations
16 Claims
-
1. A method for updating phrases associated with a limited document collection, comprising:
-
determining a list of top phrases for the limited document collection, at least in part based on presence of related phrases of the top phrases; receiving a replacement phrase for at least one of the top phrases; and updating related phrase data for the replacement phrase from the related phrase data of the top phrase that is being replaced.
-
-
2. A method of determining top phrases of a limited document collection, comprising:
-
determining top phrases for each of a plurality of documents in the limited document collection, at least in part based on presence of related phrases of the top phrases in each document, each top phrase being associated with a score; for each top phrase of a document, determining an aggregate score for the top phrase corresponding to the top phrase'"'"'s scores for documents in which it appears in the limited document collection; selecting a set of top phrases with the highest aggregate scores. - View Dependent Claims (3, 4, 5, 6, 7, 8, 9)
-
-
10. A method of updating existing phrase information, responsive to a user requesting a change of a current top phrase for a limited document collection to a replacement top phrase, the method comprising:
-
associating the replacement top phrase with a root document of the document collection; associating the current top phrase and replacement top phrase with each other; adding to phrase information for the replacement top phrase, phrase information for the current top phrase; and adding to related phrase information of the replacement top phrase, related phrase information of the current top phrase. - View Dependent Claims (11, 12, 13, 14, 15, 16)
-
Specification