SYSTEM AND METHOD OF FINDING DOCUMENTS RELATED TO OTHER DOCUMENTS AND OF FINDING RELATED WORDS IN RESPONSE TO A QUERY TO REFINE A SEARCH
First Claim
1. A processor implemented method of finding related documents that are relevant to one or more known documents, comprising:
- based on the known documents, automatically finding relevant keywords;
using the keywords, automatically searching for documents that are relevant to the keywords to generate a search result list that includes the known documents; and
removing the known documents from the result list to obtain a list of the related documents.
1 Assignment
0 Petitions
Accused Products
Abstract
A computer-implemented system and method is disclosed for retrieving documents using context-dependant probabilistic modeling of words and documents. The present invention uses multiple overlapping vectors to represent each document. Each vector is centered on each of the words in the document and includes the local environment. The vectors are used to build probability models that are used for predictions of related documents and related keywords. The results of the statistical analysis are used for retrieving an indexed document, for extracting features from a document, or for finding a word within a document. The statistical evaluation is also used to evaluate the probability of relation between the key words appearing in the document and building a vocabulary of key words that are generally found together. The results of the analysis are stored in a repository. Searches of the data repository produce a list of related documents and a list of related terms. The user may select from the list of documents and/or from the list of related terms to refine the search and retrieve those documents which meet the search goal of the user with a minimum of extraneous data.
74 Citations
12 Claims
-
1. A processor implemented method of finding related documents that are relevant to one or more known documents, comprising:
- based on the known documents, automatically finding relevant keywords;
using the keywords, automatically searching for documents that are relevant to the keywords to generate a search result list that includes the known documents; and
removing the known documents from the result list to obtain a list of the related documents. - View Dependent Claims (2, 3, 4, 5, 6, 7)
- based on the known documents, automatically finding relevant keywords;
-
8. A computer program product having program code stored on a computer usable medium, the program code configured to cause a computer to implement a method for finding related documents that are relevant to one or more known documents, comprising:
- a first set of program instructions which, based on the known documents, automatically find relevant keywords;
a second set of program instructions that use the keywords to automatically search for documents that are relevant to the keywords, and to generate a search result list that includes the known documents; and
a third set of program instructions that remove the known documents from the result list to obtain a list of the related documents. - View Dependent Claims (9, 10, 11)
- a first set of program instructions which, based on the known documents, automatically find relevant keywords;
-
12. A processor implemented system for finding related documents that are relevant to one or more known documents, comprising:
- a first module for automatically finding relevant keywords based on the known documents;
a second module that uses the keywords for automatically searching for documents that are relevant to the keywords, to generate a search result list that includes the known documents; and
a third module for removing the known documents from the result list to obtain a list of the related documents.
- a first module for automatically finding relevant keywords based on the known documents;
Specification