Method and system for improving a text search
First Claim
1. A method for improving a text search comprising the steps of:
- (a) preprocessing a plurality of documents, including performing relationship mining that provides a network of document relationships for the documents and relationship metadata, wherein a relevance ranking is determined during the preprocessing;
(b) receiving an identification from a user of at least one candidate document from a first plurality of documents obtained via a text search query provided by the user, the text search query using the preprocessed documents, wherein the step (a) of preprocessing is performed before the first plurality of documents are obtained via the text search query;
(c) locating a second plurality of documents that are related to the at least one candidate document by the relationship metadata, wherein the network of document relationships is used to locate the second plurality of documents; and
(d) providing a third plurality of documents to the user as search results to the text search query, each of the third plurality of documents being provided based upon the at least one candidate document and the number of relationships it has with the first and second plurality of documents, wherein the network of document relationships and the relevance ranking are used to provide the third plurality of documents.
0 Assignments
0 Petitions
Accused Products
Abstract
A method and system for improving text searching is disclosed. The method and system provides a network of document relationship and utilizes the network of document relationships to identify the region of documents that can be used to satisfy a user'"'"'s request. In a preferred embodiment, the text searching method in accordance with the present invention augments a conventional text search by using information on document relationships and metadata. The text searching method and system improves upon conventional text search techniques by incorporating relationship metadata to define regions to search within. In the present invention the definition of a region is not limited to just categories as it includes neighborhoods around individual documents and sets which have been user defined.
-
Citations
12 Claims
-
1. A method for improving a text search comprising the steps of:
-
(a) preprocessing a plurality of documents, including performing relationship mining that provides a network of document relationships for the documents and relationship metadata, wherein a relevance ranking is determined during the preprocessing; (b) receiving an identification from a user of at least one candidate document from a first plurality of documents obtained via a text search query provided by the user, the text search query using the preprocessed documents, wherein the step (a) of preprocessing is performed before the first plurality of documents are obtained via the text search query; (c) locating a second plurality of documents that are related to the at least one candidate document by the relationship metadata, wherein the network of document relationships is used to locate the second plurality of documents; and (d) providing a third plurality of documents to the user as search results to the text search query, each of the third plurality of documents being provided based upon the at least one candidate document and the number of relationships it has with the first and second plurality of documents, wherein the network of document relationships and the relevance ranking are used to provide the third plurality of documents. - View Dependent Claims (2, 3, 4)
-
-
5. A system for improving a text search comprising:
-
means for preprocessing a plurality of documents, including means for performing relationship mining that provides a network of document relationships for the documents and relationship metadata, wherein a relevance ranking is determined during the preprocessing; means for receiving an identification from a user of at least one candidate document from a first plurality of documents obtained via a text search query provided by the user, the text search query using the preprocessed documents, wherein the means for preprocessing performs the preprocessing before the first plurality of documents are obtained via the text search query; means for locating a second plurality of documents that are related to the at least one candidate document by the relationship metadata, wherein the network of document relationships is used by the means for locating the second plurality of documents; and means for providing a third plurality of documents to the user as search results to the text search query, each of the third plurality of documents being provided based upon the at least one candidate document and the number of relationships it has with the first and second plurality of documents, wherein the network of document relationships and the relevance ranking are used by the means for providing the third plurality of documents. - View Dependent Claims (6, 7, 8)
-
-
9. A computer readable medium containing program instructions for improving a text search comprising:
-
(a) preprocessing a plurality of documents, including performing relationship mining that provides a network of document relationships for the documents and relationship metadata, wherein a relevance ranking is determined during the preprocessing; (b) receiving an identification from a user of at least one candidate document from a first plurality of documents obtained via a text search query provided by the user, the text search query using the preprocessed documents, wherein the step (a) of preprocessing is performed before the first plurality of documents are obtained via the text search query; (c) locating a second plurality of documents that are related to the at least one candidate document by the relationship metadata, wherein the network of document relationships is used to locate the second plurality of documents; and (d) providing a third plurality of documents to the user as search results to the text search query, each of the third plurality of documents being provided based upon the at least one candidate document and the number of relationships it has with the first and second plurality of documents, wherein the network of document relationships and the relevance ranking are used to provide the third plurality of documents. - View Dependent Claims (10, 11, 12)
-
Specification