Propagating relevance from labeled documents to unlabeled documents
First Claim
1. A system for propagating relevance of labeled documents to unlabeled documents, comprising:
- a document store that contains representations of documents, some of the documents being labeled with relevance to a query and others of the documents not being labeled with relevance to the query;
a graph component that creates a graph of the documents with the documents represented as nodes being connected by edges representing similarity between documents; and
a propagate relevance component that propagates relevance of the labeled documents to the unlabeled documents based on the similarity between documents as indicated by the similarity represented by the edges of the graph.
2 Assignments
0 Petitions
Accused Products
Abstract
A method and system for propagating the relevance of labeled documents to a query to unlabeled documents is provided. The propagation system provides training data that includes queries, documents labeled with their relevance to the queries, and unlabeled documents. The propagation system then calculates the similarity between pairs of documents in the training data. The propagation system then propagates the relevance of the labeled documents to similar, but unlabeled, documents. The propagation system may iteratively propagate labels of the documents until the labels converge on a solution. The training data with the propagated relevances can then be used to train a ranking function.
79 Citations
20 Claims
-
1. A system for propagating relevance of labeled documents to unlabeled documents, comprising:
-
a document store that contains representations of documents, some of the documents being labeled with relevance to a query and others of the documents not being labeled with relevance to the query;
a graph component that creates a graph of the documents with the documents represented as nodes being connected by edges representing similarity between documents; and
a propagate relevance component that propagates relevance of the labeled documents to the unlabeled documents based on the similarity between documents as indicated by the similarity represented by the edges of the graph. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A system for propagating relevance of labeled pages to a query to unlabeled pages to the query, comprising:
-
a page store that contains representations of pages, some of the pages being labeled with relevance to a query and others of the pages not being labeled with relevance to the query;
a graph component that creates a graph of the pages with the pages represented as nodes connected by edges representing similarity between the pages, including;
a build graph component that builds a graph in which nodes representing similar pages are connected via edges; and
a generate weights component that generates weights for the edges based on similarity of the pages represented by the connected nodes; and
a propagate relevance component that propagates relevance of the labeled pages to the unlabeled pages based on the similarity between pages as indicated by the similarity represented by the edges of the graph and based on a manifold ranking algorithm. - View Dependent Claims (13, 14, 15, 16, 17)
-
-
18. A computer-readable medium containing instructions for controlling a computer system to propagate relevance of documents to a query to other documents, by a method comprising:
-
creating a graph of the documents represented as nodes connected by edges having weights representing similarity between documents; and
propagating the relevance of the labeled documents to the unlabeled documents based on the weights of the edges between nodes using a manifold ranking based algorithm. - View Dependent Claims (19, 20)
-
Specification