System and method for indexing web content using click-through features
First Claim
1. A method for indexing content items based on click-through features, the method comprising:
- generating a training set comprising one or more query-content item pairs, wherein a given query-content item pair has one or more click-through features associated therewith;
labeling one or more query-content item pairs in the training set by assigning click score thereto based on the one or more click-through features thereof;
determining a click score function based on the click scores of the labeled query-content item pairs and the click-through features thereof;
applying the click score function to a plurality of unlabeled query-content item pairs to determine click scores thereof based on the one or more click-through features of the unlabeled query-content item pairs.
9 Assignments
0 Petitions
Accused Products
Abstract
The system and method of the present invention allows for the determination of the relevance of a content item to a query through the use of a machine learned relevance function that incorporates click-through features of the content items. A method for selecting a relevance function to determine a relevance of a query-content item pair comprises generating training set having one or more query-URL pairs labeled for relevance based on their click-through features. The labeled query-URL pairs are used to determine the relevance function by minimizing a loss function that accounts for click-through features of the content item. The computed relevance function is then applied to the click-though features of unlabeled content items to assign relevance scores thereto. An inverted click-through index of query-score pairs is formed and combined with the content index to improve relevance of search results.
-
Citations
16 Claims
-
1. A method for indexing content items based on click-through features, the method comprising:
-
generating a training set comprising one or more query-content item pairs, wherein a given query-content item pair has one or more click-through features associated therewith;
labeling one or more query-content item pairs in the training set by assigning click score thereto based on the one or more click-through features thereof;
determining a click score function based on the click scores of the labeled query-content item pairs and the click-through features thereof;
applying the click score function to a plurality of unlabeled query-content item pairs to determine click scores thereof based on the one or more click-through features of the unlabeled query-content item pairs. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system for indexing and searching content items based on its click-through features, the system comprising:
-
an index component operative to determine a click score function based on a training set of labeled query-content item pairs and the click-through features thereof, assign click scores to a plurality of unlabeled query-content item pairs through application of the click score function to the one or more click-through features and generate an inverted click-through index of the unlabeled content items and the associated query-score pairs;
a relevance engine operative to receive one or more query scores for one or more content items and generate one or more relevance scores therefore; and
a search engine operative to retrieve one or more content items in a result set in response to receipt of the query from the user and order the content items in the result set according to the relevance scores from the relevance engine. - View Dependent Claims (11, 12, 13, 14, 15, 16)
-
Specification