LEARNING-BASED IMAGE PAGE INDEX SELECTION
First Claim
Patent Images
1. A system, comprising:
- a data sampling and labeling component that samples training webpages having images to create positive webpage samples and negative webpage samples;
a feature generation component that extracts features from the training webpages;
a model creation component that creates a selection model based on the sampled training webpages and the features;
a validation component that evaluates and validates the selection model using benchmark data to obtain an optimal selection model;
a prediction component that predicts an image webpage static rank based on the optimal selection model; and
a microprocessor that executes computer-executable instructions associated with at least one of the data sampling and labeling component, feature generation component, model creation component, validation component, or prediction component.
2 Assignments
0 Petitions
Accused Products
Abstract
Architecture that performs image page index selection. A learning-based framework learns a statistical model based on the hyperlink (URL-uniform resource locator) previous click information obtained from the image search users. The learned model can combine the features of a newly discovered URL to predict the possibility of the newly-discovered URL being clicked in the future image search. In addition to existing web index selection features, image clicks are added as features, and the image clicks are aggregated over different URL segments, as well as the site modeling pattern trees to reduce the sparse problem of the image click information.
27 Citations
20 Claims
-
1. A system, comprising:
-
a data sampling and labeling component that samples training webpages having images to create positive webpage samples and negative webpage samples; a feature generation component that extracts features from the training webpages; a model creation component that creates a selection model based on the sampled training webpages and the features; a validation component that evaluates and validates the selection model using benchmark data to obtain an optimal selection model; a prediction component that predicts an image webpage static rank based on the optimal selection model; and a microprocessor that executes computer-executable instructions associated with at least one of the data sampling and labeling component, feature generation component, model creation component, validation component, or prediction component. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method, performed by a computer system executing machine-readable instructions, the method comprising acts of:
-
sampling and labeling webpages having images as training webpages to create positive webpages and negative webpages; extracting features from the training webpages; creating a selection model based on the training webpages and the features; evaluating and validating the selection model using benchmark data to obtain an optimal selection model; and predicting an image webpage rank based on the optimal selection model. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A computer-readable medium comprising instructions that when executed by a processor cause the processor to perform a method comprising acts of:
-
sampling and labeling training webpages having images to create positive webpages and negative webpages derived from user clicks as labeling criteria; extracting historical features from the training webpages, the historical features of each webpage accumulated and merged to a specific time; creating a statistical selection model based on the sampled training webpages and the features; evaluating and validating the selection model using benchmark data to obtain an optimal selection model; applying the optimal selection model to newly-discovered URLs to compute corresponding selection scores; and selecting a set of top-ranked URLs as candidates for indexing based on the scores. - View Dependent Claims (18, 19, 20)
-
Specification