Assigning Human-Understandable Labels to Web Pages
First Claim
Patent Images
1. A computer-implemented method of labeling a web page from a host, comprising:
- a. estimating a language model for the web page;
b. collecting, from the set of web documents linking to the web page, a set of inbound labels for the web page;
c. computing the likelihood of generating each inbound label given the language model and assigning a score to each inbound label based on this likelihood; and
d. assigning a label to the web page based on the scores assigned to each of the set of inbound labels.
9 Assignments
0 Petitions
Accused Products
Abstract
Methods and systems that label a web page collect a set of inbound labels for the web page, estimate a language model for the web page, compute the likelihood of generating each inbound label given the language model and assign a score to each inbound label based on this likelihood, and assign a label to the web page based on the score assigned to each of the set of inbound labels. Inbound labels are preferably collected from the set of web documents linking to the web page. Labels assigned are useful in providing labeled links to web pages from top hosts in search result pages.
29 Citations
25 Claims
-
1. A computer-implemented method of labeling a web page from a host, comprising:
-
a. estimating a language model for the web page; b. collecting, from the set of web documents linking to the web page, a set of inbound labels for the web page; c. computing the likelihood of generating each inbound label given the language model and assigning a score to each inbound label based on this likelihood; and d. assigning a label to the web page based on the scores assigned to each of the set of inbound labels. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. An offline processing module for labeling a web page from a host, comprising:
-
a. a label collection element configured to collect, from the set of web documents linking to the web page, a set of inbound labels for the web page; b. a language model estimator configured to estimate a language model for the web page; c. a computation element configured to compute the likelihood of generating each inbound label given the language model and to assign a score to each inbound label based on this likelihood; and d. a label assignment element configured to assign a label to the web page from the set of inbound labels based on the scores assigned to each inbound label. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A system for providing labeled links to top web pages from a top host in a search results page, comprising:
-
a. an offline processing module configured to provide labels for top web pages from top hosts by collecting a set of inbound labels for the web pages, estimating a language model for each web page, and assigning label to each web page based on a computation involving the language model; b. an element configured to select a set of top web pages from a top host; and c. an online module that publishes the labels of each of the top web pages of the top host in the search result for the top host. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24, 25)
-
Specification