×

Assigning human-understandable labels to web pages

  • US 8,185,528 B2
  • Filed: 06/23/2008
  • Issued: 05/22/2012
  • Est. Priority Date: 06/23/2008
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method of labeling a web page from a host, comprising:

  • a. estimating a language model comprising an association of words from content of the web page;

    b. collecting, from a set of web documents linking to the web page, a set of inbound labels for the web page, the set of inbound labels comprises data from an anchor text of a link to the web page on at least one web document of the set of web documents and text of a search query that results in a click-through to the web page;

    c. computing a likelihood of generating each inbound label from the collected set of inbound labels for the web page given the estimated language model and assigning a score to each inbound label based on the computed likelihood; and

    d. assigning a label to the web page used on a search results page that returns the web page based on the assigned score to each inbound label from the collected set of inbound labels for the web page.

View all claims
  • 9 Assignments
Timeline View
Assignment View
    ×
    ×