×

Ranking search results using language types

  • US 7,792,833 B2
  • Filed: 04/26/2006
  • Issued: 09/07/2010
  • Est. Priority Date: 03/03/2005
  • Status: Expired due to Fees
First Claim
Patent Images

1. A computer-implemented method for ranking search results, comprising:

  • determining a first property associated with each document in a collection of documents;

    wherein the first property is a language type associated with the document that identifies a language of the document;

    wherein the language of the document is determined by performing a statistical analysis of a character distribution in the document and comparing it to a trained language character distribution;

    storing an identified language for each of the documents when it is determined that the identified language is not a default language in a language storage that is a query independent rank (QIR) storage that is separate from a QIR storage that stores other values used at query time;

    determining a query language of a search query;

    estimating a ranking value corresponding to properties for each document, wherein the ranking value corresponds to a measure of the relevance of each document based on the search query;

    ranking each document that is responsive to the search query to obtain the search results, wherein each document is ranked based on the estimated ranking value and a comparison of the query language with the first property value;

    ranking the documents according to a scoring function (score) that is determined according to at least;

    a computed click distance (CD), a weight of a query-independent component (wcd), a weight of the click distance (bcd), a weight of a URL depth (bud), the URL depth (UD) and a click distance saturation constant (Kcd); and

    using the ranking of the documents to display the search results.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×