×

Learning a document ranking using a loss function with a rank pair or a query parameter

  • US 7,593,934 B2
  • Filed: 07/28/2006
  • Issued: 09/22/2009
  • Est. Priority Date: 07/28/2006
  • Status: Active Grant
First Claim
Patent Images

1. A computer system for generating a ranking function to rank relevance of a document to a query, comprising:

  • a processor; and

    a memory for storinga collection of queries, resultant documents, and relevance of each resultant document to its query, the collection being generated by submitting the queries to a search engine with search results for each query being the resultant documents for that query and inputting the relevance of each resultant document to its query; and

    an application program for execution by the processor comprising;

    a component that trains a ranking function using the resultant documents and their relevances by weighting incorrect rankings of relevant resultant documents more heavily than incorrect rankings of not relevant resultant documents so that the ranking function more correctly ranks relevant resultant documents than it does not relevant resultant documents, wherein a different weighting is used for each rank pair where a rank pair represents a combination of two different relevance classifications, the ranking function being trained byfor each resultant document, generating a feature vector of features for the resultant document,for each query, generating ordered pairs of resultant documents with different relevances;

    for each feature, initializing a current weighting parameter for the feature, the current weighting parameters forming the ranking function; and

    modifying the current weighting parameters of the ranking function by iteratively applying the ranking function with current weighting parameters to the feature vectors of each pair of resultant documents and when the ranking for the resultant documents of a pair is in error, adjusting the weighting parameters by comparing an evaluation measure of incorrect rankings of documents to an evaluation measure of correct rankings of the documents, wherein the weighting is set to an average of differences between the evaluation measure of the correct ranking and the evaluation measure of the incorrect rankings, such that an error in ranking is weighted more heavily when a resultant document with a higher relevance is ranked incorrectly than when a resultant document with a lower relevance is ranked incorrectly; and

    a component that ranks relevance of a document to a query that is not part of the collectionwherein the component that trains the ranking function uses a gradient descent algorithm.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×