Content-based information retrieval

  • US 8,346,800 B2
  • Filed: 04/02/2009
  • Issued: 01/01/2013
  • Est. Priority Date: 04/02/2009
  • Status: Active Grant
First Claim

1. A computer-implemented method of similar item retrieval comprising:

  • receiving a query item;

    analyzing content of the query item, the analyzing comprising identifying tokens in that query item using a library of tokens, wherein each token comprises a symbol representing a cluster of features;

    dynamically forming a classifier, using a processor, at query time on the basis of the query item's content and a training set of items, wherein the training set comprises a plurality of pairs of items and a plurality of background items such that for each pair, the items in that pair are specified as similar to one another, the forming the classifier comprising choosing a subset of the identified tokens such that, on the training set as many as possible of the similar pairs have the chosen subset of tokens while the number of background items containing the subset of tokens is below a specified bound; and

    using the classifier to select a plurality of items from a database of items.
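
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation, and everything named in it is an assumption made for illustration: items are modelled as sets of token symbols, tokens are assigned by nearest cluster centre, and the subset is chosen with a simple greedy heuristic. The claim itself does not prescribe how features are matched to tokens or how the subset search is carried out. The names `identify_tokens`, `form_classifier`, and `select_items` are hypothetical.

```python
from typing import Dict, FrozenSet, Iterable, List, Sequence, Tuple

Item = FrozenSet[str]        # an item is modelled as the set of token symbols it contains
Vector = Tuple[float, ...]   # a feature vector extracted from an item (assumed representation)


def identify_tokens(features: Iterable[Vector], library: Dict[str, Vector]) -> Item:
    """Map each feature to the symbol of its nearest cluster centre.

    Nearest-centre assignment is an assumption; the claim only says that each
    token is a symbol representing a cluster of features.
    """
    def nearest(f: Vector) -> str:
        # Squared Euclidean distance; sorted() makes tie-breaking deterministic.
        return min(sorted(library),
                   key=lambda s: sum((a - b) ** 2 for a, b in zip(f, library[s])))
    return frozenset(nearest(f) for f in features)


def form_classifier(query_tokens: Iterable[str],
                    similar_pairs: Sequence[Tuple[Item, Item]],
                    background: Sequence[Item],
                    bound: int) -> Item:
    """Greedily choose a subset of the query's tokens at query time.

    Tokens are added one at a time, each time keeping as many similar pairs as
    possible, until fewer than `bound` background items contain the subset.
    Greedy search is a heuristic and is not guaranteed to find the best subset.
    """
    chosen: set = set()
    remaining = set(query_tokens)

    def pairs_covered(s: set) -> int:
        # A pair "has" the subset when both of its items contain every token in it.
        return sum(1 for a, b in similar_pairs if s <= a and s <= b)

    def background_hits(s: set) -> int:
        return sum(1 for item in background if s <= item)

    while remaining and background_hits(chosen) >= bound:
        best = max(sorted(remaining),
                   key=lambda t: (pairs_covered(chosen | {t}),
                                  -background_hits(chosen | {t})))
        chosen.add(best)
        remaining.remove(best)
    return frozenset(chosen)


def select_items(classifier: Item, database: Sequence[Item]) -> List[Item]:
    """Return the database items that contain every token in the classifier."""
    return [item for item in database if classifier <= item]
```

In this sketch the classifier is simply the chosen token subset, and retrieval is a containment test against each database item. If the query's tokens run out before the background count drops below the bound, `form_classifier` returns the full token set, so a caller would need to check the bound separately.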
