×

Efficient retrieval algorithm by query term discrimination

  • US 7,925,644 B2
  • Filed: 02/27/2008
  • Issued: 04/12/2011
  • Est. Priority Date: 03/01/2007
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for use in information retrieval, the method comprising:

  • for each of a plurality of terms, selecting a predetermined number of top scoring documents for the term to form a corresponding document set for the term;

    receiving a query comprising a plurality of query terms;

    ranking the plurality of query terms received in the query based at least in part on the corresponding document sets for each of the plurality of query terms, wherein the ranking comprises using an inverse document frequency algorithm;

    selecting a number of ranked query terms from the plurality of query terms, wherein each selected ranked query term comprises its corresponding document set and each document in a respective document set comprises a document identification number;

    forming a union set based on the document sets associated with the selected number of ranked query terms; and

    for a document identification number in the union set, scanning a document set corresponding to an unselected query term for a matching document identification number, wherein the unselected query term is included in the query comprising the plurality of query terms.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×