Efficient Retrieval Algorithm by Query Term Discrimination

  • US 20080215574A1
  • Filed: 02/27/2008
  • Published: 09/04/2008
  • Est. Priority Date: 03/01/2007
  • Status: Active Grant
First Claim

1. A method for use in information retrieval, the method comprising:

    for each of a plurality of terms, selecting a predetermined number of top scoring documents for the term to form a corresponding document set for the term;

    receiving a plurality of terms, optionally as a query;

    ranking the plurality of terms for importance based at least in part on the document sets for the plurality of terms wherein the ranking comprises using an inverse document frequency algorithm;

    selecting a number of ranked terms based on importance wherein each selected, ranked term comprises its corresponding document set wherein each document in a respective document set comprises a document identification number;

    forming a union set based on the document sets associated with the selected number of ranked terms; and

    for a document identification number in the union set, scanning a document set corresponding to an unselected term for a matching document identification number.
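
The steps of claim 1 can be sketched in code. The following is a minimal illustration under stated assumptions, not the patented implementation: the names `build_document_sets` and `retrieve`, the parameters `k`, `num_selected`, `doc_freq` and `num_docs`, and the use of summed per-term scores to order the results are all hypothetical. The claim itself only requires scanning the unselected terms' document sets for matching document identification numbers. Importance is taken here to be plain inverse document frequency, log(N / df), with document frequencies supplied separately.

```python
import math


def build_document_sets(index, k):
    """Keep the k top-scoring documents per term.

    `index` maps term -> {doc_id: score}. Both the layout and the
    name are assumptions made for this sketch.
    """
    doc_sets = {}
    for term, postings in index.items():
        # Sort by descending score, breaking ties by document id.
        top = sorted(postings.items(), key=lambda p: (-p[1], p[0]))[:k]
        doc_sets[term] = dict(top)
    return doc_sets


def retrieve(query_terms, doc_sets, doc_freq, num_docs, num_selected):
    """Rank terms by IDF, union the document sets of the most
    important terms, then look up the remaining terms only for
    document ids already in that union."""
    terms = [t for t in query_terms if t in doc_sets]

    # Rank terms for importance with inverse document frequency.
    ranked = sorted(
        terms,
        key=lambda t: math.log(num_docs / doc_freq[t]),
        reverse=True,
    )
    selected, unselected = ranked[:num_selected], ranked[num_selected:]

    # Union of the document sets of the selected terms.
    # Summing scores is an assumption; the claim does not specify it.
    scores = {}
    for term in selected:
        for doc_id, score in doc_sets[term].items():
            scores[doc_id] = scores.get(doc_id, 0.0) + score

    # For each document id in the union, check the document sets of
    # the unselected terms for a matching id. A dict lookup stands in
    # for the scan described in the claim.
    for doc_id in list(scores):
        for term in unselected:
            match = doc_sets[term].get(doc_id)
            if match is not None:
                scores[doc_id] += match

    return sorted(scores.items(), key=lambda p: (-p[1], p[0]))
```

In this sketch, documents that appear only in the document set of an unselected (low-importance) term never enter the union, so common terms add to the scores of existing candidates without enlarging the candidate set.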
