REPRESENTATIVE DOCUMENT SELECTION FOR A SET OF DUPLICATE DOCUMENTS

  • US 20120323896A1
  • Filed: 08/30/2012
  • Published: 12/20/2012
  • Est. Priority Date: 07/03/2003
  • Status: Active Grant
First Claim

1. A method, comprising:

  • at a computing device having one or more processors and memory;

    selecting a first document in a plurality of documents on the basis that the first document is associated with a query independent score, wherein the first document has a fingerprint that indicates that the first document has substantially identical content to every other document in the plurality of documents;

    indexing, in accordance with the query independent score, the first document thereby producing an indexed first document; and

    with respect to the plurality of documents, including only the indexed first document in a document index.
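The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation. It assumes that a fingerprint is a hash of whitespace- and case-normalized content, that the query-independent score is a precomputed number, and that the representative is the duplicate with the highest score (the claim only says selection is based on the score). The names `Document`, `fingerprint`, `select_representative`, and `build_index` are hypothetical.

```python
import hashlib
from dataclasses import dataclass


@dataclass(frozen=True)
class Document:
    url: str
    content: str
    score: float  # query-independent score, assumed to be precomputed


def fingerprint(content: str) -> str:
    """Hash of normalized content (assumption: lowercase, collapsed whitespace)."""
    normalized = " ".join(content.lower().split())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def select_representative(duplicates: list[Document]) -> Document:
    """Pick one document from a duplicate set (assumption: highest score wins)."""
    return max(duplicates, key=lambda d: d.score)


def build_index(documents: list[Document]) -> dict[str, Document]:
    """Group documents by fingerprint and index only one representative per group."""
    groups: dict[str, list[Document]] = {}
    for doc in documents:
        groups.setdefault(fingerprint(doc.content), []).append(doc)
    # Only the selected representative of each duplicate set enters the index.
    return {fp: select_representative(group) for fp, group in groups.items()}


docs = [
    Document("http://a.example/page", "Hello  World", 0.2),
    Document("http://b.example/page", "hello world", 0.9),
    Document("http://c.example/other", "Something else", 0.1),
]
index = build_index(docs)
```

Here the first two documents share a fingerprint under the assumed normalization, so only the higher-scored one is kept in `index`; the third document forms its own single-member set.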
