×

Representative document selection for a set of duplicate documents

  • US 8,868,559 B2
  • Filed: 08/30/2012
  • Issued: 10/21/2014
  • Est. Priority Date: 07/03/2003
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method, comprising:

  • at a computing device having one or more processors and memory;

    obtaining a plurality of documents, wherein a respective document in the plurality of document is associated with a query independent score;

    selecting a first document in the plurality of documents in accordance with a query independent score associated with the first document, whereinthe first document has a fingerprint that indicates that the first document has substantially identical content to every other document in the plurality of documents;

    indexing, in accordance with the query independent score, the first document thereby producing an indexed first document; and

    with respect to the plurality of documents, including only the indexed first document in a document index.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×