×

COMPARING SIMILARITY BETWEEN DOCUMENTS FOR FILTERING UNWANTED DOCUMENTS

  • US 20110055332A1
  • Filed: 08/28/2009
  • Published: 03/03/2011
  • Est. Priority Date: 08/28/2009
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method of determining similarity between a reference document and a candidate document, comprising:

  • segmenting the reference document into a plurality of reference data items;

    segmenting the candidate document into a plurality of document data items;

    computing a count representing a number of document data items matching the reference data items; and

    computing a similarity index representing similarity between the reference document and the candidate document based on the count.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×