Automatic generation of embedded signatures for duplicate detection on a public network

  • US 20090299994A1
  • Filed: 05/30/2008
  • Published: 12/03/2009
  • Est. Priority Date: 05/30/2008
  • Status: Active Grant
First Claim

1. A method comprising:

  • identifying at least one set of words in a first electronic document, said set of words having a frequency of occurrence in a first collection of electronic documents that is below a predetermined threshold; and

  • transmitting a query to search a second collection of electronic documents for any electronic documents that contain the said set of words.
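
The two steps of claim 1 can be sketched roughly as follows. This is only an illustrative reading, not the patented implementation: it assumes a "set of words" is a contiguous word n-gram, that "frequency of occurrence" is the fraction of documents in the first collection containing that n-gram, and that the query is a quoted-phrase string. The function names (`rare_word_sets`, `build_query`) and the parameters `n` and `threshold` are hypothetical, and the actual transmission of the query to a search service is left out because the claim does not name one.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def ngrams(tokens, n):
    """Return all contiguous n-word tuples in the token list."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def document_frequency(collection, n):
    """Count, for each n-gram, how many documents in the collection contain it."""
    df = Counter()
    for doc in collection:
        df.update(set(ngrams(tokenize(doc), n)))
    return df


def rare_word_sets(document, collection, n=3, threshold=0.01):
    """Step 1 (assumed reading): find n-grams of `document` whose document
    frequency in the first collection is below `threshold`."""
    df = document_frequency(collection, n)
    total = max(len(collection), 1)  # avoid division by zero on an empty collection
    seen = set()
    rare = []
    for gram in ngrams(tokenize(document), n):
        if gram not in seen and df[gram] / total < threshold:
            seen.add(gram)
            rare.append(gram)
    return rare


def build_query(word_sets):
    """Step 2 (assumed reading): build a quoted-phrase query string that a
    search service for the second collection could accept."""
    return " OR ".join('"' + " ".join(ws) + '"' for ws in word_sets)


if __name__ == "__main__":
    first_collection = [
        "the cat sat on the mat",
        "the cat sat on the sofa",
    ]
    first_document = "the cat sat on the purple zeppelin"
    signatures = rare_word_sets(first_document, first_collection, n=3, threshold=0.5)
    print(build_query(signatures))
```

In this sketch the rare n-grams act as the document's signature: phrases that are common in the first collection are skipped, so only distinctive phrases are sent as search terms against the second collection.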
