×

Grouping and differentiating files based on content

  • US 9,053,120 B2
  • Filed: 12/15/2009
  • Issued: 06/09/2015
  • Est. Priority Date: 07/16/2009
  • Status: Active Grant
First Claim
Patent Images

1. In a computing system environment, a method of differentiating files stored on one or more computing devices, each file having a plurality of symbols derived from an underlying data stream of all original bits of raw data of said each file, comprising:

  • encoding said each file as a plurality of symbols representing an underlying data stream of all original bits of binary data of the file;

    determining a number of occurrences of each said symbol in said each file; and

    computing a distance between said each file and every other file based on the determined number of occurrences.

View all claims
  • 16 Assignments
Timeline View
Assignment View
    ×
    ×