Grouping and differentiating files based on underlying grouped and differentiated files
First Claim
1. In a computing system environment, a method of differentiating files stored on one or more computing devices, comprising:
- receiving a plurality of compressed original files, said compressed original files being encoded as a plurality of first symbols derived from original data of said original files;
determining a frequency count for each first symbol of the plurality of symbols for each of the plurality of compressed original files;
determining an original distance relationship between each of said plurality of compressed original files using said frequency counts, said original distance relationship being a distance in an informational mapping space defined by the total number of said first symbols in said plurality of compressed original files;
converting said original distance relationships into a plurality of new files;
encoding each of said new files as a plurality of second symbols derived from said plurality of new files thereby generating a plurality of compressed new files;
determining a frequency count for each said second symbol for each of the plurality of compressed new files;
determining a second distance relationship between each of said compressed new files using said frequency counts for each said second symbol; and
differentiating said original files based on the determined second distance relationships.
16 Assignments
0 Petitions
Accused Products
Abstract
Methods and apparatus teach a digital spectrum of a file. The digital spectrum is used to map a file'"'"'s position. This position relative to another file'"'"'s position reveals closest neighbors. When multiple such neighbors are arranged, first “patterns” of data are created that further define digital spectrums of new files. It is within this sorted new data that emergent relationships or second “patterns” are examined, according to the techniques for its underlying files, or “patterns of patterns.” Representatively, original files are stored on computing devices. If encoded, they have pluralities of symbols representing an underlying data stream of original bits of data. The original files are examined for relationships between each of the files. The original relationships are converted to new files. The new files are representatively encoded and examined for other relationships. The new files are then grouped or differentiated from one another based these new relationships yielding insight into how the original files can be grouped or differentiated.
91 Citations
15 Claims
-
1. In a computing system environment, a method of differentiating files stored on one or more computing devices, comprising:
-
receiving a plurality of compressed original files, said compressed original files being encoded as a plurality of first symbols derived from original data of said original files; determining a frequency count for each first symbol of the plurality of symbols for each of the plurality of compressed original files; determining an original distance relationship between each of said plurality of compressed original files using said frequency counts, said original distance relationship being a distance in an informational mapping space defined by the total number of said first symbols in said plurality of compressed original files; converting said original distance relationships into a plurality of new files; encoding each of said new files as a plurality of second symbols derived from said plurality of new files thereby generating a plurality of compressed new files; determining a frequency count for each said second symbol for each of the plurality of compressed new files; determining a second distance relationship between each of said compressed new files using said frequency counts for each said second symbol; and differentiating said original files based on the determined second distance relationships. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. In a computing system environment, a method of differentiating original files stored on one or more computing devices, each original file compressed as a plurality of first symbols derived from an underlying data stream of original bits of digital data of said original files, comprising:
-
determining a frequency count for each said first symbol of said each compressed original file; determining an original distance relationship between each of said plurality of compressed original files from said frequency counts, said original distance relationship being a distance in an informational mapping space defined by the total number of said first symbols in said plurality of compressed original files; converting said original distance relationships into a plurality of new files; compressing said new files as a plurality of second symbols; determining a frequency count for each said second symbol for each of the plurality of compressed new files; determining a second distance relationship between each of said compressed new files using said second symbol frequency counts; and differentiating said original files by grouping together ones of the new files according to said second distance relationship. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. In a computing system environment, a method of differentiating files stored on one or more computing devices, comprising:
-
encoding a plurality of original files as a plurality of first symbols each representing an optimal compression of all bits of original data of said original files to provide a plurality of compressed original files; determining a frequency count for each first symbol of the plurality of symbols for each of the plurality of compressed original files; determining an original distance relationship between each of the original files using said frequency counts, said original distance relationship being a distance in an informational mapping space defined by the total number of said first symbols in each said compressed original file; converting said original distance relationship into pluralities of new files of binary data encoding each said new file as a plurality of second symbols to provide a plurality of compressed new files; determining a frequency count for each said second symbol; determining a second distance relationship between each said compressed new file using said second symbol frequency counts; and differentiating said original files based on the second distance relationship. - View Dependent Claims (14, 15)
-
Specification