Hash file system and method for use in a commonality factoring system
Abstract
A system and method for a computer file system that is based and organized upon hashes and/or strings of digits of certain, different, or changing lengths and which is capable of eliminating or screening redundant copies of aggregate blocks of data (or parts of data blocks) from the system. The hash file system of the present invention utilizes hash values for computer files or file pieces which may be produced by a checksum generating program, engine or algorithm such as industry standard MD4, MD5, SHA or SHA-1 algorithms. Alternatively, the hash values may be generated by a checksum program, engine, algorithm or other means that produces an effectively unique hash value for a block of data of indeterminate size based upon a non-linear probabilistic mathematical algorithm.
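The screening of redundant copies described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class name `HashStore`, its `put` and `get` methods, and the in-memory dictionary are hypothetical, and SHA-1 is chosen only because the abstract names it as one possible algorithm.

```python
import hashlib


class HashStore:
    """Toy content-addressed store (hypothetical): one copy per distinct block."""

    def __init__(self):
        # Maps a hex digest (the hash value) to the block of data it identifies.
        self.blocks = {}

    def put(self, data: bytes) -> str:
        # Produce the hash value for the block.
        digest = hashlib.sha1(data).hexdigest()
        # Store the block only if this hash value has not been seen before,
        # so a redundant copy is screened out instead of stored again.
        if digest not in self.blocks:
            self.blocks[digest] = data
        return digest

    def get(self, digest: str) -> bytes:
        # Retrieve a block by its hash value.
        return self.blocks[digest]


store = HashStore()
first = store.put(b"hello world")
second = store.put(b"hello world")  # same content, so no second copy is kept
```

Both calls return the same hash value and the store holds a single block. A real system would also need persistent storage and a policy for the (improbable) case of two different blocks producing the same hash value.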
40 Claims
1. A method for managing data comprising:
producing a probabilistically unique identifier for a digital sequence; and
comparing said probabilistically unique identifier to a list of other identifiers with their corresponding digital sequences.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 24

14. A method for managing data comprising:
dividing a digital sequence into a plurality of shorter digital sequences; and
producing probabilistically unique identifiers for each said plurality of shorter digital sequences; and
comparing said probabilistically unique identifiers to a list of other identifiers.
Dependent claims: 15, 16, 17, 18, 19, 20, 21, 22, 23
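
The three steps of claim 14 (divide, identify, compare) can be sketched in Python as follows. This is an illustration under stated assumptions, not the claimed method itself: the claim does not say how the sequence is divided, so the fixed-size split here is an assumption, and the names `split_fixed`, `new_pieces`, and `known` are hypothetical. SHA-1 stands in for the identifier-producing algorithm.

```python
import hashlib


def split_fixed(data: bytes, size: int) -> list:
    # Assumption: divide into fixed-size pieces; the last piece may be shorter.
    return [data[i:i + size] for i in range(0, len(data), size)]


def new_pieces(data: bytes, known: set, size: int = 4) -> list:
    """Return only the pieces whose identifiers are not already in `known`."""
    fresh = []
    for piece in split_fixed(data, size):
        # Produce an identifier for each shorter sequence.
        ident = hashlib.sha1(piece).hexdigest()
        # Compare the identifier against the list of identifiers seen so far.
        if ident not in known:
            known.add(ident)
            fresh.append(piece)
    return fresh


known = set()
fresh = new_pieces(b"abcdabcdwxyz", known)
```

With a piece size of 4, the input divides into `abcd`, `abcd`, and `wxyz`; the second `abcd` matches an identifier already in `known`, so only two pieces are reported as new.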

25. A computer program product comprising:
a computer usable medium having computer readable code embodied therein for managing data, said computer program product comprising;
computer readable program code devices configured to cause a computer to effect producing a probabilistically unique identifier for a digital sequence; and
computer readable program code devices configured to cause a computer to effect comparing said probabilistically unique identifier to a list of other identifiers corresponding to other digital sequences.
Dependent claims: 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38

39. A method for managing data comprising:
producing a probabilistically unique identifier for a digital sequence; and
comparing said probabilistically unique identifier to a list of other identifiers corresponding to other digital sequences.
Dependent claims: 40

Specification