Managing deduplication in a data storage system using a Bloomier filter data dictionary
First Claim
1. A method for facilitating data deduplication in a data storage system, the method comprising:
- maintaining an index comprising a plurality of filters, each filter electronically mapping each of a plurality of hashes to an electronic data library storing a plurality of hash-to-storage mappings;
wherein the plurality of filters are categorized into layers, with a plurality of filters in each layer.
8 Assignments
0 Petitions
Accused Products
Abstract
A method including maintaining a library having a plurality of storage tablets, each storage tablet storing a plurality of hash-to-storage mappings, each mapping a hash value to a storage location at which a block of data is stored, the block of data translating to the hash value pursuant to a hashing algorithm. The method also including upon receipt and/or determination of a new hash for incoming data pursuant to the hashing algorithm: a) querying a tablet cache for a hash-to-storage mapping having the new hash, the tablet cache comprising a subset of storage tablets copied from the library; and/or b) querying a secondary index for a hash-to-storage tablet mapping having the new hash, the secondary index including a plurality of filters, each filter mapping each of a plurality of key hashes to a storage tablet of the library storing that particular key hash in a hash-to-storage mapping.
-
Citations
18 Claims
-
1. A method for facilitating data deduplication in a data storage system, the method comprising:
-
maintaining an index comprising a plurality of filters, each filter electronically mapping each of a plurality of hashes to an electronic data library storing a plurality of hash-to-storage mappings; wherein the plurality of filters are categorized into layers, with a plurality of filters in each layer. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. An information handling system comprising:
-
a processor; and a memory accessible to the processor and storing instructions that, when executed by the processor, cause the processor to maintain an index comprising a plurality of filters, each filter electronically mapping each of a plurality of hashes to an electronic data library storing a plurality of hash-to-storage mappings, wherein the plurality of filters are categorized into layers, with a plurality of filters in each layer. - View Dependent Claims (8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A method for data deduplication in a data storage system, the method comprising:
-
maintaining an index comprising a plurality of filters, each filter electronically mapping each of a plurality of hashes to an electronic data library storing a plurality of hash-to-storage mappings, wherein the plurality of filters are categorized into layers, with a plurality of filters in each layer; and upon at least one of receipt and determination of a new hash for incoming data, querying the index for a hash-to-storage tablet mapping comprising the new hash. - View Dependent Claims (18)
-
Specification