SYSTEM AND METHOD FOR CACHING HASHES FOR CO-LOCATED DATA IN A DEDUPLICATION DATA STORE
First Claim
1. A computerized method for caching hashes for deduplicated data in a deduplication data store, in which data is stored using a persist header comprising a set of hashes, wherein each hash in the set of hashes represents data stored in the deduplication data store after the persist header that is co-located with other data represented by the remaining hashes in the set of hashes, the computerized method comprising:
- receiving, by a computing device, a request to read data from the deduplication data store;
identifying, by the computing device, in a first hash structure that is not stored in memory of the computing device, a persist header stored in a deduplication data store, wherein;
the persist header comprises a set of hashes that includes a hash that is indicative of the data the computing device requested to read; and
wherein each hash in the set of hashes represents data stored in the deduplication data store after the persist header that is co-located with other data represented by the remaining hashes in the set of hashes; and
caching, by the computing device, the set of hashes in a second hash structure stored in the memory of the computing device, whereby if the computing device requests to read additional data, the computing device can identify the additional data using the second hash structure if the additional data is represented by the persist header.
8 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods are provided for caching hashes for deduplicated data. A request to read data from the deduplication data store is received. A persist header stored in a deduplication data store is identified in a first hash structure that is not stored in memory of the computing device. The persist header comprises a set of hashes that includes a hash that is indicative of the data the computing device requested to read. Each hash in the set of hashes represents data stored in the deduplication data store after the persist header that is co-located with other data represented by the remaining hashes in the set of hashes. The set of hashes is cached in a second hash structure stored in the memory, whereby the computing device can identify the additional data using the second hash structure if the additional data is represented by the persist header.
-
Citations
11 Claims
-
1. A computerized method for caching hashes for deduplicated data in a deduplication data store, in which data is stored using a persist header comprising a set of hashes, wherein each hash in the set of hashes represents data stored in the deduplication data store after the persist header that is co-located with other data represented by the remaining hashes in the set of hashes, the computerized method comprising:
-
receiving, by a computing device, a request to read data from the deduplication data store; identifying, by the computing device, in a first hash structure that is not stored in memory of the computing device, a persist header stored in a deduplication data store, wherein; the persist header comprises a set of hashes that includes a hash that is indicative of the data the computing device requested to read; and wherein each hash in the set of hashes represents data stored in the deduplication data store after the persist header that is co-located with other data represented by the remaining hashes in the set of hashes; and caching, by the computing device, the set of hashes in a second hash structure stored in the memory of the computing device, whereby if the computing device requests to read additional data, the computing device can identify the additional data using the second hash structure if the additional data is represented by the persist header. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computing device for caching hashes for deduplicated data in a deduplication data store, in which data is stored using a persist header comprising a set of hashes, wherein each hash in the set of hashes represents data stored in the deduplication data store after the persist header that is co-located with other data represented by the remaining hashes in the set of hashes, the computing device comprising:
-
a deduplication data store; and a processor in communication with the deduplication data store, and configured to run a module stored in memory that is configured to cause the processor to; receive a request to read data from the deduplication data store; identify in a first hash structure that is not stored in memory of the computing device, a persist header stored in a deduplication data store, wherein; the persist header comprises a set of hashes that includes a hash that is indicative of the data the computing device requested to read; and wherein each hash in the set of hashes represents data stored in the deduplication data store after the persist header that is co-located with other data represented by the remaining hashes in the set of hashes; and cache the set of hashes in a second hash structure stored in the memory of the computing device, whereby if the computing device requests to read additional data, the computing device can identify the additional data using the second hash structure if the additional data is represented by the persist header.
-
-
11. A non-transitory computer readable medium having executable instructions operable to cause an apparatus to:
-
receive a request to read data from a deduplication data store; identify in a first hash structure that is not stored in memory of the computing device, a persist header stored in a deduplication data store, wherein; the persist header comprises a set of hashes that includes a hash that is indicative of the data the computing device requested to read; and wherein each hash in the set of hashes represents data stored in the deduplication data store after the persist header that is co-located with other data represented by the remaining hashes in the set of hashes; and cache the set of hashes in a second hash structure stored in the memory of the computing device, whereby if the computing device requests to read additional data, the computing device can identify the additional data using the second hash structure if the additional data is represented by the persist header.
-
Specification