Selecting a store for deduplicated data
Abstract
A technique includes communicating a plurality of hashes associated with chunks of an object to at least some stores of a plurality of stores on which the object is distributed; and in response to the communication, receiving responses indicating a distribution of the associated chunks. The technique includes selecting one of the stores based at least in part on the responses and communicating deduplicated data associated with the object to the selected store.
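The store-selection step can be sketched as follows. This is a minimal illustration only, not the patented implementation: the fixed-size chunking, the SHA-256 hash, the every-second-hash sampling rule, and the names `chunk_hashes`, `count_matches`, and `select_store` are all assumptions made for the example. The claims state only that count values are compared to a threshold; picking the highest count that meets the threshold is one possible reading.

```python
import hashlib
from typing import Dict, List, Optional, Set


def chunk_hashes(data: bytes, chunk_size: int = 4) -> List[str]:
    """Split data into fixed-size chunks and hash each one (SHA-256 is an assumption)."""
    return [
        hashlib.sha256(data[i:i + chunk_size]).hexdigest()
        for i in range(0, len(data), chunk_size)
    ]


def count_matches(sample: List[str], stored_hashes: Set[str]) -> int:
    """Store side: count how many of the sampled hashes the store already holds."""
    return sum(1 for h in sample if h in stored_hashes)


def select_store(counts: Dict[str, int], threshold: int) -> Optional[str]:
    """Return the store with the highest count if it meets the threshold, else None."""
    if not counts:
        return None
    best = max(counts, key=counts.get)
    return best if counts[best] >= threshold else None


# Six 4-byte chunks; the sample is every second hash (an assumed sampling rule).
hashes = chunk_hashes(b"aaaabbbbccccddddeeeeffff")
sample = hashes[::2]

# Hypothetical stores, each represented by the set of hashes it already holds.
stores = {
    "store_a": {hashes[0]},
    "store_b": {hashes[0], hashes[2], hashes[4]},
}

# Each store reports a count of matched hashes; the client compares them to a threshold.
counts = {name: count_matches(sample, held) for name, held in stores.items()}
selected = select_store(counts, threshold=2)
```

Here `store_b` already holds all three sampled hashes, so it is selected. When no store reaches the threshold, `select_store` returns `None`, and the caller would need some fallback policy, which this sketch does not model.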
14 Claims
1. A method comprising:
- communicating, by a processor, a subset of a plurality of hashes associated with chunks of an object to each of a plurality of stores, wherein each of the stores is to match the subset of hashes to a list of hashes of data stored in the store and generate a count value of matched hashes;
- in response to the communication, receiving, by the processor, the count value of matched hashes from each of the stores;
- selecting, by the processor, one of the stores to store deduplicated data associated with the object based on a comparison of the count values of matched hashes received from the stores to a threshold; and
- performing, by the processor, a deduplication process with the selected store using remaining hashes of the plurality of hashes to identify the deduplicated data associated with the object; and
- communicating, by the processor, the deduplicated data associated with the object to the selected store for storage.
Dependent claims: 2, 3, 4, 5

6. A system comprising:
- a plurality of team members to store a distributed teamed object; and
- a client server comprising at least one central processing unit and a memory comprising instructions that, when executed by the at least one central processing unit, are to cause the client server to:
  - communicate a subset of hashes associated with chunks of the object to each of the plurality of team members, wherein each of the plurality of team members is to match the subset of hashes to a list of hashes of data stored in the team member and generate a count value of matched hashes;
  - in response to the communication, receive the count value of matched hashes from each team member of the plurality of team members;
  - select a team member of the plurality of team members to store deduplicated data associated with the object based on a comparison of the count values of matched hashes to a threshold;
  - perform a deduplication process with the selected team member using remaining hashes of the plurality of hashes to identify the deduplicated data associated with the object; and
  - communicate the deduplicated data associated with the object to the selected team member for storage.
Dependent claims: 7, 8, 9

10. A non-transitory computer readable storage medium storing instructions that when executed by a computer cause the computer to:
- communicate a subset of hashes associated with chunks of an object to each of a plurality of stores, wherein each of the plurality of stores is to match the subset of hashes to a list of hashes of data stored in the store and generate a count value of matched hashes;
- in response to the communication, receive the count value of matched hashes from each of the plurality of stores;
- select one of the stores to store deduplicated data associated with the object based at least in part on a comparison of the count values of matched hashes to a threshold;
- perform a deduplication process with the selected store using remaining hashes of the plurality of hashes to identify the deduplicated data associated with the object; and
- communicate the deduplicated data associated with the object to the selected store for storage.
Dependent claims: 11, 12, 13, 14

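The final two steps recited in each independent claim (deduplicating against the selected store, then communicating only the deduplicated data) might look like the sketch below. It is an illustration under stated assumptions, not the claimed implementation: `TeamMember`, `missing`, `store`, and `deduplicate_and_send` are hypothetical names, the store is a plain in-memory dictionary, and SHA-256 is an assumed hash. The chunks passed in stand for those whose hashes were not part of the initial sample.

```python
import hashlib
from typing import Dict, List


class TeamMember:
    """Hypothetical in-memory store that keeps chunks keyed by their hash."""

    def __init__(self) -> None:
        self.chunks: Dict[str, bytes] = {}

    def missing(self, hashes: List[str]) -> List[str]:
        """Return the hashes this member does not hold yet."""
        return [h for h in hashes if h not in self.chunks]

    def store(self, new_chunks: Dict[str, bytes]) -> None:
        """Persist new chunks (here: just add them to the dictionary)."""
        self.chunks.update(new_chunks)


def deduplicate_and_send(chunks: List[bytes], member: TeamMember) -> int:
    """Send only the chunks the selected member lacks; return how many were sent."""
    # Hash every remaining chunk; identical chunks collapse onto one key.
    by_hash = {hashlib.sha256(c).hexdigest(): c for c in chunks}
    # Ask the selected member which of those hashes are new to it.
    needed = member.missing(list(by_hash))
    # Communicate only the deduplicated data.
    member.store({h: by_hash[h] for h in needed})
    return len(needed)


member = TeamMember()
member.store({hashlib.sha256(b"aaaa").hexdigest(): b"aaaa"})
sent = deduplicate_and_send([b"aaaa", b"bbbb", b"cccc", b"bbbb"], member)
```

In this run the member already holds `b"aaaa"` and the input repeats `b"bbbb"`, so only two chunks are transferred. A real exchange would go over a network and would likely batch the hash queries, which this sketch leaves out.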
Specification