DATA DE-DUPLICATION IN A DISPERSED STORAGE NETWORK UTILIZING DATA CHARACTERIZATION
Abstract
A method begins with a processing module receiving, from a requesting device, a data storage request that includes data for storage. The method continues with the processing module determining whether substantially identical data is currently stored in a dispersed storage network (DSN) memory. When the substantially identical data is not stored in the DSN memory, the method continues with the processing module encoding at least a portion of the data using an error coding dispersal storage function to produce a set of encoded data slices, sending the set of encoded data slices to the DSN memory for storage therein, and generating a unique retrieval matrix for the requesting device. The unique retrieval matrix identifies a sub-set of encoded data slices of the set of encoded data slices for subsequent retrieval of the at least a portion of the data.
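The storage flow described in the abstract can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and is not taken from the patent: the SHA-256 fingerprint used as the "substantially identical" test, the toy 2-of-3 XOR parity code standing in for the error coding dispersal storage function, the in-memory dictionaries standing in for DSN memory, and the way the retrieval matrix is derived from the device identity. The duplicate branch simply skips the write, since this section does not state what happens in that case.

```python
import hashlib

WIDTH = 3         # hypothetical: slices produced per set
THRESHOLD = 2     # hypothetical: slices needed to rebuild a segment
SEGMENT_SIZE = 4  # hypothetical: bytes per segment (even, for the toy code)

dsn_memory = {}            # (fingerprint, segment index, pillar) -> slice bytes
stored_fingerprints = set()
retrieval_matrices = {}    # (device id, fingerprint) -> matrix


def encode_segment(segment):
    """Toy 2-of-3 erasure code (assumption): two halves plus XOR parity."""
    half = SEGMENT_SIZE // 2
    segment = segment.ljust(SEGMENT_SIZE, b"\0")
    a, b = segment[:half], segment[half:]
    return [a, b, bytes(x ^ y for x, y in zip(a, b))]


def make_retrieval_matrix(device_id, fingerprint, num_segments):
    """One row per segment; a 1 marks a slice this device should fetch.

    Deriving the rows from a hash of the device id is an assumption; in a
    toy this small, two devices can still end up with the same matrix.
    """
    matrix = []
    for seg in range(num_segments):
        seed = f"{device_id}:{fingerprint}:{seg}".encode()
        skipped = hashlib.sha256(seed).digest()[0] % WIDTH
        # Dropping one pillar leaves THRESHOLD slices because WIDTH - THRESHOLD == 1.
        matrix.append([0 if p == skipped else 1 for p in range(WIDTH)])
    return matrix


def handle_store_request(device_id, data):
    """Return (fingerprint, stored), where stored is False for a duplicate."""
    fingerprint = hashlib.sha256(data).hexdigest()
    if fingerprint in stored_fingerprints:
        return fingerprint, False  # identical data already present: nothing is written
    segments = [data[i:i + SEGMENT_SIZE] for i in range(0, len(data), SEGMENT_SIZE)]
    for seg_idx, segment in enumerate(segments):
        for pillar, slice_bytes in enumerate(encode_segment(segment)):
            dsn_memory[(fingerprint, seg_idx, pillar)] = slice_bytes
    stored_fingerprints.add(fingerprint)
    retrieval_matrices[(device_id, fingerprint)] = make_retrieval_matrix(
        device_id, fingerprint, len(segments)
    )
    return fingerprint, True
```

Calling `handle_store_request("device-A", payload)` twice with the same payload writes slices only on the first call. A production system would use a real dispersal code such as Reed-Solomon rather than single parity.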
20 Claims
1. A method comprises:
receiving, from a requesting device, a data storage request that includes data for storage;
determining whether substantially identical data is currently stored in a dispersed storage network (DSN) memory; and
when the substantially identical data is not stored in the DSN memory:
encoding at least a portion of the data using an error coding dispersal storage function to produce a set of encoded data slices;
sending the set of encoded data slices to the DSN memory for storage therein; and
generating a unique retrieval matrix for the requesting device, wherein the unique retrieval matrix identifies a sub-set of encoded data slices of the set of encoded data slices for subsequent retrieval of the at least a portion of the data.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A method comprises:
receiving, from a requesting device, a read request for data stored as a plurality of sets of encoded data slices in a dispersed storage network (DSN) memory;
obtaining a unique retrieval matrix based on an identity of the requesting device;
identifying a sub-set of encoded data slices for a set of the plurality of sets of the encoded data slices based on the unique retrieval matrix; and
when the sub-set of encoded data slices has been retrieved from the DSN memory:
sending the sub-set of encoded data slices to the requesting device; or
decoding the sub-set of the encoded data slices to produce a decoded data segment and sending the decoded data segment to the requesting device.
Dependent claims: 9, 10
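The read flow of claim 8 can be sketched in the same spirit. This is a hedged illustration under stated assumptions, not the patented implementation: the 2-of-3 XOR parity code, the `dsn_memory` and `retrieval_matrices` dictionaries, the device names, and the `decode` flag that chooses between the two alternatives in the claim are all hypothetical.

```python
WIDTH, THRESHOLD = 3, 2  # hypothetical toy code: any 2 of 3 slices rebuild a segment


def xor(a, b):
    return bytes(x ^ y for x, y in zip(a, b))


def decode_segment(slices):
    """Rebuild a segment from a dict {pillar: bytes} holding any 2 of the 3 slices."""
    if 0 in slices and 1 in slices:
        a, b = slices[0], slices[1]
    elif 0 in slices:
        a = slices[0]
        b = xor(a, slices[2])  # parity XOR first half gives second half
    else:
        b = slices[1]
        a = xor(b, slices[2])  # parity XOR second half gives first half
    return a + b


# Hypothetical stored state: two sets of slices, keyed by (set index, pillar).
dsn_memory = {
    (0, 0): b"he", (0, 1): b"ll", (0, 2): xor(b"he", b"ll"),
    (1, 0): b"o!", (1, 1): b"!!", (1, 2): xor(b"o!", b"!!"),
}

# Hypothetical per-device matrices: one row per set, one column per pillar.
retrieval_matrices = {
    "device-A": [[1, 0, 1], [0, 1, 1]],
    "device-B": [[1, 1, 0], [1, 0, 1]],
}


def handle_read_request(device_id, decode=True):
    """Return decoded data, or the raw slice sub-sets when decode is False."""
    matrix = retrieval_matrices[device_id]  # looked up from the requester's identity
    results = []
    for set_idx, row in enumerate(matrix):
        wanted = [p for p, flag in enumerate(row) if flag]
        subset = {p: dsn_memory[(set_idx, p)] for p in wanted if (set_idx, p) in dsn_memory}
        if len(subset) < len(wanted):
            raise LookupError(f"set {set_idx}: sub-set not fully retrieved")
        results.append(decode_segment(subset) if decode else subset)
    return b"".join(results) if decode else results
```

Both devices recover the same bytes even though each one fetches a different sub-set of slices, which is the property the per-device matrix is meant to show.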
11. A computer comprises:
an interface; and
a processing module operable to:
receive, from a requesting device via the interface, a data storage request that includes data for storage;
determine whether substantially identical data is currently stored in a dispersed storage network (DSN) memory; and
when the substantially identical data is not stored in the DSN memory:
encode at least a portion of the data using an error coding dispersal storage function to produce a set of encoded data slices;
send, via the interface, the set of encoded data slices to the DSN memory for storage therein; and
generate a unique retrieval matrix for the requesting device, wherein the unique retrieval matrix identifies a sub-set of encoded data slices of the set of encoded data slices for subsequent retrieval of the at least a portion of the data.
Dependent claims: 12, 13, 14, 15, 16, 17
18. A computer comprises:
an interface; and
a processing module operable to:
receive, from a requesting device via the interface, a read request for data stored as a plurality of sets of encoded data slices in a dispersed storage network (DSN) memory;
obtain a unique retrieval matrix based on an identity of the requesting device;
identify a sub-set of encoded data slices for a set of the plurality of sets of the encoded data slices based on the unique retrieval matrix; and
when the sub-set of encoded data slices has been retrieved from the DSN memory:
send, via the interface, the sub-set of encoded data slices to the requesting device; or
decode the sub-set of the encoded data slices to produce a decoded data segment and send, via the interface, the decoded data segment to the requesting device.
Dependent claims: 19, 20
Specification