Accessing large amounts of data in a dispersed storage network
First Claim
1. A method for storing large amounts of data, the method comprises:
obtaining, by a processing module of a computing device, a plurality of data objects for storage in a dispersed storage network (DSN);
determining, by the processing module, that two data objects of the plurality of data objects have one or more common data object aspects wherein each of the two data objects includes a plurality of data segments;
disperse storage error encoding, by the processing module, the plurality of data segments of a first data object of the two data objects to produce a first plurality of sets of encoded data slices, wherein a data segment of the plurality of data segments is dispersed storage error encoded into a set of encoded data slices of the plurality of sets of encoded data slices and wherein a decode threshold number of encoded data slices of the set of encoded data slices is needed to recover the data segment;
generating, by the processing module, a first plurality of sets of DSN addresses for the first plurality of sets of encoded data slices, wherein DSN addresses of the first plurality of sets of DSN addresses include a field referencing the one or more common data object aspects;
disperse storage error encoding, by the processing module, the plurality of data segments of a second data object of the two data objects to produce a second plurality of sets of encoded data slices;
generating, by the processing module, a second plurality of sets of DSN addresses for the second plurality of sets of encoded data slices, wherein DSN addresses of the second plurality of sets of DSN addresses include the field referencing the one or more common data object aspects; and
outputting the first and second plurality of sets of encoded data slices for storage in the DSN based on the first and second plurality of sets of DSN addresses.
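The "decode threshold" limitation can be illustrated with a deliberately simple stand-in. The sketch below is not the dispersed storage error encoding function of the claim; it swaps in a single XOR parity scheme (3 slices per segment, any 2 of which rebuild it) purely to show what it means for a threshold number of slices to be enough. The names `encode_segment`, `decode_segment`, `WIDTH`, and `DECODE_THRESHOLD` are hypothetical.

```python
from typing import Dict, List

WIDTH = 3             # slices produced per data segment (assumed for this sketch)
DECODE_THRESHOLD = 2  # any 2 of the 3 slices rebuild the segment


def _xor(left: bytes, right: bytes) -> bytes:
    """Byte-wise XOR of two equal-length byte strings."""
    return bytes(x ^ y for x, y in zip(left, right))


def encode_segment(segment: bytes) -> List[bytes]:
    """Split a segment into two halves and add one XOR parity slice."""
    padded = segment + b"\x00" * (len(segment) % 2)  # pad to an even length
    half = len(padded) // 2
    first, second = padded[:half], padded[half:]
    return [first, second, _xor(first, second)]


def decode_segment(slices: Dict[int, bytes], length: int) -> bytes:
    """Rebuild a segment from any DECODE_THRESHOLD slices, keyed by slice index."""
    if len(slices) < DECODE_THRESHOLD:
        raise ValueError("fewer than a decode threshold number of slices")
    if 0 in slices and 1 in slices:
        first, second = slices[0], slices[1]
    elif 0 in slices:
        first = slices[0]
        second = _xor(first, slices[2])  # second half = first XOR parity
    else:
        second = slices[1]
        first = _xor(second, slices[2])  # first half = second XOR parity
    return (first + second)[:length]     # drop the padding byte, if any


segment = b"dispersed"
encoded = encode_segment(segment)
# Slice 0 is lost; the remaining two slices still recover the segment.
recovered = decode_segment({1: encoded[1], 2: encoded[2]}, len(segment))
```

A production system would use a more general erasure code so that the width and threshold can be chosen independently; the parity scheme here only fixes the idea.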
Abstract
A method begins by a dispersed storage (DS) processing module obtaining a plurality of data objects for storage in a dispersed storage network (DSN) and determining one or more common data object aspects of a data object of the plurality of data objects. The method continues with the DS processing module disperse storage error encoding at least a portion of the data object to produce a set of encoded data slices and generating a set of DSN addresses for the set of encoded data slices, wherein each of the set of DSN addresses includes a field referencing the one or more common data object aspects. The method continues with the DS processing module outputting the set of encoded data slices for storage in the DSN based on the set of DSN addresses.
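As a rough illustration of the addressing step, the sketch below builds one set of addresses per data segment for two data objects that share a common aspect. The address layout (`aspect_field`, `object_id`, `segment`, `slice_index`) is an assumption made for this example; the text only states that each DSN address includes a field referencing the common data object aspects. The identifiers `DsnAddress`, `generate_addresses`, and the aspect value `"camera-7"` are hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class DsnAddress:
    aspect_field: str  # hypothetical field referencing the common aspects
    object_id: str     # hypothetical identifier of the data object
    segment: int       # index of the data segment within the object
    slice_index: int   # index of the encoded data slice within its set


def generate_addresses(
    aspect_field: str, object_id: str, segments: int, width: int
) -> List[List[DsnAddress]]:
    """Return one set of `width` addresses for each of `segments` data segments."""
    return [
        [DsnAddress(aspect_field, object_id, seg, idx) for idx in range(width)]
        for seg in range(segments)
    ]


shared = "camera-7"  # hypothetical common aspect of both objects
first = generate_addresses(shared, "object-A", segments=2, width=4)
second = generate_addresses(shared, "object-B", segments=3, width=4)
```

Because both pluralities of address sets carry the same `aspect_field` value, slices belonging to objects with a common aspect can be located through that field alone.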
18 Claims
7. A method for retrieving data objects having a common aspect, the method comprises:
selecting, by a processing module of a computing device, one or more common data object aspects from a plurality of common data object aspects to produce selected common data object aspects;
accessing, by the processing module, a common data object aspect database based on the selected common data object aspects to identify a set of records, wherein a record of the common data object aspect database includes a data object identifier of a data object, information regarding one or more common data object aspects of the data object, and a portions number indicating a number of portions constituting the data object;
generating, by the processing module, a first plurality of sets of dispersed storage network (DSN) addresses based on a first data object identifier, the information regarding the one or more common data object aspects, and the portions number of a first record of the set of records;
retrieving, by the processing module, at least a decode threshold number of encoded data slices of a first plurality of sets of encoded data slices from the DSN based on the first plurality of sets of DSN addresses;
decoding, by the processing module, the at least a decode threshold number of encoded data slices of the first plurality of sets of encoded data slices in accordance with a dispersed storage error encoding function to reproduce a plurality of portions of a first data object;
generating, by the processing module, a second plurality of sets of dispersed storage network (DSN) addresses based on a second data object identifier, the information regarding the one or more common data object aspects, and the portions number of a second record of the set of records;
retrieving, by the processing module, at least a decode threshold number of encoded data slices of a second plurality of sets of encoded data slices from the DSN based on the second plurality of sets of DSN addresses; and
decoding, by the processing module, the at least a decode threshold number of encoded data slices of the second plurality of sets of encoded data slices in accordance with a dispersed storage error encoding function to reproduce a plurality of portions of a second data object.
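The retrieval steps of claim 7 can be sketched as a lookup followed by address generation and a threshold-limited fetch. This is a minimal sketch under stated assumptions: the record layout, the way the aspect information is folded into an address, and the in-memory `store` standing in for the DSN are all hypothetical, as are the names `AspectRecord`, `find_records`, `addresses_for`, and `fetch_threshold`. Decoding is omitted.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, Iterable, List, Tuple

# Hypothetical address layout: (aspect field, object id, portion, slice index)
Address = Tuple[str, str, int, int]


@dataclass(frozen=True)
class AspectRecord:
    object_id: str           # data object identifier
    aspects: FrozenSet[str]  # information regarding the common aspects
    portions: int            # number of portions constituting the object


def find_records(db: Iterable[AspectRecord], selected: Iterable[str]) -> List[AspectRecord]:
    """Return the records whose aspects include every selected aspect."""
    wanted = frozenset(selected)
    return [record for record in db if wanted <= record.aspects]


def addresses_for(record: AspectRecord, width: int) -> List[List[Address]]:
    """Build one set of `width` addresses per portion of the record's object."""
    field = "|".join(sorted(record.aspects))  # assumed encoding of the aspect field
    return [
        [(field, record.object_id, portion, idx) for idx in range(width)]
        for portion in range(record.portions)
    ]


def fetch_threshold(
    store: Dict[Address, bytes], address_set: List[Address], threshold: int
) -> Dict[Address, bytes]:
    """Collect slices from one address set until `threshold` have been found."""
    found: Dict[Address, bytes] = {}
    for address in address_set:
        if address in store:
            found[address] = store[address]
            if len(found) == threshold:
                return found
    raise LookupError("fewer than a decode threshold number of slices available")


db = [
    AspectRecord("object-A", frozenset({"camera-7", "video"}), portions=2),
    AspectRecord("object-B", frozenset({"camera-7"}), portions=1),
    AspectRecord("object-C", frozenset({"camera-9"}), portions=1),
]
matches = find_records(db, {"camera-7"})
address_sets = addresses_for(matches[0], width=4)
# Stand-in for the DSN: slice 0 of the first portion is unavailable.
store = {address: b"slice" for address in address_sets[0][1:]}
slices = fetch_threshold(store, address_sets[0], threshold=3)
```

The fetch stops as soon as a threshold number of slices is in hand, which mirrors the claim's "at least a decode threshold number" wording: missing slices are tolerated as long as enough remain.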
10. A capturing unit comprises:
a first module, when operable within a computing device, causes the computing device to obtain a plurality of data objects for storage in a dispersed storage network (DSN);
a second module, when operable within the computing device, causes the computing device to determine that two data objects of the plurality of data objects have one or more common data object aspects wherein each of the two data objects includes a plurality of data segments;
a third module, when operable within the computing device, causes the computing device to:
disperse storage error encode the plurality of data segments of a first data object of the two data objects to produce a first plurality of sets of encoded data slices, wherein a data segment of the plurality of data segments is dispersed storage error encoded into a set of encoded data slices of the first plurality of sets of encoded data slices and wherein a decode threshold number of encoded data slices of the set of encoded data slices is needed to recover the data segment; and
disperse storage error encode the plurality of data segments of a second data object of the two data objects to produce a second plurality of sets of encoded data slices;
a fourth module, when operable within the computing device, causes the computing device to:
generate a first plurality of sets of DSN addresses for the first plurality of sets of encoded data slices, wherein DSN addresses of the first plurality of sets of DSN addresses include a field referencing the one or more common data object aspects; and
generate a second plurality of sets of DSN addresses for the second plurality of sets of encoded data slices, wherein DSN addresses of the second plurality of sets of DSN addresses include the field referencing the one or more common data object aspects; and
a fifth module, when operable within the computing device, causes the computing device to output the first and second plurality of sets of encoded data slices for storage in the DSN based on the first and second plurality of sets of DSN addresses.
16. A retrieving unit comprises:
a first module, when operable within a computing device, causes the computing device to select one or more common data object aspects from a plurality of common data object aspects to produce selected common data object aspects;
a second module, when operable within the computing device, causes the computing device to access a common data object aspect database based on the selected common data object aspects to identify a set of records, wherein a record of the common data object aspect database includes a data object identifier of a data object, information regarding one or more common data object aspects of the data object, and a portions number indicating a number of portions constituting the data object;
a third module, when operable within the computing device, causes the computing device to:
generate a first plurality of sets of dispersed storage network (DSN) addresses based on a first data object identifier, the information regarding the one or more common data object aspects, and the portions number of a first record of the set of records; and
generate a second plurality of sets of dispersed storage network (DSN) addresses based on a second data object identifier, the information regarding the one or more common data object aspects, and the portions number of a second record of the set of records;
a fourth module, when operable within the computing device, causes the computing device to:
retrieve at least a decode threshold number of encoded data slices of a first plurality of sets of encoded data slices from the DSN based on the first plurality of sets of DSN addresses; and
retrieve at least a decode threshold number of encoded data slices of a second plurality of sets of encoded data slices from the DSN based on the second plurality of sets of DSN addresses;
a fifth module, when operable within the computing device, causes the computing device to:
decode the at least a decode threshold number of encoded data slices of the first plurality of sets of encoded data slices in accordance with a dispersed storage error encoding function to reproduce a plurality of portions of a first data object; and
decode the at least a decode threshold number of encoded data slices of the second plurality of sets of encoded data slices in accordance with a dispersed storage error encoding function to reproduce a plurality of portions of a second data object.
Specification