Accessing a large data object in a dispersed storage network
Abstract
A method begins by a dispersed storage (DS) processing module generating a data object identifier for data to be stored in a dispersed storage network (DSN) and partitioning the data into a plurality of data partitions based on a set of retrieval preferences and data boundary information. For a data partition, the method continues with the DS processing module dispersed storage error encoding the data partition to produce a plurality of sets of encoded data slices and generating a plurality of sets of DSN addresses for the plurality of sets of encoded data slices, wherein a DSN address of the plurality of sets of DSN addresses includes a representation of the data object identifier, a representation of one or more retrieval preferences of the set of retrieval preferences, a representation of a corresponding portion of the data boundary information, and dispersed storage addressing information.
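The four-part DSN address described in the abstract can be illustrated with a minimal sketch. The field widths, the use of SHA-256 truncation, and the names `DsnAddress`, `_field`, and `address_sets` are assumptions made for illustration only; the source does not specify how each representation is derived or laid out.

```python
import hashlib
from dataclasses import dataclass


def _field(value: str, size: int) -> bytes:
    """Hash an arbitrary string down to a fixed-width address field (assumed scheme)."""
    return hashlib.sha256(value.encode("utf-8")).digest()[:size]


@dataclass(frozen=True)
class DsnAddress:
    object_field: bytes      # representation of the data object identifier
    preference_field: bytes  # representation of the retrieval preference(s)
    boundary_field: bytes    # representation of this partition's boundary info
    segment: int             # dispersed storage addressing info: segment number
    pillar: int              # dispersed storage addressing info: slice index

    def to_bytes(self) -> bytes:
        """Concatenate the fields into one 21-byte address (hypothetical layout)."""
        return (
            self.object_field
            + self.preference_field
            + self.boundary_field
            + self.segment.to_bytes(4, "big")
            + self.pillar.to_bytes(1, "big")
        )


def address_sets(object_id: str, preference: str, boundary: str,
                 num_segments: int, width: int) -> list[list[DsnAddress]]:
    """Return one set of addresses per segment, one address per encoded slice."""
    obj = _field(object_id, 8)
    pref = _field(preference, 4)
    bnd = _field(boundary, 4)
    return [
        [DsnAddress(obj, pref, bnd, seg, pillar) for pillar in range(width)]
        for seg in range(num_segments)
    ]
```

Because every field is derived deterministically from its input, calling `address_sets` again with the same object identifier, preference, and boundary reproduces the same addresses, which is the property the retrieval claims rely on.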
28 Claims
1. A method comprises:
  generating a data object identifier for data to be stored in a dispersed storage network (DSN);
  generating data boundary information based on the data and one or more retrieval preferences, wherein the one or more retrieval preferences facilitates searching for the data, or a partition thereof, once the data is stored in the DSN and wherein the one or more retrieval preferences includes information regarding the data, or the partition thereof;
  partitioning the data into a plurality of data partitions based on the data boundary information;
  for a data partition of the plurality of data partitions:
    segmenting the data partition into a plurality of data segments;
    dispersed storage error encoding the plurality of data segments to produce a plurality of sets of encoded data slices;
    generating a plurality of sets of DSN addresses for the plurality of sets of encoded data slices, wherein a DSN address of the plurality of sets of DSN addresses includes:
      a representation of the data object identifier;
      a representation of the one or more retrieval preferences;
      a representation of a corresponding portion of the data boundary information; and
      dispersed storage addressing information, wherein the data partition is retrievable from storage by:
        obtaining the data object identifier via a file directory access; and
        calculating the plurality of sets of DSN addresses based on the data object identifier and the one or more retrieval preferences.
Dependent claims: 2, 3, 4, 5, 6, 7
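The partitioning and segmenting steps of claim 1 can be sketched as follows. This assumes, purely for illustration, that the retrieval preference is "by record" and that a delimiter byte string marks record ends, so the data boundary information is a list of byte ranges. The function names are hypothetical, and the error encoding step is omitted.

```python
def boundaries_for(data: bytes, delimiter: bytes) -> list[tuple[int, int]]:
    """Derive boundary information: one (start, end) byte range per partition.

    Each partition ends just after a delimiter, or at the end of the data.
    """
    if not delimiter:
        raise ValueError("delimiter must be non-empty")
    bounds = []
    start = 0
    while start < len(data):
        hit = data.find(delimiter, start)
        end = len(data) if hit == -1 else hit + len(delimiter)
        bounds.append((start, end))
        start = end
    return bounds


def partition(data: bytes, bounds: list[tuple[int, int]]) -> list[bytes]:
    """Split the data into partitions according to the boundary information."""
    return [data[start:end] for start, end in bounds]


def segment(part: bytes, segment_size: int) -> list[bytes]:
    """Split one partition into fixed-size segments; the last may be shorter."""
    return [part[i:i + segment_size] for i in range(0, len(part), segment_size)]
```

For example, `boundaries_for(b"alpha\nbravo\ncharlie", b"\n")` yields three byte ranges, and each resulting partition can then be segmented independently before encoding.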
8. A method comprises:
  receiving a data object identifier via a file directory access, wherein the data object identifier identifies data stored in a dispersed storage network (DSN);
  receiving a retrieval preference to retrieve a data portion of the data, wherein the retrieval preference facilitates searching for the data portion and wherein the retrieval preferences include information regarding the data portion;
  determining corresponding data boundary information based on the retrieval preference;
  calculating a plurality of sets of DSN addresses based on the data object identifier, the retrieval preference, and the corresponding data boundary information;
  retrieving a plurality of sets of encoded data slices from the DSN based on the plurality of sets of DSN addresses;
  decoding the plurality of sets of encoded data slices to produce a plurality of sets of data segments; and
  reconstructing the data portion from the plurality of data segments.
Dependent claims: 9, 10, 11, 12, 13, 14
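The retrieval flow of claim 8 can be sketched against an in-memory dictionary standing in for the DSN. Everything here is an assumption for illustration: the hashed address format, the `boundary_index` mapping from a retrieval preference to a byte range, and the byte-striping `encode`/`decode` pair, which is only a stand-in with no redundancy and is not a real dispersed storage error code.

```python
import hashlib

WIDTH = 4          # slices per segment (assumed)
SEGMENT_SIZE = 8   # bytes per segment (assumed)


def slice_address(object_id, preference, boundary, segment, pillar):
    """Hypothetical address: hash of the three representations plus addressing info."""
    start, end = boundary
    key = f"{object_id}|{preference}|{start}-{end}|{segment}|{pillar}"
    return hashlib.sha256(key.encode("utf-8")).hexdigest()


def encode(segment: bytes) -> list[bytes]:
    """Stand-in for error encoding: stripe bytes across WIDTH slices."""
    return [segment[p::WIDTH] for p in range(WIDTH)]


def decode(slices: list[bytes]) -> bytes:
    """Inverse of the striping stand-in: interleave the slices back together."""
    out = bytearray(sum(len(s) for s in slices))
    for p, s in enumerate(slices):
        out[p::WIDTH] = s
    return bytes(out)


def store(dsn: dict, object_id: str, preference: str, boundary, part: bytes) -> None:
    """Write path, included only so the read path below has something to fetch."""
    segs = [part[i:i + SEGMENT_SIZE] for i in range(0, len(part), SEGMENT_SIZE)]
    for seg_no, seg in enumerate(segs):
        for pillar, sl in enumerate(encode(seg)):
            dsn[slice_address(object_id, preference, boundary, seg_no, pillar)] = sl


def retrieve(dsn: dict, object_id: str, preference: str, boundary_index: dict) -> bytes:
    """Read path: preference -> boundary info -> addresses -> slices -> data portion."""
    boundary = boundary_index[preference]              # determine boundary information
    start, end = boundary
    num_segments = -(-(end - start) // SEGMENT_SIZE)   # ceiling division
    segments = []
    for seg_no in range(num_segments):
        addrs = [slice_address(object_id, preference, boundary, seg_no, p)
                 for p in range(WIDTH)]                # calculate one set of addresses
        segments.append(decode([dsn[a] for a in addrs]))
    return b"".join(segments)
```

In this sketch the only directory lookup is the one that yields the object identifier; the slice addresses for the requested portion are recomputed from the identifier, the preference, and the boundary range rather than being looked up per slice.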
15. A dispersed storage (DS) module comprises:
  a first module, when operable within a computing device, causes the computing device to:
    generate a data object identifier for data to be stored in a dispersed storage network (DSN);
  a second module, when operable within the computing device, causes the computing device to:
    generate data boundary information based on the data and one or more retrieval preferences, wherein the one or more retrieval preferences facilitates searching for the data, or a partition thereof, once the data is stored in the DSN and wherein the one or more retrieval preferences includes information regarding the data, or the partition thereof; and
    partition the data into a plurality of data partitions based on the data boundary information;
  a third module, when operable within the computing device, causes the computing device to:
    for a data partition of the plurality of data partitions:
      segment the data partition into a plurality of data segments;
      dispersed storage error encode the plurality of data segments to produce a plurality of sets of encoded data slices; and
  a fourth module, when operable within the computing device, causes the computing device to:
    for a data partition of the plurality of data partitions, generate a plurality of sets of DSN addresses for the plurality of sets of encoded data slices, wherein a DSN address of the plurality of sets of DSN addresses includes:
      a representation of the data object identifier;
      a representation of the one or more retrieval preferences;
      a representation of a corresponding portion of the data boundary information; and
      dispersed storage addressing information, wherein the data partition is retrievable from storage by:
        obtaining the data object identifier via a file directory access; and
        calculating the plurality of sets of DSN addresses based on the data object identifier and the one or more retrieval preferences.
Dependent claims: 16, 17, 18, 19, 20, 21
22. A dispersed storage (DS) module comprises:
  a first module, when operable within a computing device, causes the computing device to:
    receive a data object identifier via a file directory access, wherein the data object identifier identifies data stored in a dispersed storage network (DSN); and
    receive a retrieval preference to retrieve a data portion of the data, wherein the retrieval preference facilitates searching for the data portion and wherein the retrieval preferences include information regarding the data portion;
  a second module, when operable within the computing device, causes the computing device to:
    determine corresponding data boundary information based on the retrieval preference;
  a third module, when operable within the computing device, causes the computing device to:
    calculate a plurality of sets of DSN addresses based on the data object identifier, the retrieval preference, and the corresponding data boundary information; and
  a fourth module, when operable within the computing device, causes the computing device to:
    retrieve a plurality of sets of encoded data slices from the DSN based on the plurality of sets of DSN addresses;
    decode the plurality of sets of encoded data slices to produce a plurality of sets of data segments; and
    reconstruct the data portion from the plurality of data segments.
Dependent claims: 23, 24, 25, 26, 27, 28
Specification