Updating de-duplication tracking data for a dispersed storage network
First Claim
1. A method for execution by one or more processing modules of one or more computing devices of a dispersed storage network (DSN), the method comprises:
- determining whether a change has occurred to a data object of a set of data objects, wherein each data object of the set of data objects has substantially identical data to other data objects of the set of data objects and wherein only one plurality of sets of encoded data slices corresponding to one data object of the set of data objects is stored in memory of the DSN; and
- when a change has occurred:
  - accessing deduplication tracking data for the set of data objects, wherein the deduplication tracking data includes a representation of the substantially identical data, addressing information for the only one plurality of sets of encoded data slices, and linking information that links the set of data objects to the addressing information, wherein the linking information includes the number of copies of data objects having substantially identical data;
  - when the change is deletion of an identified data object of the set of data objects:
    - determining whether the identified data object is the only data object in the set of data objects; and
    - when the identified data object is not the only data object in the set of data objects, updating the linking information to delete linking the identified data object to the addressing information.
Abstract
A method begins by a dispersed storage (DS) processing module of a dispersed storage network (DSN) determining whether a change has occurred to a data object of a set of data objects. When a change has occurred, the method continues with the DS processing module accessing de-duplication tracking data for the set of data objects. When the change is deletion of an identified data object of the set of data objects, the method continues with the DS processing module determining whether the identified data object is the only data object in the set of data objects. When the identified data object is not the only data object in the set of data objects, the method continues with the DS processing module updating the linking information to delete linking the identified data object to addressing information.
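The deletion path described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the patented implementation: the names `DedupTrackingData`, `handle_deletion`, the field names, and the string return values are all hypothetical, and the linking information is assumed to be a set of object identifiers whose size gives the number of copies. The excerpt does not say what happens when the identified data object is the only one in the set, so the sketch only reports that case and leaves the record unchanged.

```python
from dataclasses import dataclass, field
from typing import List, Set


@dataclass
class DedupTrackingData:
    """Hypothetical tracking record for one set of substantially identical data objects."""
    data_representation: str      # representation of the shared data (assumed: a digest string)
    slice_addresses: List[str]    # addressing information for the single stored copy of the slices
    linked_objects: Set[str] = field(default_factory=set)  # linking information (assumed: object IDs)

    @property
    def copy_count(self) -> int:
        # Number of data objects currently linked to the stored slices.
        return len(self.linked_objects)


def handle_deletion(tracking: DedupTrackingData, object_id: str) -> str:
    """Handle deletion of one identified data object from its de-duplicated set."""
    if object_id not in tracking.linked_objects:
        raise KeyError(f"{object_id} is not linked in this set")
    if tracking.copy_count == 1:
        # The identified object is the only one in the set. Handling of this
        # case is outside the excerpt, so nothing is modified here.
        return "only_object"
    # Other objects still reference the stored slices: remove only the link.
    tracking.linked_objects.discard(object_id)
    return "unlinked"
```

In this sketch, deleting one of several linked objects changes only the linking information; the addressing information and the stored encoded data slices are left untouched because other data objects still rely on them.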
18 Claims
1. A method for execution by one or more processing modules of one or more computing devices of a dispersed storage network (DSN), the method comprises:
- determining whether a change has occurred to a data object of a set of data objects, wherein each data object of the set of data objects has substantially identical data to other data objects of the set of data objects and wherein only one plurality of sets of encoded data slices corresponding to one data object of the set of data objects is stored in memory of the DSN; and
- when a change has occurred:
  - accessing deduplication tracking data for the set of data objects, wherein the deduplication tracking data includes a representation of the substantially identical data, addressing information for the only one plurality of sets of encoded data slices, and linking information that links the set of data objects to the addressing information, wherein the linking information includes the number of copies of data objects having substantially identical data;
  - when the change is deletion of an identified data object of the set of data objects:
    - determining whether the identified data object is the only data object in the set of data objects; and
    - when the identified data object is not the only data object in the set of data objects, updating the linking information to delete linking the identified data object to the addressing information.

Dependent claims: 2, 3, 4, 5, 6
7. A dispersed storage (DS) processing module of a dispersed storage network (DSN), the DS module comprises:
- a first module, when operable within a computing device, causes the computing device to:
  - determine whether a change has occurred to a data object of a set of data objects, wherein each data object of the set of data objects has substantially identical data to other data objects of the set of data objects and wherein only one plurality of sets of encoded data slices corresponding to one data object of the set of data objects is stored in memory of the DSN; and
- a second module, when operable within the computing device, causes the computing device to:
  - when a change has occurred:
    - access deduplication tracking data for the set of data objects, wherein the deduplication tracking data includes a representation of the substantially identical data, addressing information for the only one plurality of sets of encoded data slices, and linking information that links the set of data objects to the addressing information, wherein the linking information includes the number of copies of data objects having substantially identical data;
    - when the change is deletion of an identified data object of the set of data objects:
      - determine whether the identified data object is the only data object in the set of data objects; and
      - when the identified data object is not the only data object in the set of data objects, update the linking information to delete linking the identified data object to the addressing information.

Dependent claims: 8, 9, 10, 11, 12
13. A computer readable storage medium comprises:
- at least one memory section that stores operational instructions that, when executed by one or more processing modules of one or more computing devices of a dispersed storage network (DSN), causes the one or more computing devices to:
  - determine whether a change has occurred to a data object of a set of data objects, wherein each data object of the set of data objects has substantially identical data to other data objects of the set of data objects and wherein only one plurality of sets of encoded data slices corresponding to one data object of the set of data objects is stored in memory of the DSN; and
  - when a change has occurred:
    - access deduplication tracking data for the set of data objects, wherein the deduplication tracking data includes a representation of the substantially identical data, addressing information for the only one plurality of sets of encoded data slices, and linking information that links the set of data objects to the addressing information, wherein the linking information includes the number of copies of data objects having substantially identical data;
    - when the change is deletion of an identified data object of the set of data objects:
      - determine whether the identified data object is the only data object in the set of data objects; and
      - when the identified data object is not the only data object in the set of data objects, update the linking information to delete linking the identified data object to the addressing information.

Dependent claims: 14, 15, 16, 17, 18
Specification