Consistent deduplicated snapshot generation for a distributed database using optimistic deduplication
First Claim
1. A method of generating a consistent deduplicated snapshot of a distributed database using optimistic deduplication, the method comprising:
for each node of a plurality of nodes in the distributed database, deduplicating data items that are identified as being stored on the node and owned by the node, and generating a summary for the node, the summary describing a file in which the data items that are identified as being stored on the node are located;
identifying from the summaries for each of the nodes whether one or more mistakes occurred during deduplication;
upon identifying the one or more mistakes, determining one or more corrections for the one or more mistakes; and
generating a consistent deduplicated snapshot for the distributed database comprising the deduplicated data items from each node and the one or more corrections.
Abstract
Embodiments disclosed herein provide systems, methods, and computer readable media for generating a consistent deduplicated snapshot of a distributed database using optimistic deduplication. In a particular embodiment, a method provides, for each node of a plurality of nodes in the distributed database, deduplicating data items stored on the node that are owned by the node and generating a summary that describes a file in which the data items are located. The method further provides identifying from the summaries for each of the nodes whether mistakes occurred during deduplication and, upon identifying one or more mistakes, determining one or more corrections for the one or more mistakes. Also, the method provides generating a consistent deduplicated snapshot for the distributed database comprising the deduplicated data items from each node and the one or more corrections.
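The flow described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the claimed implementation: the `Node`, `Summary`, `deduplicate`, `find_mistakes`, and `build_snapshot` names are hypothetical, each node is assumed to record which node it believes owns each item, and a "mistake" is assumed to be either an item that no node kept or an item that more than one node kept.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    name: str
    # key -> (value, name of the node that this node believes owns the key)
    items: dict = field(default_factory=dict)


@dataclass
class Summary:
    node: str
    stored_keys: set  # every key found in the node's file
    kept_keys: set    # keys the node kept because it believed it owned them


def deduplicate(node):
    """Optimistically keep only the items the node believes it owns."""
    kept = {k: v for k, (v, owner) in node.items.items() if owner == node.name}
    return kept, Summary(node.name, set(node.items), set(kept))


def find_mistakes(summaries):
    """Assumed mistake types: keys kept by no node, keys kept by several nodes."""
    keepers = {}
    for s in summaries:
        for k in s.stored_keys:
            keepers.setdefault(k, [])
        for k in s.kept_keys:
            keepers.setdefault(k, []).append(s.node)
    missing = sorted(k for k, names in keepers.items() if not names)
    duplicated = {k: sorted(n) for k, n in keepers.items() if len(n) > 1}
    return missing, duplicated


def build_snapshot(nodes):
    """Deduplicate per node, detect mistakes from summaries, apply corrections."""
    kept_by_node, summaries = {}, []
    for node in nodes:
        kept, summary = deduplicate(node)
        kept_by_node[node.name] = kept
        summaries.append(summary)

    missing, duplicated = find_mistakes(summaries)

    # Correction for duplicated keys: keep the copy from the first node only.
    for key, names in duplicated.items():
        for name in names[1:]:
            del kept_by_node[name][key]

    # Correction for missing keys: copy the value from any node that stored it.
    by_name = {n.name: n for n in nodes}
    corrections = {}
    for key in missing:
        holder = next(s.node for s in summaries if key in s.stored_keys)
        corrections[key] = by_name[holder].items[key][0]

    snapshot = dict(corrections)
    for kept in kept_by_node.values():
        snapshot.update(kept)
    return snapshot, missing, duplicated
```

In this sketch the mistakes are found from the summaries alone; the stored data is read again only to build a correction for a key that no node kept.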
18 Claims
1. A method of generating a consistent deduplicated snapshot of a distributed database using optimistic deduplication, the method comprising:
for each node of a plurality of nodes in the distributed database, deduplicating data items that are identified as being stored on the node and owned by the node, and generating a summary for the node, the summary describing a file in which the data items that are identified as being stored on the node are located;
identifying from the summaries for each of the nodes whether one or more mistakes occurred during deduplication;
upon identifying the one or more mistakes, determining one or more corrections for the one or more mistakes; and
generating a consistent deduplicated snapshot for the distributed database comprising the deduplicated data items from each node and the one or more corrections.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A system for generating a consistent deduplicated snapshot of a distributed database using optimistic deduplication, the system comprising:
one or more computer readable storage media;
a processing system operatively coupled with the one or more computer readable storage media; and
program instructions stored on the one or more computer readable storage media that, when read and executed by the processing system, direct the processing system to perform operations comprising:
for each node of a plurality of nodes in the distributed database, deduplicating data items that are identified as being stored on the node and owned by the node and generating a summary for the node, the summary describing a file in which the data items that are identified as being stored on the node are located;
identifying from the summaries for each of the nodes whether one or more mistakes occurred during deduplication;
upon identifying the one or more mistakes, determining one or more corrections for the one or more mistakes; and
generating a consistent deduplicated snapshot for the distributed database comprising the deduplicated data items from each node and the one or more corrections.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18
Specification