Virtual machine snapshot backup based on multilayer de-duplication
Abstract
The present disclosure provides an example method and system for virtual machine backup based on multilayer de-duplication. A virtual machine snapshot is divided into multiple child data blocks. Each child data block is divided into multiple data segments. Multilayer de-duplication is applied to the virtual machine snapshot to exclude data causing duplicate backup in the virtual machine snapshot. The remaining virtual machine snapshot data after the processing of the multilayer de-duplication is stored.
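The multilayer de-duplication summarized above, with the three layers recited in claim 1 (child data block, data segment, public data set), can be sketched roughly as follows. This is a minimal illustration and not the patented implementation: the fixed sizes `BLOCK_SIZE` and `SEGMENT_SIZE`, the use of SHA-256 digests to decide "unchanged", the per-block index layout, and the names `backup_snapshot`, `prev_index`, and `public_hashes` are all assumptions introduced for this example.

```python
import hashlib

BLOCK_SIZE = 4096    # hypothetical child data block size in bytes
SEGMENT_SIZE = 512   # hypothetical data segment size in bytes


def digest(data: bytes) -> str:
    """Content fingerprint used to compare against the last snapshot (assumed)."""
    return hashlib.sha256(data).hexdigest()


def split(data: bytes, size: int) -> list[bytes]:
    """Cut data into consecutive fixed-size pieces (the last may be shorter)."""
    return [data[i:i + size] for i in range(0, len(data), size)]


def backup_snapshot(snapshot: bytes, prev_index: dict, public_hashes: set):
    """Return (stored, index) for one snapshot.

    prev_index maps block number -> (block digest, [segment digests]) as
    recorded for the last snapshot; pass {} for the first backup.
    stored maps (block number, segment number) -> bytes that must be saved.
    """
    stored = {}
    index = {}
    for b_idx, block in enumerate(split(snapshot, BLOCK_SIZE)):
        b_hash = digest(block)
        prev = prev_index.get(b_idx)

        # Layer 1: child data block de-duplication.
        if prev is not None and prev[0] == b_hash:
            index[b_idx] = prev  # unchanged block, nothing to store
            continue

        # The block is revised: divide it into data segments.
        segments = split(block, SEGMENT_SIZE)
        seg_hashes = [digest(s) for s in segments]
        index[b_idx] = (b_hash, seg_hashes)
        prev_segs = prev[1] if prev is not None else []

        for s_idx, seg in enumerate(segments):
            s_hash = seg_hashes[s_idx]
            # Layer 2: data segment de-duplication.
            if s_idx < len(prev_segs) and prev_segs[s_idx] == s_hash:
                continue
            # Layer 3: public data set de-duplication. A real system would
            # keep a reference to the public copy; the sketch just skips it.
            if s_hash in public_hashes:
                continue
            stored[(b_idx, s_idx)] = seg
    return stored, index
```

In this sketch, the first call with an empty `prev_index` stores every segment not found in `public_hashes`; later calls pass the `index` returned by the previous call, so only segments that changed and are not in the public data set are stored.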
38 Citations
20 Claims
1. A computer-implemented method for virtual machine snapshot backup, comprising:
dividing a virtual machine snapshot of a virtual machine into one or more child data blocks;
applying a child data block de-duplication to exclude all child data blocks that are determined as unchanged since last snapshot;
dividing each of child data blocks marked as revised into a number of data segments;
applying a data segment de-duplication on each of the child data blocks to exclude from the number of data segments all data segments that are determined as unchanged since last snapshot;
applying a public data set de-duplication on data segments of the child data blocks that are determined as revised after the data segment de-duplication to exclude data segments that belong to a public data set; and
storing child data blocks and data segments that are not excluded as a virtual machine snapshot backup for the virtual machine snapshot, so as to exclude data causing duplicate backup in the virtual machine snapshot.
Dependent claims: 2, 4, 5, 6, 7, 8, 9, 10, 11, 12
3. A computer-implemented method for virtual machine snapshot backup, comprising:
dividing a virtual machine snapshot of a virtual machine into one or more child data blocks;
applying a child data block de-duplication to exclude all child data blocks that are determined as unchanged since last snapshot;
dividing each of child data blocks marked as revised into a number of data segments;
applying a data segment de-duplication on each of the child data blocks to exclude from the number of data segments all data segments that are determined as unchanged since last snapshot;
applying a public data set de-duplication on data segments of the child data blocks that are determined as revised after the data segment de-duplication to exclude data segments that belong to a public data set, wherein the public data set stores one or more data segments with repetition rates higher than a preset threshold in a backup storage file system; and
storing child data blocks and data segments that are not excluded as a virtual machine snapshot backup for the virtual machine snapshot, so as to exclude data causing duplicate backup in the virtual machine snapshot.
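One possible way to populate the public data set described in claim 3 is sketched below. The claim does not define how a repetition rate is measured; here it is assumed to be the fraction of all segments in the backup storage that share the same content. The function name `build_public_data_set`, the SHA-256 fingerprint, and the in-memory list of segments are hypothetical choices for illustration only.

```python
import hashlib
from collections import Counter


def build_public_data_set(segments, threshold: float) -> set:
    """Return digests of segments whose repetition rate exceeds threshold.

    segments: iterable of bytes objects already held in backup storage.
    threshold: preset value in [0, 1]; the repetition rate is assumed to be
    (occurrences of this content) / (total number of segments).
    """
    counts = Counter(hashlib.sha256(s).hexdigest() for s in segments)
    total = sum(counts.values())
    if total == 0:
        return set()
    return {h for h, n in counts.items() if n / total > threshold}
```

The returned set of digests could then serve as the membership test for the public data set de-duplication step, so that frequently repeated segments are stored once and excluded from individual snapshot backups.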
13. A system for virtual machine snapshot backup, comprising:
a processor; and
a memory coupled to the processor for storing computer programs to be executed by the processor, wherein the processor is configured to:
divide a virtual machine snapshot of a virtual machine into one or more child data blocks;
apply a child data block de-duplication to exclude all child data blocks that are determined as unchanged since last snapshot;
divide each of child data blocks marked as revised into a number of data segments;
apply a data segment de-duplication on each of the child data blocks to exclude from the number of data segments all data segments that are determined as unchanged since last snapshot;
apply a public data set de-duplication on data segments of the child data blocks that are determined as revised after the data segment de-duplication to exclude data segments that belong to a public data set; and
store child data blocks and data segments that are not excluded as a virtual machine snapshot backup for the virtual machine snapshot, so as to exclude data causing duplicate backup in the virtual machine snapshot.
Dependent claims: 14, 15, 16, 17, 18, 19, 20
Specification