Virtual machine snapshot backup based on multilayer De-duplication
First Claim
Patent Images
1. A computer-implemented method for virtual machine snapshot backup, comprising:
- dividing a virtual machine snapshot of a virtual machine into one or more child data blocks;
dividing a respective child data block into one or more data segments;
applying multilayer de-duplication to the virtual machine snapshot;
periodically scanning the backup storage file system; and
based on one or more data repetition characteristics of data storage, extracting data whose repetition rate is higher than a preset threshold into a public data set.
1 Assignment
0 Petitions
Accused Products
Abstract
The present disclosure provides an example method and system for virtual machine backup based on multilayer de-duplication. A virtual machine snapshot is divided into multiple child data blocks. Each child data block is divided into multiple data segments. Multilayer de-duplication is applied to the virtual machine snapshot to exclude data causing duplicate backup in the virtual machine snapshot. The remaining virtual machine snapshot data after the processing of the multilayer de-duplication is stored.
36 Citations
18 Claims
-
1. A computer-implemented method for virtual machine snapshot backup, comprising:
-
dividing a virtual machine snapshot of a virtual machine into one or more child data blocks; dividing a respective child data block into one or more data segments; applying multilayer de-duplication to the virtual machine snapshot; periodically scanning the backup storage file system; and based on one or more data repetition characteristics of data storage, extracting data whose repetition rate is higher than a preset threshold into a public data set. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A computer-implemented method for virtual machine snapshot backup, comprising:
-
dividing a virtual machine snapshot of a virtual machine into one or more child data blocks; dividing a respective child data block into one or more data segments; applying multilayer de-duplication to the virtual machine snapshot; and performing a rollover of the virtual machine snapshot, the performing including; reading an index of child data blocks from a backup storage file system according to an index of a to-be-rollover virtual machine snapshot; reading data segments according to the index of child data blocks;
forming the read data segments into one or more child data blocks; andforming the formed one or more child data blocks into the to-be-rollover virtual machine snapshot. - View Dependent Claims (13)
-
-
14. A computer-implemented method for virtual machine snapshot backup, comprising:
-
dividing a virtual machine snapshot of a virtual machine into one or more child data blocks; dividing a respective child data block into one or more data segments; applying multilayer de-duplication to the virtual machine snapshot; and performing deleting the virtual machine snapshot, the deleting including; writing deletion information of an index of a to-be-deleted virtual machine snapshot of the virtual machine into a log; when a volume of the deletion information in the log is larger than a preset threshold, scanning backup data of the virtual machine to find a child data block or a data segment that has not been referenced for a threshold period of time; and deleting the child data block or the data segment.
-
-
15. A system for virtual machine snapshot backup, comprising:
-
a processor; and a memory coupled to the processor for storing computer programs to be executed by the processor, wherein the processor is configured to; divide a virtual machine snapshot of a virtual machine into one or more child data blocks and to divide a respective child data block into one or more data segments; apply multilayer de-duplication to the virtual machine snapshot; store remaining data in the virtual machine snapshot after applying the multilayer de-duplication; periodically scan the backup storage file system; and based on one or more data repetition characteristics of data storage, extract data whose repetition rate is higher than a preset threshold into the public data set. - View Dependent Claims (16, 17, 18)
-
Specification