System and method for block-based subtree virtual machine backup
First Claim
1. A computer-implemented method, comprising:
- parsing, by a backup engine, a source virtual machine (VM) disk file associated with a VM to generate metadata representing a plurality of files stored in a virtual storage device of the VM, the VM being one of one or more VMs hosted by a VM monitor (VMM) of a source storage system, wherein the backup engine operates in a backup server that is communicatively coupled to the source storage system over a network, wherein the backup engine remotely accesses the source VM disk file via an application programming interface (API) of the VMM of the source storage system over the network, wherein the backup server is configured to communicate with a plurality of source storage systems over the network and to back up data from the plurality of source storage systems;
generating, by the backup engine, a file index based on the metadata representing the source VM disk file based on the parsing of the source VM disk file;
in response to a request to back up a subdirectory of one or more files stored in the source VM disk file, identifying file records of the one or more files of the requested subdirectory based on the metadata stored in the file index; and
communicating with the VMM of the source storage system via the API to back up the identified file records associated with the subdirectory from the source VM disk file to a target VM disk file of a target storage system that is remote to the source storage system over the network, without having to back up remaining files in the source VM disk file.
9 Assignments
0 Petitions
Accused Products
Abstract
Techniques for virtual machine backup are described herein. According to one embodiment, a source virtual machine (VM) disk file associated with a VM is parsed to generate metadata representing files stored in a virtual storage device of the VM, where the VM is one of one or more VMs hosted by a VM monitor (VMM). In response to a request to back up a subdirectory of one or more files stored in the source VM disk file, file records of the one or more files of the requested subdirectory are identified based on the metadata. The identified file records associated with the subdirectory are backed up from the source VM disk file to a target VM disk file, without having to back up remaining files in the source VM disk file.
62 Citations
21 Claims
-
1. A computer-implemented method, comprising:
-
parsing, by a backup engine, a source virtual machine (VM) disk file associated with a VM to generate metadata representing a plurality of files stored in a virtual storage device of the VM, the VM being one of one or more VMs hosted by a VM monitor (VMM) of a source storage system, wherein the backup engine operates in a backup server that is communicatively coupled to the source storage system over a network, wherein the backup engine remotely accesses the source VM disk file via an application programming interface (API) of the VMM of the source storage system over the network, wherein the backup server is configured to communicate with a plurality of source storage systems over the network and to back up data from the plurality of source storage systems; generating, by the backup engine, a file index based on the metadata representing the source VM disk file based on the parsing of the source VM disk file; in response to a request to back up a subdirectory of one or more files stored in the source VM disk file, identifying file records of the one or more files of the requested subdirectory based on the metadata stored in the file index; and communicating with the VMM of the source storage system via the API to back up the identified file records associated with the subdirectory from the source VM disk file to a target VM disk file of a target storage system that is remote to the source storage system over the network, without having to back up remaining files in the source VM disk file. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A non-transitory computer-readable storage medium having instructions stored therein, which when executed by a computer, cause the computer to perform operations, the operations comprising:
-
parsing, by a backup engine, a source virtual machine (VM) disk file associated with a VM to generate metadata representing a plurality of files stored in a virtual storage device of the VM, the VM being one of one or more VMs hosted by a VM monitor (VMM) of a source storage system, wherein the backup engine operates in a backup server that is communicatively coupled to the source storage system over a network, wherein the backup engine remotely accesses the source VM disk file via an application programming interface (API) of the VMM of the source storage system over the network, wherein the backup server is configured to communicate with a plurality of source storage systems over the network and to back up data from the plurality of source storage systems; generating, by the backup engine, a file index based on the metadata representing the source VM disk file based on the parsing of the source VM disk file; in response to a request to back up a subdirectory of one or more files stored in the source VM disk file, identifying file records of the one or more files of the requested subdirectory based on the metadata stored in the file index; and communicating with the VMM of the source storage system via the API to back up the identified file records associated with the subdirectory from the source VM disk file to a target VM disk file of a target storage system that is remote to the source storage system over the network, without having to back up remaining files in the source VM disk file. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A storage system, comprising:
-
a virtual machine (VM) disk parser to parse a source virtual machine (VM) disk file associated with a VM to generate metadata representing a plurality of files stored in a virtual storage device of the VM, the VM being one of one or more VMs hosted by a VM monitor (VMM) of a source storage system, wherein the storage system is communicatively coupled to the source storage system over a network, wherein the VM disk parser remotely accesses the source VM disk file via an application programming interface (API) of the VMM of the source storage system over the network, wherein the backup server is configured to communicate with a plurality of source storage systems over the network and to back up data from the plurality of source storage systems, wherein the VM disk parser is to generate a file index based on the metadata representing the source VM disk file based on the parsing of the source VM disk file; and a backup engine, in response to a request to back up a subdirectory of one or more files stored in the source VM disk file, to identify file records of the one or more files of the requested subdirectory based on the metadata stored in the file index and to communicate with the VMM of the source storage system via the API to back up the identified file records associated with the subdirectory from the source VM disk file to a target VM disk file of a target storage system that is remote to the source storage system over the network, without having to back up remaining files in the source VM disk file. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
Specification