System and method for file-based virtual machine incremental backup
Abstract
A source virtual machine (VM) disk file associated with a VM is parsed to generate metadata representing files stored in a virtual storage device of the VM, where the VM is one of VMs hosted by a VM monitor (VMM). In response to a request for an incremental backup of files stored in the source VM disk file, a file attribute of each of the files is examined based on the metadata to identify one or more files that have been modified within a predetermined past period of time. File records associated with the identified files are determined based on the metadata that have been modified within the predetermined past period of time. The identified file records are then backed up from the source VM disk file to a target VM disk file, without having to back up unmodified files in the source VM disk file.
21 Claims
1. A computer-implemented method, comprising:
- parsing, by a backup engine, a source virtual machine (VM) disk file associated with a VM to generate metadata representing a plurality of files stored in a virtual storage device of the VM, the VM being one of one or more VMs hosted by a VM monitor (VMM) of a source storage system, wherein the backup engine operates in a backup server that is communicatively coupled to the source storage system over a network, wherein the backup engine remotely accesses the source VM disk file via an application programming interface (API) of the VMM of the source storage system over the network, wherein the backup server is configured to communicate with a plurality of source storage systems over the network and to back up data from the plurality of source storage systems;
- generating, by the backup engine, a file index based on the metadata representing the source VM disk file based on the parsing of the source VM disk file;
- in response to a request for an incremental backup of files stored in the source VM disk file, examining based on a file attribute of each of the files from the file index to identify one or more files that have been modified within a predetermined past period of time;
- determining from the file index one or more file records associated with the identified files that have been modified within the predetermined past period of time; and
- transmitting, by the backup engine, to the VMM of the source storage system via the API, information representing the identified file records to back up the identified file records from the source VM disk file to a target VM disk file of a target storage system that is remote to the source storage system over the network, without having to back up unmodified files in the source VM disk file.
Dependent claims: 2, 3, 4, 5, 6, 7
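The examining and determining steps of claim 1 can be illustrated with a minimal Python sketch. It is not taken from the patent: the `FileRecord` fields, the `select_modified` helper, and the sample paths and timestamps are all hypothetical, and the file attribute is assumed to be a last-modified time.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass(frozen=True)
class FileRecord:
    """One entry of the file index (all field names are hypothetical)."""
    path: str
    modified: datetime  # assumed file attribute: last-modified time
    offset: int         # where the file's data starts inside the VM disk file
    length: int         # size of the file's data in bytes


def select_modified(index, now, window):
    """Return the records whose files were modified within `window` before `now`."""
    cutoff = now - window
    return [rec for rec in index if rec.modified >= cutoff]


# Illustrative index; in the claim it would come from parsing the VM disk file.
now = datetime(2024, 1, 10, 12, 0)
index = [
    FileRecord("/etc/hosts", datetime(2023, 12, 1, 9, 0), 4096, 220),
    FileRecord("/var/log/app.log", datetime(2024, 1, 10, 8, 30), 8192, 51200),
    FileRecord("/home/a/report.txt", datetime(2024, 1, 9, 18, 0), 65536, 1300),
]

# Predetermined past period of time: the last 24 hours.
changed = select_modified(index, now, timedelta(days=1))
print([rec.path for rec in changed])
```

Only the two records modified within the last day are selected, so the unmodified `/etc/hosts` entry would not be part of the information sent for backup.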
8. A non-transitory computer-readable storage medium having instructions stored therein, which when executed by a computer, cause the computer to perform operations, the operations comprising:
- parsing, by a backup engine, a source virtual machine (VM) disk file associated with a VM to generate metadata representing a plurality of files stored in a virtual storage device of the VM, the VM being one of one or more VMs hosted by a VM monitor (VMM) of a source storage system, wherein the backup engine operates in a backup server that is communicatively coupled to the source storage system over a network, wherein the backup engine remotely accesses the source VM disk file via an application programming interface (API) of the VMM of the source storage system over the network, wherein the backup server is configured to communicate with a plurality of source storage systems over the network and to back up data from the plurality of source storage systems;
- generating, by the backup engine, a file index based on the metadata representing the source VM disk file based on the parsing of the source VM disk file;
- in response to a request for an incremental backup of files stored in the source VM disk file, examining based on a file attribute of each of the files from the file index to identify one or more files that have been modified within a predetermined past period of time;
- determining from the file index one or more file records associated with the identified files that have been modified within the predetermined past period of time; and
- transmitting, by the backup engine, to the VMM of the source storage system via the API, information representing the identified file records to back up the identified file records from the source VM disk file to a target VM disk file of a target storage system that is remote to the source storage system over the network, without having to back up unmodified files in the source VM disk file.
Dependent claims: 9, 10, 11, 12, 13, 14
15. A storage system representing a backup server, comprising:
- a virtual machine (VM) disk parser to parse a source VM disk file associated with a VM to generate metadata representing a plurality of files stored in a virtual storage device of the VM, the VM being one of one or more VMs hosted by a VM monitor (VMM) of a source storage system, wherein the storage system operates as a backup server that is communicatively coupled to the source storage system over a network, wherein the VM parser remotely accesses the source VM disk file via an application programming interface (API) of the VMM of the source storage system over the network, wherein the backup server is configured to communicate with a plurality of source storage systems over the network and to back up data from the plurality of source storage systems, wherein the VM disk parser generates a file index based on the metadata representing the source VM disk file based on the parsing of the source VM disk file;
- a backup engine, in response to a request for an incremental backup of files stored in the source VM disk file, to examine based on a file attribute of each of the files from the file index to identify one or more files that have been modified within a predetermined past period of time, to determine from the file index one or more file records associated with the identified files that have been modified within the predetermined past period of time, and to transmit to the VMM of the source storage system via the API information representing the identified file records to back up identified file records from the source VM disk file to a target VM disk file of a target storage system that is remote to the source storage system over the network, without having to back up unmodified files in the source VM disk file.
Dependent claims: 16, 17, 18, 19, 20, 21
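Claim 15 divides the work between a VM disk parser and a backup engine, both reaching the source system through the VMM's API. The sketch below shows one possible arrangement of that split, under stated assumptions: `FakeVmmApi`, its two methods, the tuple layout of the listing, and the file names are hypothetical stand-ins, not the interface of any real VMM, and modification times are plain epoch seconds.

```python
from dataclasses import dataclass, field


@dataclass
class FakeVmmApi:
    """Hypothetical stand-in for the VMM's API; a real VMM has its own interface."""
    disk_listing: list  # (path, mtime_epoch, offset, length) tuples
    requests: list = field(default_factory=list)

    def read_listing(self, disk_file):
        # Pretend to read file metadata out of the remote VM disk file.
        return self.disk_listing

    def backup_records(self, source, target, records):
        # Record the request instead of copying data to a target system.
        self.requests.append((source, target, records))


class VmDiskParser:
    """Builds the file index from metadata obtained through the API."""

    def __init__(self, api):
        self.api = api

    def build_index(self, disk_file):
        return {path: {"mtime": mtime, "offset": off, "length": length}
                for path, mtime, off, length in self.api.read_listing(disk_file)}


class BackupEngine:
    """Selects recently modified files and sends their records to the VMM."""

    def __init__(self, api):
        self.api = api

    def incremental_backup(self, index, source, target, now, window_seconds):
        cutoff = now - window_seconds
        records = [(meta["offset"], meta["length"])
                   for meta in index.values() if meta["mtime"] >= cutoff]
        self.api.backup_records(source, target, records)
        return records


api = FakeVmmApi([("/a.txt", 1000, 0, 10), ("/b.txt", 5000, 10, 20)])
index = VmDiskParser(api).build_index("source.img")
sent = BackupEngine(api).incremental_backup(
    index, "source.img", "target.img", now=6000, window_seconds=2000)
print(sent)
```

Keeping the parser and the engine as separate objects that share one API handle mirrors the claim's structure: the index is built once, and each incremental request only filters it and forwards the matching records.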
Specification