Cataloging system for image-based backup
First Claim
1. A system for backing up a virtual machine disk file in a virtual computing environment, the system comprising:
- a computer system comprising computer hardware, the computer system comprising a data store, the computer system programmed to implement a cataloging module, a backup module, and a restore module, wherein:
the cataloging module is configured to, for each virtual machine disk file of a plurality of virtual machine disk files:
access the virtual machine disk file comprising a file system of a guest operating system of a virtual machine to obtain metadata about the file system, the metadata comprising information about a logical structure of the file system;
analyze the metadata to determine a logical hierarchy of a set of files in the file system;
analyze the metadata to determine one or more attributes of the set of files in the file system;
generate a catalog comprising information about the logical hierarchy and the one or more attributes of the set of files in the file system, the catalog comprising a computer-enabled searchable index of at least one of a plurality of image-based backups, wherein the computer-enabled searchable index performs automated file searches within the plurality of image-based backups without manually mounting a backup image containing a searched-for file;
wherein each image-based backup of the plurality of image-based backups is associated with a unique catalog; and
store the catalog outside the virtual machine;
the backup module is configured to, for each virtual machine disk file of the plurality of virtual machine disk files:
backup the virtual machine disk file to a backup image stored on the data store; and
the restore module is configured to:
enable a user to search the catalogs for a desired file from a virtual machine image-based backup from outside the virtual machines and without specifying which virtual machine image includes the desired file; and
wherein the computer system is programmed to concurrently and asynchronously execute the cataloging module in a first thread and the backup module in a second thread different from the first thread, wherein the cataloging module is configured to determine the logical hierarchy of the at least some files in the file system in a first pass through the metadata and to determine the one or more attributes of the at least some files in the file system in a second pass through the metadata; and
wherein the first thread and the second thread are executed by a backup server outside the virtual machine associated with the virtual machine disk file.
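The two-pass cataloging and the concurrent cataloging and backup threads recited above can be illustrated with a minimal sketch. It is not the patented implementation: the `Record` type, its fields, `build_catalog`, and `run_backup_with_catalog` are hypothetical names introduced here, the metadata is assumed to be a flat list of records that each point to a parent record, and the "backup" is simulated by copying bytes in memory.

```python
import threading
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Record:
    """Hypothetical file-system metadata record (an assumption for illustration)."""
    rec_id: int
    parent_id: Optional[int]  # None marks the root directory
    name: str
    size: int
    is_dir: bool


def build_catalog(records):
    """Build a path -> attributes catalog in two passes over the metadata."""
    by_id = {r.rec_id: r for r in records}
    paths = {}

    def path_of(rec_id):
        # Resolve a record's full path by walking up to the root, caching results.
        if rec_id not in paths:
            rec = by_id[rec_id]
            if rec.parent_id is None:
                paths[rec_id] = rec.name
            else:
                parent = path_of(rec.parent_id).rstrip("/")
                paths[rec_id] = parent + "/" + rec.name
        return paths[rec_id]

    # First pass: determine the logical hierarchy (full path of every record).
    for r in records:
        path_of(r.rec_id)

    # Second pass: attach the attributes of each file to its resolved path.
    return {paths[r.rec_id]: {"size": r.size, "is_dir": r.is_dir} for r in records}


def run_backup_with_catalog(records, image: bytes):
    """Run cataloging and backup in two separate threads and wait for both."""
    result = {}

    def catalog_worker():
        result["catalog"] = build_catalog(records)

    def backup_worker():
        # Copy the image in fixed-size chunks; a stand-in for a real block-level copy.
        chunks = [image[i:i + 4] for i in range(0, len(image), 4)]
        result["backup_image"] = b"".join(chunks)

    first = threading.Thread(target=catalog_worker)
    second = threading.Thread(target=backup_worker)
    first.start()
    second.start()
    first.join()
    second.join()
    return result["catalog"], result["backup_image"]
```

In this sketch neither thread waits on the other while running, which mirrors the "concurrently and asynchronously" language; how a real backup server would schedule the two is not specified by the claim.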
Abstract
Examples of systems and methods for cataloging image-based backups are disclosed. In certain implementations, these systems and methods can obtain information about file system structure from a volume undergoing image-based backup. The systems and methods can generate a catalog of the file system structure that can be stored with the backup image. The systems and methods can generate a searchable index of the catalog so that users can search for files within the backup image without being required to manually mount the backup image containing the searched-for file. As a result, in certain embodiments, users advantageously can more quickly locate and restore files from image-based backups.
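As a rough illustration of searching for a file across several backups without mounting any image, the sketch below builds a filename index over per-backup catalogs. The catalog layout (a dictionary from backup identifier to a dictionary of paths) and the names `build_index` and `find_file` are assumptions made for this example, not details taken from the patent.

```python
from collections import defaultdict


def build_index(catalogs):
    """Map each lower-cased file name to the (backup_id, path) pairs that contain it.

    `catalogs` is assumed to look like {backup_id: {path: attributes}}.
    """
    index = defaultdict(list)
    for backup_id, catalog in catalogs.items():
        for path in catalog:
            filename = path.rsplit("/", 1)[-1].lower()
            index[filename].append((backup_id, path))
    return index


def find_file(index, filename):
    """Return every backup and path holding `filename`, without naming a backup up front."""
    return sorted(index.get(filename.lower(), []))
```

A caller would build the index once from the stored catalogs and then answer lookups from it, so the backup images themselves are never opened during a search.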
19 Claims
14. A method of backing up a virtual machine disk file in a virtual computing environment, the method comprising:
under control of a computer system comprising one or more physical computing devices, for each virtual machine disk file of a plurality of virtual machine disk files:
accessing metadata of the virtual machine disk file, the virtual machine disk file comprising a file system of a guest operating system of a virtual machine executing on a host server, the metadata comprising information about a logical structure of the file system;
analyzing the metadata to determine a logical hierarchy of a set of files in the file system and one or more attributes of the set of files in the file system;
filtering the metadata based on at least one filter criterion to determine the set of files in the file system;
generating a catalog comprising information about the logical hierarchy and the one or more attributes of the set of files in the file system, the catalog comprising a computer-enabled searchable index of at least one of a plurality of image-based backups, wherein the computer-enabled searchable index performs automated file searches within the plurality of image-based backups without manually mounting a backup image containing a searched-for file;
wherein each image-based backup of the plurality of image-based backups is associated with a unique catalog;
storing the catalog in a persistent storage outside the virtual machine;
backing up the virtual machine disk file to a backup image;
the computer system enabling a user to search the catalogs for a desired file from a virtual machine image-based backup from outside the virtual machines and without specifying which virtual machine image includes the desired file;
wherein the computer system is programmed to concurrently and asynchronously execute a first thread and a second thread;
wherein the first thread comprises a cataloging thread that is configured to determine the logical hierarchy of the at least some files in the file system in a first pass through the metadata and to determine the one or more attributes of the at least some files in the file system in a second pass through the metadata;
wherein the second thread comprises the backing up; and
wherein the first thread and the second thread are executed by a backup server outside the virtual machine associated with the virtual machine disk file.
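The filtering step in this claim, which narrows the metadata to the set of files that will be cataloged, might be sketched as follows. The record layout and the two example criteria (skipping `.tmp` and `.sys` files) are hypothetical; the claim only requires "at least one filter criterion" and does not say what it is.

```python
def filter_records(records, criteria):
    """Keep only the metadata records that satisfy every filter criterion.

    `records` is assumed to be a list of dicts with a "path" key;
    `criteria` is a list of predicates taking one record each.
    """
    return [rec for rec in records if all(criterion(rec) for criterion in criteria)]


# Example criteria (assumptions for illustration only).
def not_temp_file(rec):
    return not rec["path"].lower().endswith(".tmp")


def not_system_file(rec):
    return not rec["path"].lower().endswith(".sys")
```

Passing `[not_temp_file, not_system_file]` as `criteria` would leave only ordinary user files in the set handed to the cataloging step.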
19. A computer-readable storage medium comprising computer-executable instructions configured to implement a method of backing up a virtual machine image, the method comprising:
for each virtual machine image of a plurality of virtual machine images:
accessing metadata of the virtual machine image, the virtual machine image comprising a file system of a guest operating system of a virtual machine, the metadata comprising information about a logical structure of the file system;
analyzing the metadata to determine a logical hierarchy of a set of files in the file system and one or more attributes of the set of files in the file system;
filtering the metadata based on at least one filter criterion to determine the set of files in the file system;
generating a catalog comprising information about the logical hierarchy and the one or more attributes of the set of files in the file system, the catalog comprising a computer-enabled searchable index of at least one of a plurality of image-based backups, wherein the computer-enabled searchable index performs automated file searches within the plurality of image-based backups without manually mounting a backup image containing a searched-for file;
wherein each image-based backup of the plurality of image-based backups is associated with a unique catalog;
storing the catalog in a persistent storage;
backing up the virtual machine image to a backup image;
enabling a user to search the catalogs for a desired file from a virtual machine image-based backup from outside the virtual machines and without specifying which virtual machine image includes the desired file;
wherein the computer-executable instructions launch a first thread and a second thread that execute concurrently and asynchronously;
wherein the first thread comprises a cataloging thread that is configured to determine the logical hierarchy of the at least some files in the file system in a first pass through the metadata and to determine the one or more attributes of the at least some files in the file system in a second pass through the metadata;
wherein the second thread comprises the backing up; and
wherein the first thread and the second thread are executed by a backup server outside the virtual machine associated with a virtual machine disk file.
Specification