Method for using redundant data elimination to accelerate storage system scanning
First Claim
1. A method for use in a computer system comprising at least one mapping layer that makes available a plurality of storage objects, and a storage system that stores data in each of the plurality of storage objects in one or more data blocks, the method comprising:
- reducing space usage on the storage system, using a redundant data elimination tool, of at least some redundant data by sharing at least one of the data blocks among multiple storage objects;
generating information about the at least one of the data blocks shared among multiple storage objects, wherein the information identifies the storage objects that include shared data blocks;
assigning to the plurality of storage objects a scanning priority based at least in part on the information about the at least one of the data blocks shared among multiple storage objects, wherein a higher scanning priority is assigned for storage objects having redundant data blocks shared among multiple storage objects over storage objects that do not include a shared data block; and
performing, using at least one scanning tool, at least one scanning operation on at least some of the plurality of storage objects wherein ones of the storage objects having a higher scanning priority are scanned prior to ones of the objects having a lower scanning priority.
9 Assignments
0 Petitions
Accused Products
Abstract
Embodiments are directed to methods and apparatus for making available to at least one scanning tool, information about at least one data unit shared among multiple storage objects of a plurality of storage objects stored on a storage system. The at least one scanning tool can use the information to influence at least one scanning operation on the at least some of the plurality of storage objects. Embodiments may be implemented in a computer system comprising at least one application program, at least one mapping layer that makes available to the at least one application program a plurality of storage objects, and a storage system that stores data in each of the plurality of storage objects in one or more data units. The at least one computer system further comprises the at least one scanning tool that performs the at least one scanning operation on at least some of the plurality of storage objects, and at least one redundant data elimination tool that eliminates at least some redundant data by sharing at least one of the data units among multiple storage objects.
-
Citations
16 Claims
-
1. A method for use in a computer system comprising at least one mapping layer that makes available a plurality of storage objects, and a storage system that stores data in each of the plurality of storage objects in one or more data blocks, the method comprising:
-
reducing space usage on the storage system, using a redundant data elimination tool, of at least some redundant data by sharing at least one of the data blocks among multiple storage objects; generating information about the at least one of the data blocks shared among multiple storage objects, wherein the information identifies the storage objects that include shared data blocks; assigning to the plurality of storage objects a scanning priority based at least in part on the information about the at least one of the data blocks shared among multiple storage objects, wherein a higher scanning priority is assigned for storage objects having redundant data blocks shared among multiple storage objects over storage objects that do not include a shared data block; and performing, using at least one scanning tool, at least one scanning operation on at least some of the plurality of storage objects wherein ones of the storage objects having a higher scanning priority are scanned prior to ones of the objects having a lower scanning priority. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. At least one non-transitory computer readable medium encoded with a plurality of instructions that, when executed, perform a method for use in a computer system comprising at least one mapping layer that makes available a plurality of storage objects, and a storage system that stores data in each of the plurality of storage objects in one or more data blocks, the method comprising:
-
reducing space usage on the storage system, using a redundant data elimination tool, of at least some redundant data by sharing at least one of the data blocks among multiple storage objects; generating information about the at least one of the data blocks shared among multiple storage objects, wherein the information identifies the storage objects that include shared data blocks; assigning to the plurality of storage objects a scanning priority based at least in part on the information about the at least one of the data blocks shared among multiple storage objects, wherein a higher scanning priority is assigned for storage objects having redundant data blocks shared among multiple storage objects over storage objects that do not include a shared data block; and performing, using at least one scanning tool, at least one scanning operation on at least some of the plurality of storage objects wherein ones of the storage objects having a higher scanning priority are scanned prior to ones of the objects having a lower scanning priority. - View Dependent Claims (9, 10, 11, 12)
-
-
13. At least one computer system, comprising:
-
at least one mapping layer that makes available a plurality of storage objects; a storage system that stores data in each of the plurality of storage objects in one or more data blocks; at least one scanning tool that performs at least one scanning operation on at least some of the plurality of storage objects; at least one redundant data elimination tool, different from the at least one scanning tool, that reduces space usage on the storage system of at least some redundant data by sharing at least one of the data blocks among multiple storage objects; and at least one processor programmed to; generate information about the at least one of the data blocks shared among multiple storage objects, wherein the information identifies the storage objects that include shared data blocks; assign to the plurality of storage objects a scanning priority based at least in part on the information about the at least one of the data blocks shared among multiple storage objects, wherein a higher scanning priority is assigned for storage objects having redundant data blocks shared among multiple storage objects over storage objects that do not include a shared data block; and perform, using at least one scanning tool, at least one scanning operation on at least some of the plurality of storage objects wherein ones of the storage objects having a higher scanning priority are scanned prior to ones of the objects having a lower scanning priority. - View Dependent Claims (14, 15, 16)
-
Specification