SYSTEM AND METHOD FOR APPLICATION AWARE DE-DUPLICATION OF DATA BLOCKS ON A VIRTUALIZED STORAGE ARRAY
First Claim
1. A method for application aware de-duplication (de-dup) of data blocks on virtualized storage arrays in a storage area network (SAN), comprising:
- enabling a de-dup agent on each of one or more components of the SAN, wherein the one or more components of the SAN comprises a host device, a data path module (DPM), and virtualized storage arrays;
creating a master list of metadata associated with indexed data and storing the masterlist in the virtualized storage arrays;
creating one or more sublists of metadata from the masterlist and storing the one or more sublists in remaining one or more components of the SAN;
upon receiving a write request from an application residing in the host device, determining whether a data block being written has an entry in the sublist stored in the host device; and
if so, replacing the data block with a pointer indicating where the data block is residing in the virtualized storage arrays.
2 Assignments
0 Petitions
Accused Products
Abstract
A system and method for application aware de-duplication (de-dup) of data blocks in a virtualized storage array is disclosed. In one embodiment, in a method of application aware de-dup of data blocks on virtualized storage arrays in a storage area network, a de-dup agent is enabled on each of one or more components of the storage area network. A master list of metadata associated with indexed data is then created and stored in the virtualized storage arrays. One or more sublists of metadata are then created from the masterlist and are stored. Upon receiving a write request from an application residing in the host device, it is determined whether data block being written has an entry in a sublist stored in a host device, and if so, the data block is then replaced with a pointer indicating where the data block is residing in the virtualized storage arrays.
17 Citations
20 Claims
-
1. A method for application aware de-duplication (de-dup) of data blocks on virtualized storage arrays in a storage area network (SAN), comprising:
-
enabling a de-dup agent on each of one or more components of the SAN, wherein the one or more components of the SAN comprises a host device, a data path module (DPM), and virtualized storage arrays; creating a master list of metadata associated with indexed data and storing the masterlist in the virtualized storage arrays; creating one or more sublists of metadata from the masterlist and storing the one or more sublists in remaining one or more components of the SAN; upon receiving a write request from an application residing in the host device, determining whether a data block being written has an entry in the sublist stored in the host device; and if so, replacing the data block with a pointer indicating where the data block is residing in the virtualized storage arrays. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A SAN, comprising:
-
a host device; a DPM connected to the host device; and one or more virtualized storage arrays connected to the DPM, wherein each of the host device, the DPM and the one or more virtualized storage arrays includes an associated de-dup agent to enable application aware de-dup of data blocks on the one or more virtualized storage arrays in the SAN. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A SAN, comprising:
-
a host device; and one or more virtualized storage arrays connected to the host device, wherein each of the host device, and the one or more virtualized storage arrays includes an associated de-dup agent to enable application aware de-dup of data blocks on the one or more virtualized storage arrays in the SAN. - View Dependent Claims (14, 15, 16, 17)
-
-
18. A non-transitory computer-readable storage medium for application aware de-dup of data blocks on virtualized storage arrays in a SAN, having instructions that, when executed by a computing device causes the computing device to:
-
enable a de-dup agent on each of one or more components of the SAN, wherein the one or more components of the storage array comprises a host device, a DPM, and virtualized storage arrays; create a master list of metadata associated with indexed data and storing the masterlist in the virtualized storage arrays; create one or more sublists of metadata from the masterlist and storing the one or more sublists in remaining one or more components of the SAN; upon receiving a write request from an application residing in the host device, determines whether data block being written has an entry in a sublist stored in the host device; and if so, replaces the data block with a pointer indicating where the data block is residing in the virtualized storage arrays.
-
-
19. The non-transitory computer-readable storage medium 18, wherein the master list of metadata is an ordered weightage list decided based on number of occurrences of the data blocks in each of the virtualized storage arrays.
-
20. The non-transitory computer-readable storage medium 18, wherein the virtualized storage arrays comprise thin provisioned virtual volumes.
Specification