Method and system for managing de-duplication of data in a virtualization environment
First Claim
Patent Images
1. A system for managing de-duplication of data in a virtualization environment, comprising:
- a plurality of nodes having multiple storage controllers implemented as virtual machines, the multiple storage controllers distributed across multiple nodes, wherein the plurality of nodes implements a virtualization environment;
a plurality of primary storage devices that are accessed by virtual machines in the virtualization environment, wherein the plurality of primary storage devices are managed by the multiple storage controllers; and
wherein the plurality of primary storage devices correspond to a global storage pool having a local storage device on a first node of the plurality of nodes and a local storage device on a second node of the plurality of nodes, the second node being different from the first node, and wherein a virtual disk is exposed to the virtual machines as a set of addressable storage units corresponding to a plurality of virtual disks structured from the global storage pool and data within the set of addressable storage units is organized as extents and the extents are grouped within extent groups, wherein data undergoes de-duplication in the virtualization environment such that an instruction issued by a virtual machine to store a data item does not result in the data item being duplicatively stored if the data item is already stored in the global storage pool, and de-duplication is enabled on an extent group basis.
3 Assignments
0 Petitions
Accused Products
Abstract
A system for managing de-duplication of data in a virtualization environment, that includes a plurality of nodes, wherein the plurality of nodes implements a virtualization environment, a plurality of primary storage devices that are accessed by virtual machines in the virtualization environment, and wherein data undergoes de-duplication in the virtualization environment such that an instruction to store a data item that is issued by a virtual machine is not duplicatively stored if the data item is already stored in the plurality of the primary storage devices.
183 Citations
17 Claims
-
1. A system for managing de-duplication of data in a virtualization environment, comprising:
-
a plurality of nodes having multiple storage controllers implemented as virtual machines, the multiple storage controllers distributed across multiple nodes, wherein the plurality of nodes implements a virtualization environment; a plurality of primary storage devices that are accessed by virtual machines in the virtualization environment, wherein the plurality of primary storage devices are managed by the multiple storage controllers; and wherein the plurality of primary storage devices correspond to a global storage pool having a local storage device on a first node of the plurality of nodes and a local storage device on a second node of the plurality of nodes, the second node being different from the first node, and wherein a virtual disk is exposed to the virtual machines as a set of addressable storage units corresponding to a plurality of virtual disks structured from the global storage pool and data within the set of addressable storage units is organized as extents and the extents are grouped within extent groups, wherein data undergoes de-duplication in the virtualization environment such that an instruction issued by a virtual machine to store a data item does not result in the data item being duplicatively stored if the data item is already stored in the global storage pool, and de-duplication is enabled on an extent group basis. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method for implementing de-duplication in a virtualization environment, comprising:
-
receiving an instruction by a virtual machine to store a data item in a virtualization environment, the virtualization environment implemented by a plurality of nodes having multiple storage controllers implemented as virtual machines distributed across the plurality of nodes, the multiple storage controllers managing a plurality of primary storage devices that are accessed by virtual machines in the virtualization environment to store data; and performing de-duplication on data in the virtualization environment such that an instruction issued by a virtual machine to store a data item to a virtual disk does not result in the data item being duplicatively stored if the data item is already stored in a global storage pool having a local storage device on a first node of the plurality of nodes and a local storage device on a second node of the plurality of nodes, the second node being different from the first node, wherein the virtual disk is exposed to the virtual machines as a set of addressable storage units corresponding to a plurality of virtual disks structured from the global storage pool and data within the set of addressable storage units is organized as extents, and the extents are grouped within extent groups, and de-duplication is enabled on an extent group basis. - View Dependent Claims (9, 10, 11, 12)
-
-
13. A computer program product embodied on a non-transitory computer readable medium, the non-transitory computer readable medium having stored thereon a sequence of instructions which, when executed by a processor causes the processor to execute a method for implementing de-duplication in a virtualization environment, the method comprising:
-
receiving an instruction by a virtual machine to store a data item in a virtualization environment, the virtualization environment implemented by a plurality of nodes having multiple storage controllers implemented as virtual machines distributed across the plurality of nodes, the multiple storage controllers managing a plurality of primary storage devices that are accessed by virtual machines in the virtualization environment to store data; and performing de-duplication on data in the virtualization environment such that an instruction issued by a virtual machine to store a data item to a virtual disk does not result in the data item being duplicatively stored if the data item is already stored in a global storage pool having a local storage device on a first node of the plurality of nodes and a local storage device on a second node of the plurality of nodes, the second node being different from the first node, wherein the virtual disk is exposed to the virtual machines as a set of addressable storage units corresponding to a plurality of virtual disks structured from the global storage pool and data within the set of addressable storage units is organized as extents, and the extents are grouped within extent groups, and de-duplication is enabled on an extent group basis. - View Dependent Claims (14, 15, 16, 17)
-
Specification