Virtual machine image co-migration
First Claim
Patent Images
1. A method comprising:
- two or more data sets operating in a computer system, with one data set assigned to a single node;
pairing each node in the computer system with a specific pair node;
profiling each data set, including dividing each data set into two or more data chunks and applying a hash value to each data chunk;
assigning a hash value range to each node, and storing hash values within the assigned hash value range in memory local to the assigned node;
eliminating duplicate data chunks associated with the two or more data sets, including comparing hash values of each of two or more data sets, and for each duplicate hash value retaining a single copy of the duplicate hash value;
virtual machine co-migration, including;
profiling the data sets to be migrated, wherein the data sets are virtual machine images, including identifying data chunks required to support a boot process of a virtual machine image and prioritizing migration of the identified data chunks; and
migrating the non-duplicate data chunks of the data sets from a selected node to a corresponding pair node including eliminating duplicate data chunks between the selected node and the corresponding pair node, wherein the migration of the non-duplicate data chunks is limited to migration between paired nodes.
2 Assignments
0 Petitions
Accused Products
Abstract
Embodiments of the invention relate to co-migration in a shared pool of resources with similarity across data sets of a migrating application. The data sets are processed and profiled. Metadata is reviewed to remove duplicate elements and to distribute the processing load across available nodes. At the same time, a ranking may be assigned to select metadata to support a prioritized migration. Non-duplicate data chunks are migrated across the shared pool of resources responsive to the assigned prioritization.
45 Citations
18 Claims
-
1. A method comprising:
-
two or more data sets operating in a computer system, with one data set assigned to a single node; pairing each node in the computer system with a specific pair node; profiling each data set, including dividing each data set into two or more data chunks and applying a hash value to each data chunk; assigning a hash value range to each node, and storing hash values within the assigned hash value range in memory local to the assigned node; eliminating duplicate data chunks associated with the two or more data sets, including comparing hash values of each of two or more data sets, and for each duplicate hash value retaining a single copy of the duplicate hash value; virtual machine co-migration, including; profiling the data sets to be migrated, wherein the data sets are virtual machine images, including identifying data chunks required to support a boot process of a virtual machine image and prioritizing migration of the identified data chunks; and migrating the non-duplicate data chunks of the data sets from a selected node to a corresponding pair node including eliminating duplicate data chunks between the selected node and the corresponding pair node, wherein the migration of the non-duplicate data chunks is limited to migration between paired nodes. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A system comprising:
-
a processor; a profile manager to profile each data set, the profile manager to divide each data set into at least two data chunks and to apply a hash value to each data chunk; the profile manager to pair each node in the computer system with a pair node; hash manager in communication with the profile manager, the hash manager to assign a hash range to each node and to store hash values within the range in local memory of the assigned node; a comparison manager in communication with the hash manager, the comparison manager to compare hash values of each of the data sets, and to retain a single copy of an identified duplicate hash value, with each node to retain non-duplicate data chunks of the two or more data sets within the hash value range assigned to the node; virtual machine co-migration, including; profiling the data sets to be migrated, wherein the data sets are virtual machine images, including identifying data chunks required to support a boot process of a virtual machine image and prioritizing migration of the identified data chunks; and a migration manager in communication with the comparison manager, the migration manager to migrate the non-duplicate data chunks of the data sets from a selected node to a corresponding pair node including eliminating duplicate data chunks between the selected node and the corresponding pair node, wherein the migration of the non-duplicate data chunks is limited to migration between paired nodes. - View Dependent Claims (8, 9, 10, 11)
-
-
12. A computer program product, the computer program product comprising a non-transitory computer readable storage medium having computer readable program code embodied therewith, the computer readable program code comprising:
-
computer readable program code configured to pair each source node with a pair node; computer readable program code configured to profile two or more data sets operating in a computer system, with one virtual machine image assigned to a single node; computer readable program code configured to divide each data set into two or more data chunks and applying a hash value to each data chunk; computer readable program code configured to assign a hash value range to each source and destination node, and storing hash values within the hash value range in memory local to the assigned node; computer readable program code configured to eliminate duplicate data chunks associated with the data sets, including comparison of hash values of each of two or more data sets, and for each duplicate hash value retaining a single copy of the duplicate hash value; virtual machine co-migration, including; profiling the data sets to be migrated, wherein the data sets are virtual machine images, including identifying data chunks required to support a boot process of a virtual machine image and prioritizing migration of the identified data chunks; and computer readable program code configured to migrate the non-duplicate data chunks from a selected source node to a corresponding pair node including eliminating duplicate data chunks between the selected source node and the corresponding pair node, wherein the migration of the non-duplicate data chunks is limited to migration between paired nodes. - View Dependent Claims (13, 14, 15, 16)
-
-
17. A method comprising:
-
in a computer system with a shared pool of resource, two or more data sets having one data set assigned to a single node; pairing each node in the computer system with a pair node; profiling each data set, including dividing each data set into two or more data chunks, applying a hash value to each data chunk, and prioritizing the data chunks; assigning a hash value range to each node, and storing hash values within the hash value range of a data set in memory local to the assigned node; eliminating duplicate data chunks in the system; virtual machine co-migration, including; profiling the data sets to be migrated, wherein the data sets are virtual machine images, including identifying data chunks required to support a boot process of a virtual machine image and prioritizing migration of the identified data chunks; and migrating the non-duplicate data chunks from a selected node to a corresponding paired node responsive to the prioritization, including eliminating duplicate data chunks between the selected node and the corresponding pair node, wherein the migration of the non-duplicate data chunks is limited to migration between paired nodes.
-
-
18. A method comprising:
-
two or more data sets operating in a computer system, with one data set assigned to a single node; profiling each data set, wherein the data sets are virtual machine images, the profiling including dividing each data set into two or more data chunks, applying a hash value to each data chunk, identifying data chunks required to support a boot process of a virtual machine image, and prioritizing migration of the identified data chunks; assigning a hash value range to each node, and storing hash values within the assigned hash value range in memory local to the assigned node; eliminating duplicate data chunks associated with the two or more data sets, including comparing hash values of each of two or more data sets, and for each duplicate hash value retaining a single copy of the duplicate hash value, with each node retaining non-duplicate data chunks within the hash value range assigned to the node; virtual machine co-migration, including; profiling the data sets to be migrated, wherein the data sets are virtual machine images, including identifying data chunks required to support a boot process of a virtual machine image and prioritizing migration of the identified data chunks; and migrating the non-duplicate data chunks of the data sets, the migration including booting a virtual machine image at a destination node during migration, including intercepting an I/O request of the destination node, and if the I/O requests access of a non-migrated data chunk, migrating the non-migrated data chunk at a high priority form a source node.
-
Specification