Workload discovery using real-time analysis of input streams
First Claim
Patent Images
1. A method, comprising:
- identifying, with a processor of a computer, a single workload for analysis;
storing, with the processor of the computer, changes to data objects in a source copy made by change operations that are in a replication change stream into a recovery log;
identifying, with the processor of the computer, associations between the data objects and a plurality of applications based on usage and access patterns in one of the recovery log and the replication change stream, wherein the associations comprise any combination of;
a) a set of the data objects that are updated by a particular application of the plurality of applications;
b) a set of the data objects that are updated by a given set of applications of the plurality of applications;
c) patterns of data access of the data objects;
d) patterns of shared data access of the data objects; and
e) a set of the data objects that are referenced together in transactions; and
using, with the processor of the computer, the associations to identify change operations that form consistency groups for replication from the source copy to a target copy, and wherein the consistency groups comprise sub-workloads; and
executing, with the processor of the computer, the change operations of the sub-workloads to perform the replication for the single workload, wherein the change operations within each of the sub-workloads are executed such that the target copy is a replica of the source copy at a given point in time, and wherein the sub-workloads provide increased granularity.
1 Assignment
0 Petitions
Accused Products
Abstract
Provided are techniques for workload discovery using real-time analysis of input streams. For a meta workload, changes to data objects made by change operations that are in a replication change stream are stored into a recovery log. Using an analytics engine, one of the recovery log and the replication change stream are analyzed to identify associations between the data objects based on usage and access patterns. The associations are used to identify sub-workloads of the meta workload that form consistency groups for replication.
-
Citations
17 Claims
-
1. A method, comprising:
-
identifying, with a processor of a computer, a single workload for analysis; storing, with the processor of the computer, changes to data objects in a source copy made by change operations that are in a replication change stream into a recovery log; identifying, with the processor of the computer, associations between the data objects and a plurality of applications based on usage and access patterns in one of the recovery log and the replication change stream, wherein the associations comprise any combination of; a) a set of the data objects that are updated by a particular application of the plurality of applications; b) a set of the data objects that are updated by a given set of applications of the plurality of applications; c) patterns of data access of the data objects; d) patterns of shared data access of the data objects; and e) a set of the data objects that are referenced together in transactions; and using, with the processor of the computer, the associations to identify change operations that form consistency groups for replication from the source copy to a target copy, and wherein the consistency groups comprise sub-workloads; and executing, with the processor of the computer, the change operations of the sub-workloads to perform the replication for the single workload, wherein the change operations within each of the sub-workloads are executed such that the target copy is a replica of the source copy at a given point in time, and wherein the sub-workloads provide increased granularity. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer program product, the computer program product comprising a non-transitory computer readable storage medium having program code embodied therewith, the program code executable by at least one processor to perform:
-
identifying a single workload for analysis; storing changes to data objects in a source copy made by change operations that are in a replication change stream into a recovery log; identifying associations between the data objects and a plurality of applications based on usage and access patterns in one of the recovery log and the replication change stream, wherein the associations comprise any combination of; a) a set of the data objects that are updated by a particular application of the plurality of applications; b) a set of the data objects that are updated by a given set of applications of the plurality of applications; c) patterns of data access of the data objects; d) patterns of shared data access of the data objects; and e) a set of the data objects that are referenced together in transactions; using the associations to identify change operations that form consistency groups for replication from the source copy to a target copy, and wherein the consistency groups comprise sub-workloads; and executing the change operations of the sub-workloads to perform the replication for the single workload, wherein the change operations within each of the sub-workloads are executed such that the target copy is a replica of the source copy at a given point in time, and wherein the sub-workloads provide increased granularity. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A computer system, comprising:
-
one or more processors, one or more computer-readable memories and one or more computer-readable, tangible storage devices; and program instructions, stored on at least one of the one or more computer-readable, tangible storage devices for execution by at least one of the one or more processors via at least one of the one or more memories, to perform operations, wherein the operations comprise; identifying a single workload for analysis; storing changes to data objects in a source copy made by change operations that are in a replication change stream into a recovery log; identifying associations between the data objects and a plurality of applications based on usage and access patterns in one of the recovery log and the replication change stream, wherein the associations comprise any combination of; a) a set of the data objects that are updated by a particular application of the plurality of applications; b) a set of the data objects that are updated by a given set of applications of the plurality of applications; c) patterns of data access of the data objects; d) patterns of shared data access of the data objects; and e) a set of the data objects that are referenced together in transactions; using the associations to identify change operations that form consistency groups for replication from the source copy to a target copy, and wherein the consistency groups comprise sub-workloads; and executing the change operations of the sub-workloads to perform the replication for the single workload, wherein the change operations within each of the sub-workloads are executed such that the target copy is a replica of the source copy at a given point in time, and wherein the sub-workloads provide increased granularity. - View Dependent Claims (14, 15, 16, 17)
-
Specification