Model-based design for transforming data
First Claim
1. A method for providing a model-based design for transforming data, the method comprising:
- developing, by one or more processors, a modeling framework, wherein the modeling framework includes a plurality of rules for performing transformations of data;
configuring, by one or more processors, a data unit, wherein the data unit is assigned to a job that includes one or more rules of the plurality of rules, and wherein the job transforms the data unit from a data format of a source system into a data format of a destination system;
monitoring, by one or more processors, usage statistics for at least one rule of the job, wherein a plurality of data units are split into a first stream and a second stream for parallel processing, wherein different operations are performed on each stream in parallel, and wherein results of each stream are combined;
determining, by one or more processors, a pattern in at least one data unit of the first stream and at least a second data unit of the second stream;
determining, by one or more processors, an aggregate rule based on a first atomic rule of the plurality of rules and a second atomic rule of the plurality of rules, wherein the first atomic rule and the second atomic rule are identified, in part, by the pattern;
identifying, by one or more processors, a first rule of the job based on failing to meet a predetermined threshold of usage of data units assigned to the job that include the first rule;
removing, by one or more processors, the first rule from memory, based on the first rule of the job failing to meet the predetermined threshold; and
configuring, by one or more processors, a subsequent data unit, wherein;
the subsequent data unit is assigned to a subsequent job that includes the aggregate rule; and
the subsequent job transforms the subsequent data unit from the data format of the source system into the data format of the destination system.
1 Assignment
0 Petitions
Accused Products
Abstract
A model-based approach for transforming data. A modeling framework that includes a plurality of rules for performing transformations of data is developed. A data unit is configured. The data unit is assigned to a job that includes one or more rules of the plurality of rules. The job transforms the data unit from a data format of a source system into a data format of a destination system. Usage statistics for at least one rule of the job are monitored. An aggregate rule is determined based on a first atomic rule and a second atomic rule. The first atomic rule and the second atomic rule are identified by at least one, or in part by a common pattern. A first rule of the job is identified based on the usage statistics of a plurality of data units. The first rule is removed from memory.
-
Citations
20 Claims
-
1. A method for providing a model-based design for transforming data, the method comprising:
-
developing, by one or more processors, a modeling framework, wherein the modeling framework includes a plurality of rules for performing transformations of data; configuring, by one or more processors, a data unit, wherein the data unit is assigned to a job that includes one or more rules of the plurality of rules, and wherein the job transforms the data unit from a data format of a source system into a data format of a destination system; monitoring, by one or more processors, usage statistics for at least one rule of the job, wherein a plurality of data units are split into a first stream and a second stream for parallel processing, wherein different operations are performed on each stream in parallel, and wherein results of each stream are combined; determining, by one or more processors, a pattern in at least one data unit of the first stream and at least a second data unit of the second stream; determining, by one or more processors, an aggregate rule based on a first atomic rule of the plurality of rules and a second atomic rule of the plurality of rules, wherein the first atomic rule and the second atomic rule are identified, in part, by the pattern; identifying, by one or more processors, a first rule of the job based on failing to meet a predetermined threshold of usage of data units assigned to the job that include the first rule; removing, by one or more processors, the first rule from memory, based on the first rule of the job failing to meet the predetermined threshold; and configuring, by one or more processors, a subsequent data unit, wherein; the subsequent data unit is assigned to a subsequent job that includes the aggregate rule; and the subsequent job transforms the subsequent data unit from the data format of the source system into the data format of the destination system. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer program product for providing a model-based design for transforming data, the computer program product comprising:
-
one or more computer readable storage media and program instructions stored on the one or more computer readable storage media, the program instructions comprising; program instructions to develop a modeling framework, wherein the modeling framework includes a plurality of rules for performing transformations of data; program instructions to configure a data unit, wherein the data unit is assigned to a job that includes one or more rules of the plurality of rules, and wherein the job transforms the data unit from a data format of a source system into a data format of a destination system; program instructions to monitor usage statistics for at least one rule of the job, wherein a plurality of data units are split into a first stream and a second stream for parallel processing, wherein different operations are performed on each stream in parallel, and wherein results of each stream are combined; program instructions to determine a pattern in at least one data unit of the first stream and at least a second data unit of the second stream; program instructions to determine an aggregate rule based on a first atomic rule of the plurality of rules and a second atomic rule of the plurality of rules, wherein the first atomic rule and the second atomic rule are identified, in part by, the pattern; program instructions to identify a first rule of the job based on failing to meet a predetermined threshold of usage of data units assigned to the job that include the first rule; program instructions to remove the first rule from memory, based on the first rule of the job failing to meet the predetermined threshold; and program instructions to configure a subsequent data unit, wherein; the subsequent data unit is assigned to a subsequent job that includes the aggregate rule; and the subsequent job transforms the subsequent data unit from the data format of the source system into the data format of the destination system. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A computer system for a model-based approach for transforming data, the computer system comprising:
-
one or more computer processors, one or more computer readable storage media, and program instructions stored on the computer readable storage media for execution by at least one of the one or more processors, the program instructions comprising; program instructions to develop a modeling framework, wherein the modeling framework includes a plurality of rules for performing transformations of data; program instructions to configure a data unit, wherein the data unit is assigned to a job that includes one or more rules of the plurality of rules, and wherein the job transforms the data unit from a data format of a source system into a data format of a destination system; program instructions to monitor usage statistics for at least one rule of the job, wherein a plurality of data units are split into a first stream and a second stream for parallel processing, wherein different operations are performed on each stream in parallel, and wherein results of each stream are combined; program instructions to determine a pattern in at least one data unit of the first stream and at least a second data unit of the second stream; program instructions to determine an aggregate rule based on a first atomic rule of the plurality of rules and a second atomic rule of the plurality of rules, wherein the first atomic rule and the second atomic rule are identified, in part, by the pattern; program instructions to identify a first rule of the job based on failing to meet a predetermined threshold of usage of data units assigned to the job that include the first rule; program instructions to remove the first rule from memory, based on the first rule of the job failing to meet the predetermined threshold; and program instructions to configure a subsequent data unit, wherein; the subsequent data unit is assigned to a subsequent job that includes the aggregate rule; and the subsequent job transforms the subsequent data unit from the data format of the source system into the data format of the destination system. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification