System and method for optimizing federated and ETL'd databases with considerations of specialized data structures within an environment having multidimensional constraints
First Claim
Patent Images
1. A computer-implemented method for creating a data warehousing scheme having optimally selected components, the computer-implemented method comprising:
- inputting into an optimization engine a mathematical model of a goal for the data warehousing scheme;
inputting into the optimization engine at least one constraint on the data warehousing scheme;
performing a mathematical optimization algorithm using the optimization engine, wherein an output of the optimization engine is an optimized data warehousing scheme having optimally selected components;
storing the optimized data warehousing scheme, wherein the data warehousing scheme is part of a major information technology project, and wherein the optimally selected components comprise a subset of optimally selected sub-projects of the major information technology project; and
creating an optimized project having the optimally selected sub-projects, wherein creating the optimized project comprises;
determining value clusters, wherein value clusters are the discrete intersection of selected data and an ability of the selected data to add value to an organization;
organizing data objects into data models of future data structures;
mapping the data models of future data structures to existing data sources;
determining additional processes, wherein the additional processes move the selected data from a source to a target;
creating an affinity matrix based on the value clusters, wherein the affinity matrix is a matrix, wherein the matrix indicates a relationship between groups of data sources and groups of output objects, and indicates a relationship between groups of available logical processes and groups of output objects;
inputting into an optimization engine boundary conditions, the existing data sources, the data objects, logical processes used to create output objects, the value clusters, the data models of future data structures, the mapping of the data models of future data structures to the existing data sources, the additional processes for moving data from the source to the target, formats of data to be used in the optimized project, and the affinity matrix; and
storing the optimized project in a storage device, wherein the optimized project is derived from an optimization process using the optimization engine.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for creating a data warehousing scheme having optimally selected components. A mathematical model of a goal for the data warehousing scheme is input into an optimization engine. At least one constraint on the data warehousing scheme is input into the optimization engine. A mathematical optimization algorithm is performed using the optimization engine, wherein an output of the optimization engine is an optimized data warehousing scheme having optimally selected components. The optimized data warehousing scheme can be stored.
-
Citations
18 Claims
-
1. A computer-implemented method for creating a data warehousing scheme having optimally selected components, the computer-implemented method comprising:
-
inputting into an optimization engine a mathematical model of a goal for the data warehousing scheme; inputting into the optimization engine at least one constraint on the data warehousing scheme; performing a mathematical optimization algorithm using the optimization engine, wherein an output of the optimization engine is an optimized data warehousing scheme having optimally selected components; storing the optimized data warehousing scheme, wherein the data warehousing scheme is part of a major information technology project, and wherein the optimally selected components comprise a subset of optimally selected sub-projects of the major information technology project; and creating an optimized project having the optimally selected sub-projects, wherein creating the optimized project comprises; determining value clusters, wherein value clusters are the discrete intersection of selected data and an ability of the selected data to add value to an organization; organizing data objects into data models of future data structures; mapping the data models of future data structures to existing data sources; determining additional processes, wherein the additional processes move the selected data from a source to a target; creating an affinity matrix based on the value clusters, wherein the affinity matrix is a matrix, wherein the matrix indicates a relationship between groups of data sources and groups of output objects, and indicates a relationship between groups of available logical processes and groups of output objects; inputting into an optimization engine boundary conditions, the existing data sources, the data objects, logical processes used to create output objects, the value clusters, the data models of future data structures, the mapping of the data models of future data structures to the existing data sources, the additional processes for moving data from the source to the target, formats of data to be used in the optimized project, and the affinity matrix; and storing the optimized project in a storage device, wherein the optimized project is derived from an optimization process using the optimization engine. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A computer program product comprising:
-
a computer usable storage medium having computer usable program code for creating a data warehousing scheme having optimally selected components, the computer program product including; computer usable program code for inputting into an optimization engine a mathematical model of a goal for the data warehousing scheme; computer usable program code for inputting into the optimization engine at least one constraint on the data warehousing scheme; computer usable program code for performing a mathematical optimization algorithm using the optimization engine, wherein an output of the optimization engine is an optimized data warehousing scheme having optimally selected components; computer usable program code for storing the optimized data warehousing scheme, wherein the data warehousing scheme is part of a major information technology project, and wherein the optimally selected components comprise a subset of optimally selected sub-projects of the major information technology project; and computer usable program code for creating an optimized project having the optimally selected sub-projects, wherein the computer usable program code for creating the optimized project comprises; computer usable program code for determining value clusters, wherein value clusters are the discrete intersection of selected data and an ability of the selected data to add value to an organization; computer usable program code for organizing data objects into data models of future data structures; computer usable program code for mapping the data models of future data structures to existing data sources; computer usable program code for determining additional processes, wherein the additional processes move the selected data from a source to a target; computer usable program code for creating an affinity matrix based on the value clusters, wherein the affinity matrix is a matrix, wherein the matrix indicates a relationship between groups of data sources and groups of output objects, and indicates a relationship between groups of available logical processes and groups of output objects; computer usable program code for inputting into an optimization engine boundary conditions, the existing data sources, the data objects, logical processes used to create output objects, the value clusters, the data models of future data structures, the mapping of the data models of future data structures to the existing data sources, the additional processes for moving data from the source to the target, formats of data to be used in the optimized project, and the affinity matrix; and computer usable program code for storing the optimized project in a storage device, wherein the optimized project is derived from an optimization process using the optimization engine. - View Dependent Claims (14, 15)
-
-
16. A data processing system comprising:
-
a bus; at least one processor coupled to the bus; a computer usable medium coupled to the bus, wherein the computer usable medium contains a set of instructions for creating a data warehousing scheme having optimally selected components, wherein the at least one processor is adapted to carry out the set of instructions to; input into an optimization engine a mathematical model of a goal for the data warehousing scheme; input into the optimization engine at least one constraint on the data warehousing scheme; perform a mathematical optimization algorithm using the optimization engine, wherein an output of the optimization engine is an optimized data warehousing scheme having optimally selected components; store the optimized data warehousing scheme, wherein the data warehousing scheme is part of a major information technology project, and wherein the optimally selected components comprise a subset of optimally selected sub-projects of the major information technology project; and create an optimized project having the optimally selected sub-projects, wherein the at least one processor adapted to carry out the set of instructions to create the optimized project comprises the at least one processor adapted to carry out the set of instructions to; determine value clusters, wherein value clusters are the discrete intersection of selected data and an ability of the selected data to add value to an organization; organize data objects into data models of future data structures; map the data models of future data structures to existing data sources; determine additional processes, wherein the additional processes move the selected data from a source to a target; create an affinity matrix based on the value clusters, wherein the affinity matrix is a matrix, wherein the matrix indicates a relationship between groups of data sources and groups of output objects, and indicates a relationship between groups of available logical processes and groups of output objects; input into an optimization engine boundary conditions, the existing data sources, the data objects, logical processes used to create output objects, the value clusters, the data models of future data structures, the mapping of the data models of future data structures to the existing data sources, the additional processes for moving data from the source to the target, formats of data to be used in the optimized project, and the affinity matrix; and store the optimized project in a storage device, wherein the optimized project is derived from an optimization process using the optimization engine. - View Dependent Claims (17, 18)
-
Specification