Defining a data analysis process
First Claim
1. A computer program product tangibly embodied in an information carrier, the computer program product including instructions that when executed generate a graphical user interface on a display device for using a computer to display and modify a data analysis process, the graphical user interface comprising:
- a process list display for;
displaying identifications of data analysis processes, and receiving an entry of an identification of a data analysis process; and
a data analysis display for;
displaying a representation of each sub-process included in the data analysis process identified by the received entry, and displaying a connection between each displayed sub-process, wherein the data analysis display is operable to display;
a data mining sub-process for creating a data attribute by performing an analytical process on data from an analytical processing data source, and one or more of sub-processes of (1) an extraction sub-process for extracting data from a data source, (2) a transformation sub-process for transforming the extracted data from a data format used by the data source to a data format used for analytical processing, (3) a loading sub-process for loading data into the data source used for analytical processing, and (4) a deployment sub-process for storing the created data attribute.
3 Assignments
0 Petitions
Accused Products
Abstract
A data analysis workbench enables a user to define a data analysis process that includes an extract sub-process to obtain transactional data from a source system, a load sub-process for providing the extracted data to a data warehouse or data mart, a data mining analysis sub-process to use the obtained transactional data, and a deployment sub-process to make the data mining results accessible by another computer program. Common settings used by each of the sub-processes are defined, as are specialized settings relevant to each of the sub-processes. The invention also enables a user to define an order in which the defined sub-processes are to be executed.
98 Citations
34 Claims
-
1. A computer program product tangibly embodied in an information carrier, the computer program product including instructions that when executed generate a graphical user interface on a display device for using a computer to display and modify a data analysis process, the graphical user interface comprising:
a process list display for;
displaying identifications of data analysis processes, and receiving an entry of an identification of a data analysis process; and
a data analysis display for;
displaying a representation of each sub-process included in the data analysis process identified by the received entry, and displaying a connection between each displayed sub-process, wherein the data analysis display is operable to display;
a data mining sub-process for creating a data attribute by performing an analytical process on data from an analytical processing data source, and one or more of sub-processes of (1) an extraction sub-process for extracting data from a data source, (2) a transformation sub-process for transforming the extracted data from a data format used by the data source to a data format used for analytical processing, (3) a loading sub-process for loading data into the data source used for analytical processing, and (4) a deployment sub-process for storing the created data attribute. - View Dependent Claims (2, 3, 4, 5, 6)
-
7. A computer program product tangibly embodied in an information carrier, the computer program product including instructions that when executed generate a graphical user interface on a display device for using a computer to define a data analysis process, the graphical user interface comprising:
-
a sub-processes display for;
receiving an entry of an identification of which of sub-processes of (1) an extraction sub-process for extracting data from a data source, (2) a transformation sub-process for transforming the extracted data from a data format used by the data source to a data format used for analytical processing, (3) a loading sub-process for loading data into a data source that is used for analytical processing, (4) a data mining sub-process for creating a data attribute by performing an analytical process on data from the analytical processing data source, and (5) an deployment sub-process for storing the created data attribute, and receiving an entry identifying a computer program to be associated with each of the identified sub-processes such that the execution of the computer program causes the identified sub-process to be performed; and
a common data display for receiving an entry of selected meta-data elements to be used in the data analysis process wherein each meta-data element is associated with a corresponding data element in the data source and with a corresponding data element in the analytical processing data source. - View Dependent Claims (8, 9, 10, 11, 12, 13)
-
-
14. A computer-implemented method for receiving information from a user for use in a data analysis process, the method comprising:
-
receiving an input identifying a data analysis process;
receiving sub-process inputs, each sub-process input identifying a sub-process associated with the data analysis process, wherein;
at least one of the identified sub-processes is (1) an extraction sub-process for extracting data from a transactional data source, (2) a transformation sub-process for transforming data extracted from the transactional data source from a data format used by the transactional data source to a data format used for analytical processing, (3) a loading sub-process for loading data into an analytical data source that is used for analytical processing, or (4) a data mining sub-process for creating a data attribute by performing an analytical process on data from the analytical processing data source, and at least one of the identified sub-processes is a deployment sub-process for storing a data attribute created in another of the identified sub-processes; and
storing the input identifying the data analysis process in association with the inputs identifying the multiple sub-processes for use in the data analysis process. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27)
-
-
28. A computer program product tangibly embodied in an information carrier, the computer program product including instructions that, when executed, receive information from a user for use in a data analysis process, and computer program product being configured to
receive an input identifying a data analysis process; -
receive sub-process inputs, each sub-process input identifying a sub-process associated with the data analysis process, wherein;
at least one of the identified sub-processes is (1) an extraction sub-process for extracting data from a transactional data source, (2) a transformation sub-process for transforming data extracted from the transactional data source from a data format used by the transactional data source to a data format used for analytical processing, (3) a loading sub-process for loading data into an analytical data source that is used for analytical processing, or (4) a data mining sub-process for creating a data attribute by performing an analytical process on data from the analytical processing data source, and at least one of the identified sub-processes is a deployment sub-process for storing a data attribute created in another of the identified sub-processes; and
store the input identifying the data analysis process in association with the inputs identifying the multiple sub-processes for use in the data analysis process. - View Dependent Claims (29, 30, 31, 32, 33)
-
-
34. A system for receiving information from a user for use in a data analysis the system comprising a processor connected to a storage device and one or more input/output devices, wherein the processor is configured to:
-
receive an input identifying a data analysis process;
receive sub-process inputs, each sub-process input identifying a sub-process associated with the data analysis process, wherein;
at least one of the identified sub-processes is (1) an extraction sub-process for extracting data from a transactional data source, (2) a transformation sub-process for transforming data extracted from the transactional data source from a data format used by the transactional data source to a data format used for analytical processing, (3) a loading sub-process for loading data into an analytical data source that is used for analytical processing, or (4) a data mining sub-process for creating a data attribute by performing an analytical process on data from the analytical processing data source, and at least one of the identified sub-processes is a deployment sub-process for storing a data attribute created in another of the identified sub-processes; and
store the input identifying the data analysis process in association with the inputs identifying the multiple sub-processes for use in the data analysis process.
-
Specification