Defining a data analysis process
First Claim
1. A computer program product tangibly embodied in a storage medium, the computer program product including instructions that when executed generate a graphical user interface on a display device for using a computer to display and modify a data analysis process, the graphical user interface comprising:
- a process list display configured to;
display identifications of data analysis processes, andreceive user input selecting an entry of an identification of a data analysis process; and
a data analysis display configured to;
display representations of sub-processes included in the data analysis process identified by the selected entry, the displayed representations of sub-processes including;
a representation of a data mining sub-process for creating a data attribute by performing an analytical process on data from an analytical processing data source,a representation of at least one of (1) an extraction sub-process for extracting data from a first transactional data source, (2) a transformation sub-process for transforming the extracted data from a data format used by the first transactional data source to a data format used for analytical processing, and (3) a loading sub-process for loading data into the analytical processing data source, anda representation of a deployment sub-process for storing the created data attribute in one of the first transactional data source, a second transactional data source other than the first transactional data source, or a second analytical data source used for analytical processing, anddisplay connections between the displayed sub-processes, the connections indicating a sequence with which the displayed sub-processes are performed when performing the data analysis process.
3 Assignments
0 Petitions
Accused Products
Abstract
A data analysis workbench enables a user to define a data analysis process that includes an extract sub-process to obtain transactional data from a source system, a load sub-process for providing the extracted data to a data warehouse or data mart, a data mining analysis sub-process to use the obtained transactional data, and a deployment sub-process to make the data mining results accessible by another computer program. Common settings used by each of the sub-processes are defined, as are specialized settings relevant to each of the sub-processes. The invention also enables a user to define an order in which the defined sub-processes are to be executed.
87 Citations
35 Claims
-
1. A computer program product tangibly embodied in a storage medium, the computer program product including instructions that when executed generate a graphical user interface on a display device for using a computer to display and modify a data analysis process, the graphical user interface comprising:
-
a process list display configured to; display identifications of data analysis processes, and receive user input selecting an entry of an identification of a data analysis process; and a data analysis display configured to; display representations of sub-processes included in the data analysis process identified by the selected entry, the displayed representations of sub-processes including; a representation of a data mining sub-process for creating a data attribute by performing an analytical process on data from an analytical processing data source, a representation of at least one of (1) an extraction sub-process for extracting data from a first transactional data source, (2) a transformation sub-process for transforming the extracted data from a data format used by the first transactional data source to a data format used for analytical processing, and (3) a loading sub-process for loading data into the analytical processing data source, and a representation of a deployment sub-process for storing the created data attribute in one of the first transactional data source, a second transactional data source other than the first transactional data source, or a second analytical data source used for analytical processing, and display connections between the displayed sub-processes, the connections indicating a sequence with which the displayed sub-processes are performed when performing the data analysis process. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A computer program product tangibly embodied in a storage medium, the computer program product including instructions that when executed generate a graphical user interface on a display device for using a computer to define a data analysis process, the graphical user interface comprising:
-
a sub-processes display configured to; receive user input indicating an entry of an identification at least one of (1) an extraction sub-process for extracting data from a data source, (2) a transformation sub-process for transforming the extracted data from a data format used by the data source to a data format used for analytical processing, (3) a loading sub-process for loading data into a data source that is used for analytical processing, (4) a data mining sub-process for creating a data attribute by performing an analytical process on data from the analytical processing data source, and (5) a deployment sub-process for storing a data attribute created in another sub-process, and receive user input indicating an entry identifying a computer program to be associated with each of the identified sub-processes such that execution of the computer program causes the identified sub-process to be performed; and a common data display configured to receive user input indicating an entry of selected meta-data elements to be used in the data analysis process wherein each meta-data element is associated with a corresponding data element in the data source and with a corresponding data element in the analytical processing data source. - View Dependent Claims (7, 8, 9, 10, 11, 12)
-
-
13. A computer-implemented method for receiving information from a user for use in a data analysis process, the method comprising:
-
receiving user input identifying a data analysis process; receiving multiple sub-process user inputs, each sub-process user input identifying a sub-process associated with the data analysis process, wherein; at least one of the identified sub-processes is (1) an extraction sub-process for extracting data from a first transactional data source, (2) a transformation sub-process for transforming data extracted from the first transactional data source from a data format used by the first transactional data source to a data format used for analytical processing, (3) a loading sub-process for loading data into an analytical processing data source that is used for analytical processing, or (4) a data mining sub-process for creating a data attribute by performing an analytical process on data from the analytical processing data source, and at least one of the identified sub-processes is a deployment sub-process for storing a data attribute created in another of the identified sub-processes; and storing the input identifying the data analysis process in association with the inputs identifying the multiple sub-processes for use in the data analysis process, wherein the deployment sub-process stores the created data attribute in one of the first transactional data source, a second transactional data source other than the first transactional data source, or a second analytical data source used for analytical processing. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24)
-
-
25. A computer program product tangibly embodied in a storage medium, the computer program product including instructions that, when executed, receive information from a user for use in a data analysis process, and the computer program product being configured to
receive user input identifying a data analysis process; -
receive multiple sub-process user inputs, each sub-process user input identifying a sub-process associated with the data analysis process, wherein; at least one of the identified sub-processes is (1) an extraction sub-process for extracting data from a first transactional data source, (2) a transformation sub-process for transforming data extracted from the first transactional data source from a data format used by the first transactional data source to a data format used for analytical processing, (3) a loading sub-process for loading data into an analytical processing data source that is used for analytical processing, or (4) a data mining sub-process for creating a data attribute by performing an analytical process on data from the analytical processing data source, and at least one of the identified sub-processes is a deployment sub-process for storing a data attribute created in another of the identified sub-processes; and store the input identifying the data analysis process in association with the inputs identifying the multiple sub-processes for use in the data analysis process, wherein the deployment sub-process stores the created data attribute in one of the first transactional data source, a second transactional data source other than the first transactional data source, or a second analytical data source used for analytical processing. - View Dependent Claims (26, 27, 28)
-
-
29. A system for receiving information from a user for use in a data analysis the system comprising a processor connected to a storage device and one or more input/output devices, wherein the processor is configured to:
-
receive user input identifying a data analysis process; receive multiple sub-process user inputs, each sub-process user input identifying a sub-process associated with the data analysis process, wherein; at least one of the identified sub-processes is (1) an extraction sub-process for extracting data from a first transactional data source, (2) a transformation sub-process for transforming data extracted from the first transactional data source from a data format used by the first transactional data source to a data format used for analytical processing, (3) a loading sub-process for loading data into an analytical processing data source that is used for analytical processing, or (4) a data mining sub-process for creating a data attribute by performing an analytical process on data from the analytical processing data source, and at least one of the identified sub-processes is a deployment sub-process for storing a data attribute created in another of the identified sub-processes; and store the input identifying the data analysis process in association with the inputs identifying the multiple sub-processes for use in the data analysis process, wherein the deployment sub-process stores the created data attribute in one of the first transactional data source, a second transactional data source other than the first transactional data source, or a second analytical data source used for analytical processing.
-
-
30. A computer program product tangibly embodied in a storage medium, the computer program product including instructions that when executed generate a graphical user interface on a display device for using a computer to display and modify a data analysis process, the graphical user interface comprising:
-
a first graphical icon representing an extraction sub-process for extracting data from a first transactional data source; a second graphical icon representing a loading sub-process for loading data into an analytical processing data source; a third graphical icon representing a data mining sub-process for creating a data attribute by performing an analytical process on data from the analytical processing data source; a fourth graphical icon representing a deployment sub-process for storing the created data attribute; and graphical connections between the displayed graphical icons, the graphical connections indicating a sequence with which the sub-processes represented by the displayed graphical icons are performed, wherein information representing the sequence with which the sub-processes represented by the displayed graphical icons are performed is stored in a storage medium for later access and execution of the sub-processes in the represented sequence. - View Dependent Claims (31, 32, 33, 34, 35)
-
Specification