Data mining framework using a signature associated with an algorithm
Abstract
A framework is provided into which data mining algorithms can be plugged without any change to the algorithms' software implementations, while still providing all the standard data mining tasks. The framework may be implemented by the data source provider. It also allows data storage and algorithms to be completely separated. When the user initiates a mining session and picks an algorithm for a build task, or a model for an apply or test task, the framework may become responsible for preparing a set of “prompts” asking the user to provide some expression that is specific to the particular kind of data the user is working with.
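The prompt-preparation step described in the abstract can be illustrated with a minimal sketch. This is not the patent's implementation: the names `ParameterSpec` and `build_prompts`, and the clustering parameters used as sample data, are hypothetical and chosen only for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ParameterSpec:
    """One task parameter as a signature might describe it (hypothetical type)."""
    name: str
    type_name: str
    info: str  # meaning and/or recommended usage of the parameter


def build_prompts(task: str, params: list[ParameterSpec]) -> list[str]:
    """Turn each parameter description into a prompt shown to the user."""
    return [f"[{task}] {p.name} ({p.type_name}): {p.info}" for p in params]


# Hypothetical signature entries for the build task of a clustering algorithm.
kmeans_build = [
    ParameterSpec("k", "int", "number of clusters to form"),
    ParameterSpec("features", "expression", "expression selecting the input data"),
]

prompts = build_prompts("build", kmeans_build)
for line in prompts:
    print(line)
```

Because the prompts are generated from the parameter descriptions alone, the algorithm code itself never needs to know how the user's data is stored, which is the separation the abstract refers to.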
31 Claims
1. A method for data mining using an algorithm, the algorithm having a build task, a test task, and an apply task, each task having a number of parameters, each parameter having a type, the method comprising:
retrieving a signature associated with the algorithm, said signature including, for the build task, the number of parameters and the type of each parameter associated with said task, as well as an information field for each parameter associated with said task, said information field indicating the meaning and/or recommended usage of said parameter, said signature also including, for the build task, one or more coefficients for the algorithm;
creating a template for the build task based on said signature, said template indicating one or more of said parameters that need to be initialized by a user to invoke said task and one or more model values that are to be derived from a data set; and
executing said template to create a mapping between said one or more coefficients and said one or more model values.
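The three steps of claim 1 (retrieve a signature, create a template, execute the template) can be sketched as follows. This is one possible reading of the claim, not the patented implementation: the `Signature` and `Template` classes, the function names, and the choice of a least-squares line fit as the algorithm are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Signature:
    """Hypothetical signature for an algorithm's build task."""
    params: dict         # parameter name -> (type name, information field)
    coefficients: tuple  # coefficient names the algorithm declares


@dataclass(frozen=True)
class Template:
    """Hypothetical build-task template derived from a signature."""
    user_params: tuple   # parameters the user must initialize
    model_values: tuple  # values to be derived from the data set


def retrieve_signature() -> Signature:
    # Stand-in for a lookup; describes a line fit y = slope * x + intercept.
    return Signature(
        params={
            "x_column": ("str", "name of the input column"),
            "y_column": ("str", "name of the target column"),
        },
        coefficients=("slope", "intercept"),
    )


def create_template(sig: Signature) -> Template:
    return Template(user_params=tuple(sig.params), model_values=sig.coefficients)


def execute_template(tpl: Template, user_values: dict, rows: list) -> dict:
    """Derive the model values from the data and map each coefficient to one."""
    missing = [p for p in tpl.user_params if p not in user_values]
    if missing:
        raise ValueError(f"uninitialized parameters: {missing}")
    xs = [r[user_values["x_column"]] for r in rows]
    ys = [r[user_values["y_column"]] for r in rows]
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    # Ordinary least-squares fit of a straight line.
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum(
        (x - mx) ** 2 for x in xs
    )
    derived = {"slope": slope, "intercept": my - slope * mx}
    return {name: derived[name] for name in tpl.model_values}


rows = [{"x": 0.0, "y": 1.0}, {"x": 1.0, "y": 3.0}, {"x": 2.0, "y": 5.0}]
tpl = create_template(retrieve_signature())
mapping = execute_template(tpl, {"x_column": "x", "y_column": "y"}, rows)
print(mapping)  # {'slope': 2.0, 'intercept': 1.0}
```

In this sketch the template records only names, so the same `create_template` function would work for any algorithm whose signature lists its parameters and coefficients.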
12. A method for data mining using an algorithm, the algorithm having a build task, a test task, and an apply task, each task having a number of parameters, each parameter having a type, the method comprising:
retrieving a signature associated with the algorithm, said signature including, for the build task, the number of parameters and the type of each parameter associated with said task, as well as an information field for each parameter associated with said task, said information field indicating the meaning and/or recommended usage of said parameter, said signature also including, for the build task, one or more coefficients for the algorithm;
creating a template for the build task based on said signature, said template indicating one or more of said parameters that need to be initialized by a user to invoke said task and one or more model values that are to be derived from a data set; and
executing said template to create a mapping between said one or more coefficients and said one or more model values, said execution generating a set of prompts asking said user to provide some expression specific to a data source said user is working with.
13. An apparatus for data mining using an algorithm, the algorithm having a build task, a test task, and an apply task, each task having a number of parameters, each parameter having a type, the apparatus comprising:
a signature information field receiver configured to retrieve a signature associated with the algorithm, said signature including, for the build task, the number of parameters and the type of each parameter associated with said task, as well as an information field for each parameter associated with said task, said information field indicating the meaning and/or recommended usage of said parameter, said signature also including, for the build task, one or more coefficients for the algorithm;
a task template creator coupled to said signature information field receiver, wherein the task template creator is configured to create a template for the build task based on said signature, said template indicating one or more of said parameters that need to be initialized by a user to invoke said task and one or more model values that are to be derived from a data set; and
an execution module coupled to the task template creator, wherein the execution module is configured to execute said template to create a mapping between said one or more coefficients and said one or more model values.
18. An apparatus for data mining using an algorithm, the algorithm having a build task, a test task, and an apply task, each task having a number of parameters, each parameter having a type, the apparatus comprising:
means for retrieving a signature associated with the algorithm, said signature including, for the build task, the number of parameters and the type of each parameter associated with said task, as well as an information field for each parameter associated with said task, said information field indicating the meaning and/or recommended usage of said parameter, said signature also including, for the build task, one or more coefficients for the algorithm;
means for creating a template for the build task based on said signature, said template indicating one or more of said parameters that need to be initialized by a user to invoke said task and one or more model values that are to be derived from a data set; and
means for executing said template to create a mapping between said one or more coefficients and said one or more model values.
29. An apparatus for data mining using an algorithm, the algorithm having a build task, a test task, and an apply task, each task having a number of parameters, each parameter having a type, the apparatus comprising:
means for retrieving a signature associated with the algorithm, said signature including, for the build task, the number of parameters and the type of each parameter associated with said task, as well as an information field for each parameter associated with said task, said information field indicating the meaning and/or recommended usage of said parameter, said signature also including, for the build task, one or more coefficients for the algorithm;
means for creating a template for the build task based on said signature, said template indicating one or more of said parameters that need to be initialized by a user to invoke said task and one or more model values that are to be derived from a data set; and
means for executing said template to create a mapping between said one or more coefficients and said one or more model values, said execution generating a set of prompts asking said user to provide some expression specific to a data source said user is working with.
30. A program storage device readable by a machine, tangibly embodying a program of instructions executable by the machine to perform a method for data mining using an algorithm, the algorithm having a build task, a test task, and an apply task, each task having a number of parameters, each parameter having a type, the method comprising:
retrieving a signature associated with the algorithm, said signature including, for the build task, the number of parameters and the type of each parameter associated with said task, as well as an information field for each parameter associated with said task, said information field indicating the meaning and/or recommended usage of said parameter, said signature also including, for the build task, one or more coefficients for the algorithm;
creating a template for the build task based on said signature, said template indicating one or more of said parameters that need to be initialized by a user to invoke said task and one or more model values that are to be derived from a data set; and
executing said template to create a mapping between said one or more coefficients and said one or more model values.
31. A program storage device readable by a machine, tangibly embodying a program of instructions executable by the machine to perform a method for data mining using an algorithm, the algorithm having a build task, a test task, and an apply task, each task having a number of parameters, each parameter having a type, the method comprising:
retrieving a signature associated with the algorithm, said signature including, for the build task, the number of parameters and the type of each parameter associated with said task, as well as an information field for each parameter associated with said task, said information field indicating the meaning and/or recommended usage of said parameter, said signature also including, for the build task, one or more coefficients for the algorithm;
creating a template for the build task based on said signature, said template indicating one or more of said parameters that need to be initialized by a user to invoke said task and one or more model values that are to be derived from a data set; and
executing said template to create a mapping between said one or more coefficients and said one or more model values, said execution generating a set of prompts asking said user to provide some expression specific to a data source said user is working with.
Specification