FEATURE PROCESSING RECIPES FOR MACHINE LEARNING
First Claim
1. A system, comprising:
- one or more computing devices configured to;
receive, at a network-accessible machine learning service of a provider network, a text representation of a recipe comprising one or more of;
(a) a group definitions section indicating one or more groups of variables, wherein individual ones of the one or more groups comprise a plurality of variables on which at least one common transformation operation is to be applied, (b) an assignment section defining one or more intermediate variables, (c) a dependency section indicating respective references to one or more machine learning artifacts stored in a repository, or (d) an output section indicating one or more transformation operations to be applied to at least one entity indicated in the group definitions section, the assignment section, or the dependency section;
validate, in accordance with (a) a set of syntax rules defined by the machine learning service and (b) a set of library function definitions for transformation operation types supported by the machine learning service, the text representation of the recipe;
generate an executable representation of the recipe;
store the executable representation in the repository;
determine that the recipe is to be applied to a particular data set;
verify that the particular data set meets a run-time acceptance criterion of the recipe; and
apply, using one or more selected provider network resources, a particular transformation operation of the one or more transformation operations to the particular data set.
1 Assignment
0 Petitions
Accused Products
Abstract
A first representation of a feature processing recipe is received at a machine learning service. The recipe includes a section in which groups of variables on which common transformations are to be applied are defined, and a section in which a set of transformation operations are specified. The first representation of the recipe is validated based at least in part on a library of function definitions supported by the service, and an executable version of the recipe is generated. In response to a determination that the recipe is to be executed on a particular data set, a set of provider network resources is used to implement a transformation operation indicated in the recipe.
-
Citations
27 Claims
-
1. A system, comprising:
one or more computing devices configured to; receive, at a network-accessible machine learning service of a provider network, a text representation of a recipe comprising one or more of;
(a) a group definitions section indicating one or more groups of variables, wherein individual ones of the one or more groups comprise a plurality of variables on which at least one common transformation operation is to be applied, (b) an assignment section defining one or more intermediate variables, (c) a dependency section indicating respective references to one or more machine learning artifacts stored in a repository, or (d) an output section indicating one or more transformation operations to be applied to at least one entity indicated in the group definitions section, the assignment section, or the dependency section;validate, in accordance with (a) a set of syntax rules defined by the machine learning service and (b) a set of library function definitions for transformation operation types supported by the machine learning service, the text representation of the recipe; generate an executable representation of the recipe; store the executable representation in the repository; determine that the recipe is to be applied to a particular data set; verify that the particular data set meets a run-time acceptance criterion of the recipe; and apply, using one or more selected provider network resources, a particular transformation operation of the one or more transformation operations to the particular data set. - View Dependent Claims (2, 3, 4, 5)
-
6. A method, comprising:
performing, by one or more computing devices; receiving, at a network-accessible machine learning service, a first representation of a recipe comprising one or more of;
(a) a group definitions section indicating one or more groups of variables, wherein individual ones of the one or more groups comprise a plurality of data set variables on which at least one common transformation operation is to be applied and (b) an output section indicating one or more transformation operations to be applied to at least one entity indicated in one or more of;
(i) the group definitions section or (ii) an input data set;validating, in accordance with at least a set of library function definitions for transformation operation types supported by the machine learning service, the first representation of the recipe; generating an executable representation of the recipe; determining that the recipe is to be applied to a particular data set; verifying that the particular data set meets a run-time acceptance criterion; and applying, using one or more selected provider network resources, a particular transformation operation of the one or more transformation operations to the particular data set. - View Dependent Claims (7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
23. A non-transitory computer-accessible storage medium storing program instructions that when executed on one or more processors:
-
determine, at a machine learning service, a first representation of a recipe comprising one or more of;
(a) a group definitions section indicating one or more groups of variables, wherein individual ones of the one or more groups comprise a plurality of data set variables on which at least one common transformation operation is to be applied, or (b) an output section indicating one or more transformation operations to be applied to at least one entity indicated in one or more of (i) the group definitions section or (ii) an input data set of the recipe;validate, in accordance with at least a set of library function definitions for transformation operation types supported by the machine learning service, the first representation of the recipe; generate an executable representation of the recipe; and in response to a determination that the recipe is to be applied to a particular data set, use one or more selected provider network resources to implement a particular transformation operation of the one or more transformation operations to the particular data set. - View Dependent Claims (24, 25, 26, 27)
-
Specification