Exponential priors for maximum entropy models
First Claim
1. A computer implemented method for maximizing probability values to facilitate training a machine learning system comprising:
- receiving a data set;
determining an Exponential distribution as an Exponential prior, comprising;
graphing a distribution of parameter values that have at least 30 training instances; and
determining the Exponential prior by examining the distribution of parameter values;
defining one or more parameters; and
training a model based at least in part upon a subset of the data set, the Exponential prior, and the one or more parameters.
2 Assignments
0 Petitions
Accused Products
Abstract
The subject invention provides for systems and methods that facilitate optimizing one or mores sets of training data by utilizing an Exponential distribution as the prior on one or more parameters in connection with a maximum entropy (maxent) model to mitigate overfitting. Maxent is also known as logistic regression. More specifically, the systems and methods can facilitate optimizing probabilities that are assigned to the training data for later use in machine learning processes, for example. In practice, training data can be assigned their respective weights and then a probability distribution can be assigned to those weights.
72 Citations
20 Claims
-
1. A computer implemented method for maximizing probability values to facilitate training a machine learning system comprising:
-
receiving a data set; determining an Exponential distribution as an Exponential prior, comprising; graphing a distribution of parameter values that have at least 30 training instances; and determining the Exponential prior by examining the distribution of parameter values; defining one or more parameters; and training a model based at least in part upon a subset of the data set, the Exponential prior, and the one or more parameters. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer implemented method for maximizing probability values to facilitate training a machine learning system comprising:
-
identifying one or more parameters from a data set, each parameter comprises at least 30 teaching instances; plotting a distribution of teaching instances for each of the one or more parameters identified; establishing an Exponential distribution as an Exponential prior for each of the one or more parameters by examining the distribution of teaching instances; and teaching a model based at least in part upon a subset of the data set, the Exponential prior, and the one or more parameters. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A computer implemented method for maximizing probability values to facilitate training a machine learning system comprising:
-
defining one or more parameters from a data set, each parameter comprises at least 30 training instances; mapping a distribution of training instances for each of the one or more parameters defined; determining an Exponential distribution as an Exponential prior for each of the one or more parameters by examining the distribution of training instances; computing an σ
2 variance for the Exponential prior; andteaching a model based at least in part upon a subset of the data set, the σ
2 variance for the Exponential prior, and the one or more parameters.
-
Specification