Exponential priors for maximum entropy models
First Claim
1. A computer implemented system that facilitates maximizing probabilities to effectuate spain filtering comprising:
- a data input component that provides one or more types of data for analysis; and
an analysis component that analyzes at least a subset of the one or more types of data to compute maximized probabilities to identify at least one spain email message by employing an iterative scaling function, a plurality of Exponential priors that correspond to a plurality of different features respectively, and at least one of;
a LaPlacian prior and a non-Gaussian.
2 Assignments
0 Petitions
Accused Products
Abstract
The subject invention provides for systems and methods that facilitate optimizing one or mores sets of training data by utilizing an Exponential distribution as the prior on one or more parameters in connection with a maximum entropy (maxent) model to mitigate overfitting. Maxent is also known as logistic regression. More specifically, the systems and methods can facilitate optimizing probabilities that are assigned to the training data for later use in machine learning processes, for example. In practice, training data can be assigned their respective weights and then a probability distribution can be assigned to those weights.
67 Citations
9 Claims
-
1. A computer implemented system that facilitates maximizing probabilities to effectuate spain filtering comprising:
-
a data input component that provides one or more types of data for analysis; and an analysis component that analyzes at least a subset of the one or more types of data to compute maximized probabilities to identify at least one spain email message by employing an iterative scaling function, a plurality of Exponential priors that correspond to a plurality of different features respectively, and at least one of;
a LaPlacian prior and a non-Gaussian. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer-readable medium having stored thereon the following computer executable components to identify spain email messages, comprising:
-
a data input component that provides one or more types of data for analysis; and an analysis component that analyzes at least a subset of the one or more types of data to compute maximized probabilities by employing at least an iterative scaling function, a plurality of Exponential priors that correspond to a plurality of different features respectively, and at least one LaPlacian prior and a non-Gaussian distribution, the maximized probabilities employed to ascertain that at least one of the one or more types of data includes a spain email message.
-
Specification