Relevance vector machine
First Claim
Patent Images
1. A computer-implemented method operable on a data set to be modeled, comprising:
- determining a prior distribution for the data set for modeling thereof, the prior distribution having a plurality of weights and a corresponding plurality of hyperparameters;
determining a relevance vector learning machine to obtain a posterior distribution; and
outputting the posterior distribution for the data set.
2 Assignments
0 Petitions
Accused Products
Abstract
A relevance vector machine (RVM) for data modeling is disclosed. The RVM is a probabilistic basis model. Sparsity is achieved through a Bayesian treatment, where a prior is introduced over the weights governed by a set of hyperparameters. As compared to a Support Vector Machine (SVM), the non-zero weights in the RVM represent more prototypical examples of classes, which are termed relevance vectors. The trained RVM utilizes many fewer basis functions than the corresponding SVM, and typically superior test performance. No additional validation of parameters (such as C) is necessary to specify the model, except those associated with the basis.
-
Citations
23 Claims
-
1. A computer-implemented method operable on a data set to be modeled, comprising:
-
determining a prior distribution for the data set for modeling thereof, the prior distribution having a plurality of weights and a corresponding plurality of hyperparameters;
determining a relevance vector learning machine to obtain a posterior distribution; and
outputting the posterior distribution for the data set. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
determining a marginal likelihood for the plurality of hyperparameters; and
iteratively re-estimating the plurality of hyperparameters to optimize the plurality of hyperparameters.
-
-
3. The method of claim 2, wherein optimizing a marginal likelihood comprises utilizing an EM approach.
-
4. The method of claim 2, wherein optimizing a marginal likelihood comprises utilizing a direct differentiation approach.
-
5. The method of claim 1, wherein determining a prior distribution comprises:
-
for each hyperparameter, determining most probable weights;
determining a Hessian at the most probable weights; and
,repeating until a predetermined convergence criteria has been satisfied.
-
-
6. The method of claim 1, wherein determining a prior distribution for the data set for modeling thereof comprises determining the prior distribution for the data set for linear modeling thereof.
-
7. The method of claim 1, wherein determining a prior distribution for the data set for modeling thereof comprises determining the prior distribution for the data set for classification modeling thereof.
-
8. The method of claim 1, wherein the prior distribution comprises a Gaussian prior distribution.
-
9. A computer-implemented method comprising;
-
determining a relevance vector learning machine to obtain a posterior distribution of a data set to be modeled; and
,outputting at least the posterior distribution for the data set. - View Dependent Claims (10, 11)
-
-
12. A computer-implemented method comprising;
-
selecting a model for a data set to be modeled;
determining a prior distribution for the model having a plurality of weights by utilizing a corresponding plurality of hyperparameters to simplify the model;
determining a relevance vector learning machine to obtain a posterior distribution; and
outputting at least the posterior distribution for the model.
-
-
13. A machine-readable medium having instructions stored thereon for execution by a processor to perform a method comprising:
-
inputting a data set to be modeled;
determining a prior distribution for the data set for modeling thereof, the prior distribution having a plurality of weights and a corresponding plurality of hyperparameters, comprising, determining a marginal likelihood for the plurality of hyperparameters;
iteratively re-estimating the plurality of hyperparameters to optimize the plurality of hyperparameters;
determining a relevance vector learning machine to obtain a posterior distribution; and
outputting the posterior distribution for the data set. - View Dependent Claims (14, 15, 16)
-
-
17. A machine-readable medium having instructions stored thereon for execution by a processor to perform a method operable on a data set to be modeled comprising:
-
determining a prior distribution for the data set for modeling thereof, the prior distribution having a plurality of weights and a corresponding plurality of hyperparameters, comprising;
for each hyperparameter, determining most probable weights;
determining a Hessian at the most probable weights;
repeating until a predetermined convergence criteria has been satisfied; and
,outputting a posterior distribution for the data set. - View Dependent Claims (18)
-
-
19. A machine-readable medium having instructions stored thereon for execution by a process to perform a method operable on a data set to be modeled comprising:
-
selecting a model for the data set;
determining a prior distribution for the model having a plurality of weights by utilizing a corresponding plurality of hyperparameters to simplify the model;
determining a relevance vector learning machine to obtain a posterior distribution; and
outputting at least the posterior distribution for the model. - View Dependent Claims (20)
-
-
21. A machine-readable medium having instructions stored thereon for execution by a process to perform a method comprising:
-
determining a relevance vector learning machine having a plurality of basis functions to obtain a posterior distribution for a data set to be modeled; and
,outputting at least the posterior distribution for the data set. - View Dependent Claims (22, 23)
-
Specification