×

Probability estimate for K-nearest neighbor

  • US 7,451,123 B2
  • Filed: 12/08/2005
  • Issued: 11/11/2008
  • Est. Priority Date: 06/27/2002
  • Status: Expired due to Fees
First Claim
Patent Images

1. A computer-implemented method of training a K nearest neighbor classifier, comprising:

  • obtaining a set of data comprising a first subset of training data and a second subset of training data;

    training the K nearest neighbor classifier on the first subset of training data via receiving feature vectors of objects to be classified;

    sequentially processing the second subset of training data to compute K nearest neighbor classifier outputs for respective points of the second set of training data via outputting a classifier output vector and transforming distances between respective points of the first set and second set of training data, wherein transforming comprises a kernel function for taking an exponential of a negative of a scaled Euclidean distance between respective points of the first set and the second set to produce an associated Gaussian similarity measure;

    determining parameters for a parametric model according to the K nearest neighbor classifier outputs, and true outputs of respective points of the second set of training data, the K nearest neighbor classifier outputs indicate;

    a distance of an input to K nearest points, classes of the K nearest points, and identities of the K nearest points, wherein the parameters are trained via a second training set disjoint from a first training set used to train the K nearest neighbor classifier;

    converting the computed classifier outputs to probabilistic outputs using a probability model, wherein the probability model is built with the classifier outputs and trained via processing various inputs and outputs so as to provide probabilistic outputs from within acceptable error thresholds;

    employing the probabilistic outputs for recognition of at least one of;

    handwriting samples;

    medical images;

    faces;

    fingerprints;

    signals;

    automatic control phenomena;

    natural phenomena; and

    nucleotide sequences; and

    employing a class of the K nearest neighbor classifier outputs to determine a class of the first subset of training data, the K nearest neighbor classifier outputs indicate at least one of;

    a distance of an input to K nearest points, classes of the K nearest points, and identities of the K nearest points.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×