Method for training a statistical classifier with reduced tendency for overfitting
First Claim
1. A method for training a statistical classifier to recognize input patterns that belong to respective predetermined classes, utilizing a set of training samples which are respectively associated with said classes, comprising the following steps which are repeated over a large number of iterations:
- selecting a training sample from said set of training samples;
producing a set of distortion parameters;
selectively distorting said training sample in accordance with said distortion parameters to compute a classifier input pattern; and
training the classifier using said classifier input pattern.
2 Assignments
0 Petitions
Accused Products
Abstract
To prevent overfitting a neural network to a finite set of training samples, random distortions are dynamically applied to the samples each time they are applied to the network during a training session. A plurality of different types of distortions can be applied, which are randomly selected each time a sample is applied to the network. Alternatively, a combination of two or more types of distortion can be applied each time, with the amount of distortion being randomly varied for each type.
-
Citations
24 Claims
-
1. A method for training a statistical classifier to recognize input patterns that belong to respective predetermined classes, utilizing a set of training samples which are respectively associated with said classes, comprising the following steps which are repeated over a large number of iterations:
-
selecting a training sample from said set of training samples; producing a set of distortion parameters; selectively distorting said training sample in accordance with said distortion parameters to compute a classifier input pattern; and training the classifier using said classifier input pattern. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A method for training a statistical classifier to recognize input patterns that belong to respective predetermined classes, comprising the steps of:
-
(i) generating a set of training sample patterns which are respectively associated with said classes; (ii) processing sample patterns from said set in the classifier to generate output values; (iii) determining error values based on differences between said output values and target values associated with said sample patterns; (iv) adjusting operating parameters of the classifier in accordance with said error values; (v) iteratively repeating steps (ii)-(iv) with the sample patterns from said set; and (vi) modifying said patterns prior to processing them in the classifier, during successive iterations of steps (ii)-(iv). - View Dependent Claims (16, 17, 18, 19, 20, 21, 22)
-
-
23. A method for training a statistical classifier to recognize input patterns that belong to respective predetermined classes, utilizing a set of training samples which are respectively associated with said classes, comprising the following steps:
-
selecting a first training sample from said set of training samples; producing a first set of distortion parameters; selectively distorting said first training sample in accordance with said first set of distortion parameters to dynamically compute a first classifier input pattern; training the classifier using said first classifier input pattern; selecting a second training sample from said set of training samples; producing a second set of distortion parameters; selectively distorting said second training sample in accordance with said second set of distortion parameters to dynamically compute a second classifier input pattern; and training the classifier using said second classifier input pattern.
-
-
24. A method for training a statistical classifier to recognize input patterns that belong to respective predetermined classes, utilizing a set of training samples which are respectively associated with said classes, comprising the following steps:
-
selecting a training sample from said set of training samples; producing a first set of distortion parameters; selectively distorting said training sample in accordance with said first set of distortion parameters to compute a first classifier input pattern; training the classifier using said first classifier input pattern; subsequently selecting said training sample from said set of training samples; producing a second set of distortion parameters; selectively distorting said training sample in accordance with said second set of distortion parameters, to compute a second classifier input pattern; and training the classifier using said second classifier input pattern.
-
Specification