False alarm reduction in speech recognition systems using contextual information
First Claim
1. A computerized method for reducing false alarms in a speech recognition system, the method comprising:
- receiving a plurality of training examples;
generating a model of a left internal context based at least in part on the plurality of training examples, wherein the generation of the model includes compact representation of the left internal context in the form of spectral, cepstral or sinusoidal descriptions;
generating a model of a right internal context based at least in part on the plurality of training examples, wherein the generation of the model includes compact representation of the right internal context in the form of spectral, cepstral or sinusoidal descriptions;
generating a model of a left external context based at least in part on the plurality of training examples, wherein the generation of the model includes compact representation of the left external context in the form of spectral, cepstral or sinusoidal descriptions;
generating a model of a right external context based at least in part on the plurality of training examples, wherein the generation of the model includes compact representation of the right external context in the form of spectral, cepstral or sinusoidal descriptions;
receiving at least one test word, the at least one test word comprising an external context;
comparing the external context of the at least one test word against a threshold associated with each of the model of the left internal context, the model of the right internal context, the model of the left external context, and the model of the right external context; and
rejecting the at least one test word if it is not within the thresholds.
5 Assignments
0 Petitions
Accused Products
Abstract
A system and method are presented for using spoken word verification to reduce false alarms by exploiting global and local contexts on a lexical level, a phoneme level, and on an acoustical level. The reduction of false alarms may occur through a process that determines whether a word has been detected or if it is a false alarm. Training examples are used to generate models of internal and external contexts which are compared to test word examples. The word may be accepted or rejected based on comparison results. Comparison may be performed either at the end of the process or at multiple steps of the process to determine whether the word is rejected.
32 Citations
14 Claims
-
1. A computerized method for reducing false alarms in a speech recognition system, the method comprising:
-
receiving a plurality of training examples; generating a model of a left internal context based at least in part on the plurality of training examples, wherein the generation of the model includes compact representation of the left internal context in the form of spectral, cepstral or sinusoidal descriptions; generating a model of a right internal context based at least in part on the plurality of training examples, wherein the generation of the model includes compact representation of the right internal context in the form of spectral, cepstral or sinusoidal descriptions; generating a model of a left external context based at least in part on the plurality of training examples, wherein the generation of the model includes compact representation of the left external context in the form of spectral, cepstral or sinusoidal descriptions; generating a model of a right external context based at least in part on the plurality of training examples, wherein the generation of the model includes compact representation of the right external context in the form of spectral, cepstral or sinusoidal descriptions; receiving at least one test word, the at least one test word comprising an external context; comparing the external context of the at least one test word against a threshold associated with each of the model of the left internal context, the model of the right internal context, the model of the left external context, and the model of the right external context; and rejecting the at least one test word if it is not within the thresholds. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computerized method for reducing false alarms in a speech recognition system, the method comprising:
-
receiving a plurality of training examples, each training example comprising a representation of a spoken word and a local context; generating at least one model of an acoustic context based on the plurality of training examples, wherein the generation of the model includes compact representation of the acoustic context in the form of spectral, cepstral or sinusoidal descriptions; generating at least one model of a phonetic context based on the plurality of training examples, wherein the generation of the model includes compact representation of the phonetic context in the form of spectral, cepstral or sinusoidal descriptions; generating at least one model of a linguistic context based on the plurality of training examples, wherein the generation of the model includes compact representation of the linguistic context in the form of spectral, cepstral or sinusoidal descriptions; receiving at least one test word, the at least one test word comprising an external context; comparing the at least one test word against a threshold associated with each of the model of the acoustic context, the model of the phonetic context, and the model of the linguistic context; and rejecting the at least one test word if it is not within the thresholds. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
Specification