Method and system for identifying spoken sounds in continuous speech by comparing classifier outputs
First Claim
1. In a speech-recognition system having a plurality of classifiers, a method of identifying a spoken sound, the method comprising the following steps:
(a) receiving a plurality of classifier output signal sequences from the classifiers, each of the classifier output signal sequences having been generated according to a polynomial discriminant function;
(b) defining a voting window to include portions of the classifier output signal sequences occurring within a finite period of time;
(c) selecting a winning classifier output corresponding to an interval within the finite period of the voting window, the winning classifier output signal having a magnitude larger than other classifier output signals that correspond to the same interval;
(d) repeating step (c) for a plurality of intervals occurring within the finite period to generate a plurality of winning classifier output signals; and
(e) identifying the spoken sound by determining which of the classifier output signal sequences includes the most winning classifier output signals within the voting window.
Abstract
In a speech-recognition system having a plurality of classifiers, a voting window includes a sequence of outputs from each of the classifiers. At each interval in the voting window, the outputs are compared to determine a winning output. A spoken sound is identified by determining which classifier generates the greatest number of winning outputs in the voting window.
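The voting scheme summarized above can be sketched in a few lines of Python. This is an illustrative sketch under stated assumptions, not the patented implementation: the function name `identify_sound`, the dictionary layout, and the sample values are hypothetical. Ties are broken in favor of whichever label comes first, a choice the text does not specify.

```python
from collections import Counter

def identify_sound(window):
    """window maps a classifier label to its outputs inside the voting window.

    All sequences are assumed to hold one output per interval and to have
    equal length.
    """
    labels = list(window)
    n_intervals = len(window[labels[0]])
    wins = Counter()
    for t in range(n_intervals):
        # The winner of interval t is the classifier with the largest magnitude.
        winner = max(labels, key=lambda k: abs(window[k][t]))
        wins[winner] += 1
    # The spoken sound is the label whose classifier won the most intervals.
    return max(labels, key=lambda k: wins[k]), dict(wins)

# Hypothetical outputs of three classifiers over a five-interval voting window.
window = {
    "ah": [0.9, 0.2, 0.7, 0.8, 0.1],
    "ee": [0.3, 0.6, 0.1, 0.2, 0.5],
    "oo": [0.1, 0.1, 0.2, 0.3, 0.4],
}
sound, wins = identify_sound(window)
print(sound, wins)
```

With these made-up values the "ah" classifier wins three of the five intervals, so "ah" is the identified sound even though it does not win every interval.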
22 Claims
8. A method for recognizing a spoken sound from continuous speech, comprising the following steps:
(a) receiving the continuous speech;
(b) sampling the continuous speech, over time, to form a sequence of sample datum which represents the continuous speech;
(c) partitioning the sequence of sample datum into a sequence of data frames, each of the sequence of data frames including at least two of the sequence of sample datum;
(d) extracting a plurality of features from the sequence of data frames;
(e) forming a sequence of feature frames from the plurality of features;
(f) distributing the sequence of feature frames to a plurality of classifiers, each of the classifiers generating a classifier output signal sequence in response thereto according to a polynomial discriminant function, whereby producing a plurality of classifier output signal sequences;
(g) defining a voting window to include portions of the classifier output signal sequences occurring within a finite period of time;
(h) selecting a winning classifier output signal corresponding to an interval within the finite period of the voting window, the winning classifier output signal having a magnitude larger than other classifier output signals that correspond to the same interval;
(i) repeating step (h) for a plurality of intervals occurring within the finite period to generate a plurality of winning classifier output signals; and
(j) identifying the spoken sound by determining which of the classifier output signal sequences includes the most winning classifier output signals within the voting window.
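Steps (c) through (f) of this claim, which turn samples into classifier output sequences, might look roughly like the sketch below. Several things here are assumptions, not details from the claim: frames are taken as non-overlapping, the two features (mean energy and mean absolute difference) are stand-ins, the discriminant is fixed at second order, and the labels and coefficients are invented for illustration.

```python
def partition(samples, frame_len):
    """Split the sample sequence into non-overlapping data frames (step c)."""
    if frame_len < 2:
        raise ValueError("each data frame must hold at least two samples")
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, frame_len)]

def extract_features(frame):
    """Stand-in features (assumed): mean energy and mean absolute difference."""
    energy = sum(s * s for s in frame) / len(frame)
    delta = sum(abs(b - a) for a, b in zip(frame, frame[1:])) / (len(frame) - 1)
    return (energy, delta)

def polynomial_discriminant(weights, x):
    """Second-order polynomial in two features; the order is an assumption."""
    x1, x2 = x
    terms = (1.0, x1, x2, x1 * x1, x1 * x2, x2 * x2)
    return sum(w * t for w, t in zip(weights, terms))

# Hypothetical coefficients; in practice they would come from training.
classifiers = {
    "loud": (0.0, 1.0, 0.0, 0.0, 0.0, 0.0),
    "quiet": (1.0, -1.0, 0.0, 0.0, 0.0, 0.0),
}

# Eight made-up samples: a quiet frame followed by a loud one.
samples = [0.0, 0.1, 0.0, -0.1, 1.0, -1.0, 1.0, -1.0]
feature_frames = [extract_features(f) for f in partition(samples, 4)]

# Each classifier sees every feature frame and emits one output per frame.
outputs = {label: [polynomial_discriminant(w, x) for x in feature_frames]
           for label, w in classifiers.items()}
print(outputs)
```

Each entry of `outputs` is one classifier output signal sequence; steps (g) through (j) then operate on these sequences.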
15. A speech-recognition system for identifying a spoken sound, the speech-recognition system comprising:
a plurality of classifiers for generating a plurality of classifier output signal sequences, each of the classifier output signal sequences being generated according to a polynomial discriminant function;
defining means for defining a voting window to include portions of the classifier output signal sequences occurring within a finite period of time;
determining means, associatively coupled to the defining means and the plurality of classifiers, for comparing classifier output signals corresponding to an interval occurring within the voting window to select a winning classifier output signal corresponding to the interval, the winning classifier output signal having a largest magnitude, the determining means repeating the comparison for a plurality of intervals occurring within the voting window to generate a plurality of winning classifier output signals; and
identifying means, associatively coupled to the defining means and the determining means, for identifying the spoken sound by determining which of the classifier output signal sequences includes the most winning classifier output signals within the voting window.
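One way to picture how the three "means" of this system claim fit together is a small class with one method per element. The sketch is hypothetical throughout: the class name `VotingRecognizer`, the fixed window length, and the choice of consecutive non-overlapping windows are assumptions made for illustration, and the claim does not say how successive windows are positioned.

```python
from collections import Counter

class VotingRecognizer:
    """Hypothetical wiring of the three 'means' in the system claim."""

    def __init__(self, window_len):
        self.window_len = window_len  # intervals per voting window (assumed fixed)

    def define_windows(self, sequences):
        """Defining means: cut the output sequences into consecutive windows."""
        n = min(len(seq) for seq in sequences.values())
        for start in range(0, n - self.window_len + 1, self.window_len):
            yield {k: seq[start:start + self.window_len]
                   for k, seq in sequences.items()}

    def winners(self, window):
        """Determining means: pick the largest-magnitude output per interval."""
        return [max(window, key=lambda k: abs(window[k][t]))
                for t in range(self.window_len)]

    def identify(self, winners):
        """Identifying means: the classifier with the most wins in the window."""
        return Counter(winners).most_common(1)[0][0]

    def recognize(self, sequences):
        """Return one identified label per voting window."""
        return [self.identify(self.winners(w))
                for w in self.define_windows(sequences)]

# Made-up output sequences from two classifiers over six intervals.
sequences = {
    "s": [0.8, 0.7, 0.2, 0.1, 0.3, 0.2],
    "t": [0.1, 0.2, 0.9, 0.6, 0.7, 0.9],
}
result = VotingRecognizer(window_len=3).recognize(sequences)
print(result)
```

Keeping the window definition, the per-interval comparison, and the final count in separate methods mirrors the claim's separation of elements and lets each be swapped out independently.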
Specification