Error corrective mechanisms for consensus decoding of speech
First Claim
1. A method for correcting errors for consensus decoding of speech, the method comprising the steps of:
- creating a confusion set comprising a plurality of candidate words, each of said candidate words having an associated score and corresponding to an acoustic event;
selecting a candidate word from the confusion set as a word recognized for the acoustic event, wherein a candidate word other than a candidate word having a highest associated score is selected when one or more criteria are met; and
outputting the selected candidate word as the word recognized for the acoustic event.
2 Assignments
0 Petitions
Accused Products
Abstract
Techniques are described for decreasing the number of errors when consensus decoding is used during speech recognition. A number of corrective rules are applied to confusion sets that are extracted during real-time speech recognition. The corrective rules are determined during training of the speech recognition system, which entails using many training confusion sets. A learning process is used that generates a number of possible rules, called template rules, that can be applied to the training confusion sets. The learning process also determines the corrective rules from the template rules. The corrective rules operate on the real-time confusion sets to select hypothesis words from the confusion sets, where the hypothesis words are not necessarily the words having the highest score.
28 Citations
22 Claims
-
1. A method for correcting errors for consensus decoding of speech, the method comprising the steps of:
-
creating a confusion set comprising a plurality of candidate words, each of said candidate words having an associated score and corresponding to an acoustic event;
selecting a candidate word from the confusion set as a word recognized for the acoustic event, wherein a candidate word other than a candidate word having a highest associated score is selected when one or more criteria are met; and
outputting the selected candidate word as the word recognized for the acoustic event. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 14, 15, 16)
-
-
9. A method for determining a plurality of corrective rules from a plurality of training confusion sets, the method comprising the steps of:
-
determining a plurality of allowable transformations on the training confusion sets, each training confusion set comprising a plurality of candidate words determined from utterances of one or more individuals, wherein each candidate word in a training confusion set has an associated score;
providing an objective function; and
learning a plurality of corrective rules for the training confusion sets, the step of learning using the allowable transformations and objective function, wherein at least one of the plurality of corrective rules selects, for a given training confusion set, a candidate word other than a candidate word having a highest associated score. - View Dependent Claims (10, 11, 12, 13)
-
-
17. A system for correcting errors for consensus decoding of speech, comprising:
-
a memory that stores computer-readable code; and
a processor operatively coupled to said memory, said processor configured to implement said computer-readable code, said computer-readable code configured to;
create a confusion set comprising a plurality of candidate words, each of said candidate words having an associated score and corresponding to an acoustic event;
select a candidate word from the confusion set as a word recognized for the acoustic event, wherein a candidate word other than a candidate word having a highest associated score is selected when one or more criteria are met; and
output the selected candidate word as the word recognized for the acoustic event.
-
-
18. A system for determining a plurality of corrective rules from a plurality of training confusion sets, comprising:
-
a memory that stores computer-readable code; and
a processor operatively coupled to said memory, said processor configured to implement said computer-readable code, said computer-readable code configured to;
determine a plurality of allowable transformations on the training confusion sets, each training confusion set comprising a plurality of candidate words determined from utterances of one or more individuals, wherein each candidate word in a training confusion set has an associated score;
provide an objective function; and
learn a plurality of corrective rules for the training confusion sets, the step of learning using the allowable transformations and objective function, wherein at least one of the plurality of corrective rules selects, for a given training confusion set, a candidate word other than a candidate word having a highest associated score.
-
-
19. An article of manufacture for correcting errors for consensus decoding of speech, comprising:
-
a computer readable medium having computer readable code means embodied thereon, said computer readable program code means comprising;
a step to create a confusion set comprising a plurality of candidate words, each of said candidate words having an associated score and corresponding to an acoustic event;
a step to select a candidate word from the confusion set as a word recognized for the acoustic event, wherein a candidate word other than a candidate word having a highest associated score is selected when one or more criteria are met; and
a step to output the selected candidate word as the word recognized for the acoustic event.
-
-
20. An article of manufacture for determining a plurality of corrective rules from a plurality of training confusion sets, comprising:
-
a computer readable medium having computer readable code means embodied thereon, said computer readable program code means comprising;
a step to determine a plurality of allowable transformations on the training confusion sets, each training confusion set comprising a plurality of candidate words determined from utterances of one or more individuals, wherein each candidate word in a training confusion set has an associated score;
a step to provide an objective function; and
a step to learn a plurality of corrective rules for the training confusion sets, the step of learning using the allowable transformations and objective function, wherein at least one of the plurality of corrective rules selects, for a given training confusion set, a candidate word other than a candidate word having a highest associated score.
-
-
21. A system for correcting errors for consensus decoding of speech, comprising:
-
means for creating a confusion set comprising a plurality of candidate words, each of said candidate words having an associated score and corresponding to an acoustic event;
means for selecting a candidate word from the confusion set as a word recognized for the acoustic event, wherein a candidate word other than a candidate word having a highest associated score is selected when one or more criteria are met; and
means for outputting the selected candidate word as the word recognized for the acoustic event.
-
-
22. A system for determining a plurality of corrective rules from a plurality of training confusion sets, comprising:
-
means for determining a plurality of allowable transformations on the training confusion sets, each training confusion set comprising a plurality of candidate words determined from utterances of one or more individuals, wherein each candidate word in a training confusion set has an associated score;
means for providing an objective function; and
means for learning a plurality of corrective rules for the training confusion sets, the step of learning using the allowable transformations and objective function, wherein at least one of the plurality of corrective rules selects, for a given training confusion set, a candidate word other than a candidate word having a highest associated score.
-
Specification