UTTERANCE VERIFICATION METHOD AND APPARATUS FOR ISOLATED WORD N-BEST RECOGNITION RESULT
First Claim
1. An utterance verification method for an isolated word N-best speech recognition result, comprising:
- calculating log likelihoods of a context-dependent phoneme and an anti-phoneme model based on an N-best speech recognition result for an input utterance;
measuring a confidence score of an N-best speech-recognized word using the log likelihoods;
calculating a distance between phonemes for the N-best speech-recognized word;
comparing the confidence score with a threshold and the distance with a mean of distances; and
accepting the N-best speech-recognized word when the compared results for the confidence score and the distance correspond to acceptance.
1 Assignment
0 Petitions
Accused Products
Abstract
An utterance verification method for an isolated word N-best speech recognition result includes: calculating log likelihoods of a context-dependent phoneme and an anti-phoneme model based on an N-best speech recognition result for an input utterance; measuring a confidence score of an N-best speech-recognized word using the log likelihoods; calculating distance between phonemes for the N-best speech-recognized word; comparing the confidence score with a threshold and the distance with a predetermined mean of distances; and accepting the N-best speech-recognized word when the compared results for the confidence score and the distance correspond to acceptance.
37 Citations
18 Claims
-
1. An utterance verification method for an isolated word N-best speech recognition result, comprising:
-
calculating log likelihoods of a context-dependent phoneme and an anti-phoneme model based on an N-best speech recognition result for an input utterance; measuring a confidence score of an N-best speech-recognized word using the log likelihoods; calculating a distance between phonemes for the N-best speech-recognized word; comparing the confidence score with a threshold and the distance with a mean of distances; and accepting the N-best speech-recognized word when the compared results for the confidence score and the distance correspond to acceptance. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. An utterance verification apparatus for an isolated word N-best speech recognition result comprising:
-
a pre-processor for extracting a feature vector of an input utterance and performing endpoint detection; an N-best speech recognizer for performing N-best speech recognition through Viterbi search by referring to the context-dependent phoneme model extracted from the feature vector; and an N-best utterance verification unit for calculating log likelihoods of a context-dependent phoneme and an anti-phoneme model for the N-best speech-recognized word, comparing a confidence score measured for the N-best speech-recognized word with a threshold, comparing a distance measured for the N-best speech-recognized-word with a mean of distances, and accepting the N-best speech-recognized word when the compared results for the confidence score and the distance correspond to acceptances. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
Specification