Speech recognition system with improved rejection of words and sounds not in the system vocabulary
First Claim
1. A speech recognition apparatus comprising:
- an acoustic processor for measuring the value of at least one feature of each of a sequence of at least two sounds, said acoustic processor measuring the value of the feature of each sound during each of a series of successive time intervals to produce a series of feature signals representing the feature values of the sound;
means for storing a set of acoustic command models, each acoustic command model representing one or more series of acoustic feature values representing an utterance of a command associated with the acoustic command model.a match score processor for generating a match score for each sound and each of one or more acoustic command models from the set of acoustic command models, each match score comprising an estimate of the closeness of a match between the acoustic command model and a series of feature signals corresponding to the sound; and
means for outputting a recognition signal corresponding to the acoustic command model having a best match score for a current sound if the best match score for the current sound is greater than a recognition threshold score for the current sound, the recognition threshold score for the current sound is equal to (a) a first confidence score if the best match score for a prior sound was greater than a recognition threshold for the prior sound, or (b) a second confidence score greater than the first confidence score if the best match score for the prior sound was less than the recognition threshold for the prior sound.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech recognizer that selects a command model for a current sound if the best match score for the current sound exceeds its corresponding threshold score. The threshold score is assigned a confidence score based on the best match score and recognition threshold of a prior sound. When the best match score for the current sound exceeds a "poor" confidence score but is less than a "good" confidence score: (a) the word corresponding to the acoustic model having the best match score is accepted as highly likely to correspond to the measured sound if the previously recognized word was accepted as having a high likelihood of corresponding to the previous sound; (b) the word corresponding to the acoustic model having the best match score is rejected as highly unlikely to correspond to the measured sound if the previously recognized word was rejected as having a low likelihood of corresponding to the previous sound; or (c) if there is sufficient intervening silence between a previously rejected word and the current word, then the current word is also accepted as having a high likelihood of corresponding to the measured current sound.
101 Citations
20 Claims
-
1. A speech recognition apparatus comprising:
-
an acoustic processor for measuring the value of at least one feature of each of a sequence of at least two sounds, said acoustic processor measuring the value of the feature of each sound during each of a series of successive time intervals to produce a series of feature signals representing the feature values of the sound;
means for storing a set of acoustic command models, each acoustic command model representing one or more series of acoustic feature values representing an utterance of a command associated with the acoustic command model.a match score processor for generating a match score for each sound and each of one or more acoustic command models from the set of acoustic command models, each match score comprising an estimate of the closeness of a match between the acoustic command model and a series of feature signals corresponding to the sound; and means for outputting a recognition signal corresponding to the acoustic command model having a best match score for a current sound if the best match score for the current sound is greater than a recognition threshold score for the current sound, the recognition threshold score for the current sound is equal to (a) a first confidence score if the best match score for a prior sound was greater than a recognition threshold for the prior sound, or (b) a second confidence score greater than the first confidence score if the best match score for the prior sound was less than the recognition threshold for the prior sound. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A speech recognition method comprising the steps of:
-
measuring the value of at least one feature of each of a sequence of at least two sounds, the value of the feature of each sound being measured during each of a series of successive time intervals to produce a series of feature signals representing the feature values of the sound; storing a set of acoustic command models, each acoustic command model representing one or more series of acoustic feature values representing an utterance of a command associated with the acoustic command model; generating a match score for each sound and each of one or more acoustic command models from the set of acoustic command models, each match score comprising an estimate of the closeness of a match between the acoustic command model and a series of feature signals corresponding to the sound; and outputting a recognition signal corresponding to the acoustic command model having a best match score for a current sound if the best match score for the current sound is greater than a recognition threshold score for the current sound, the recognition threshold score for the current sound is equal to a first confidence score if the best match score for a prior sound was greater than a recognition threshold for the prior sound, or (b) a second confidence score greater than the first confidence score if the best match score for the prior sound was less than the recognition threshold for the prior sound. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20)
-
Specification