Method and apparatus of speech recognition and speech control system using the speech recognition method
Abstract
A string of acoustic feature parameters of each recognition-desired word and a string of acoustic feature parameters of each reception word are registered in advance. When an uttered word is received, a string of acoustic feature parameters is extracted from the uttered word, the string of acoustic feature parameters of the uttered word is compared with the string of acoustic feature parameters of each recognition-desired word, and a recognition-desired word recognition score indicating a similarity degree between the uttered word and each recognition-desired word is calculated. Also, a reception word recognition score indicating a similarity degree between the uttered word and each reception word is calculated. In cases where a particular recognition-desired word recognition score corresponding to a particular recognition-desired word is higher than the highest reception word recognition score, the uttered word is recognized as the particular recognition-desired word, and an operation of an electric apparatus is controlled according to the particular recognition-desired word. In contrast, in cases where a particular reception word recognition score corresponding to a particular reception word is higher than the highest recognition-desired word recognition score, the uttered word is recognized as the particular reception word and is rejected, so that the electric apparatus is not operated.
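The accept/reject rule described above can be sketched in a few lines. This is only an illustrative sketch, not the patented implementation: the function name `decide`, the dictionary layout, and the example words are assumptions, and the scores are taken as already computed, with a higher score meaning greater similarity.

```python
from typing import Dict, Optional


def decide(desired_scores: Dict[str, float],
           reception_scores: Dict[str, float]) -> Optional[str]:
    """Return the recognized recognition-desired word, or None on rejection.

    Hypothetical helper: both arguments map a registered word to its
    recognition score (higher means more similar to the uttered word).
    """
    # Best-scoring recognition-desired word.
    best_word = max(desired_scores, key=desired_scores.get)
    best_desired = desired_scores[best_word]
    # Highest score among the reception (rejection) words.
    best_reception = max(reception_scores.values())
    # Accept only on a strictly higher score; a tie is rejected.
    if best_desired > best_reception:
        return best_word
    return None
```

For example, with hypothetical scores, `decide({"on": 0.9, "off": 0.4}, {"one": 0.7})` accepts the word `"on"`, while equal best scores on both sides lead to rejection.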
11 Claims
1. A speech recognition method, comprising the steps of:
registering an acoustic feature of a recognition-desired word desired to be recognized for each of a plurality of recognition-desired words;
registering an acoustic feature of a reception word differing from the recognition-desired words for each of a plurality of reception words;
receiving an utterance including an uttered word;
calculating a recognition-desired word recognition score indicating a similarity degree between the uttered word and each recognition-desired word by comparing the acoustic feature of the recognition-desired word with an acoustic feature of the uttered word;
calculating a reception word recognition score indicating a similarity degree between the uttered word and each reception word by comparing the acoustic feature of the reception word with the acoustic feature of the uttered word;
recognizing the uttered word as a particular recognition-desired word corresponding to a particular recognition-desired word recognition score in cases where the particular recognition-desired word recognition score is higher than the highest reception word recognition score; and
rejecting the utterance in cases where the highest recognition-desired word recognition score is equal to or lower than the highest reception word recognition score.
2. A speech recognition method according to claim 1, further comprising the step of:
informing a user that the utterance is rejected in cases where the utterance is rejected.
3. A speech recognition method according to claim 1, in which the step of calculating a recognition-desired word recognition score includes the steps of:
analyzing the acoustic feature of the uttered word;
calculating a statistical distance between the acoustic feature of the uttered word and the acoustic feature of each recognition-desired word on a statistical distance scale; and
setting the statistical distance of one recognition-desired word as the recognition-desired word recognition score of the recognition-desired word for each recognition-desired word, and
the step of calculating a reception word recognition score includes the steps of:
calculating a statistical distance between the acoustic feature of the uttered word and the acoustic feature of each reception word on a statistical distance scale; and
setting the statistical distance of one reception word as the reception word recognition score of the reception word for each reception word.
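The scoring steps of claim 3 can be illustrated as follows. This is a minimal sketch under stated assumptions: the claim does not name the statistical distance scale, so a diagonal-covariance Mahalanobis distance is assumed here, each acoustic feature is simplified to one fixed-length vector, and the distance is negated so that a higher score means greater similarity, as the comparison in claim 1 requires. The names `Model`, `score`, and `score_all` are hypothetical.

```python
from typing import Dict, List, Tuple

# Assumed model layout: (mean, variance) per feature dimension of one word.
Model = Tuple[List[float], List[float]]


def score(feature: List[float], model: Model) -> float:
    """Negated squared Mahalanobis distance with diagonal covariance."""
    mean, var = model
    dist = sum((x - m) ** 2 / v for x, m, v in zip(feature, mean, var))
    # Negate so that a smaller distance yields a higher score.
    return -dist


def score_all(feature: List[float],
              models: Dict[str, Model]) -> Dict[str, float]:
    """Score the uttered word against every registered word model."""
    return {word: score(feature, model) for word, model in models.items()}
```

The same `score_all` call would be made twice, once with the recognition-desired word models and once with the reception word models, since the claim applies the same distance scale to both sets.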
4. A speech recognition apparatus, comprising:
recognition-desired word registering means for registering an acoustic feature of a recognition-desired word desired to be recognized for each of a plurality of recognition-desired words;
reception word registering means for registering an acoustic feature of a reception word differing from the recognition-desired words for each of a plurality of reception words;
word receiving means for receiving an utterance including an uttered word;
recognition-desired word recognition score calculating means for calculating a recognition-desired word recognition score indicating a similarity degree between the uttered word received by the word receiving means and each recognition-desired word registered by the recognition-desired word registering means by comparing the acoustic feature of the recognition-desired word with an acoustic feature of the uttered word;
reception word recognition score calculating means for calculating a reception word recognition score indicating a similarity degree between the uttered word received by the word receiving means and each reception word registered by the reception word registering means by comparing the acoustic feature of the reception word with the acoustic feature of the uttered word;
word recognizing means for recognizing the uttered word received by the word receiving means as a particular recognition-desired word corresponding to a particular recognition-desired word recognition score calculated by the recognition-desired word recognition score calculating means in cases where the particular recognition-desired word recognition score is higher than the highest reception word recognition score calculated by the reception word recognition score calculating means; and
utterance rejecting means for rejecting the utterance received by the word receiving means in cases where the highest recognition-desired word recognition score calculated by the recognition-desired word recognition score calculating means is equal to or lower than the highest reception word recognition score calculated by the reception word recognition score calculating means.
5. A speech recognition apparatus according to claim 4, further comprising:
rejection informing means for informing a user that the utterance received by the word receiving means is rejected in cases where the utterance is rejected by the utterance rejecting means.
6. A speech recognition apparatus according to claim 4, further comprising:
acoustic feature extracting means for extracting the acoustic feature of the uttered word from the uttered word received by the word receiving means,
and a statistical distance between the acoustic feature of the uttered word extracted by the acoustic feature extracting means and the acoustic feature of one recognition-desired word registered by the recognition-desired word registering means on a statistical distance scale is set as the recognition-desired word recognition score of the recognition-desired word for each recognition-desired word by the recognition-desired word recognition score calculating means,
and a statistical distance between the acoustic feature of the uttered word extracted by the acoustic feature extracting means and the acoustic feature of one reception word registered by the reception word registering means on a statistical distance scale is set as the reception word recognition score of the reception word for each reception word by the reception word recognition score calculating means.
7. A speech control system, comprising:
recognition-desired word registering means for registering an acoustic feature of a recognition-desired word desired to be recognized for each of a plurality of recognition-desired words;
reception word registering means for registering an acoustic feature of a reception word differing from the recognition-desired words for each of a plurality of reception words;
word receiving means for receiving an utterance including an uttered word;
recognition-desired word recognition score calculating means for calculating a recognition-desired word recognition score indicating a similarity degree between the uttered word received by the word receiving means and each recognition-desired word registered by the recognition-desired word registering means by comparing the acoustic feature of the recognition-desired word with an acoustic feature of the uttered word;
reception word recognition score calculating means for calculating a reception word recognition score indicating a similarity degree between the uttered word received by the word receiving means and each reception word registered by the reception word registering means by comparing the acoustic feature of the reception word with the acoustic feature of the uttered word;
word recognizing means for recognizing the uttered word received by the word receiving means as a particular recognition-desired word corresponding to a particular recognition-desired word recognition score calculated by the recognition-desired word recognition score calculating means in cases where the particular recognition-desired word recognition score is higher than the highest reception word recognition score calculated by the reception word recognition score calculating means;
utterance rejecting means for rejecting the utterance in cases where the highest recognition-desired word recognition score calculated by the recognition-desired word recognition score calculating means is equal to or lower than the highest reception word recognition score calculated by the reception word recognition score calculating means;
operation performing means for performing an operation; and
control means for controlling the operation performing means to perform the operation in cases where the uttered word received by the word receiving means is recognized as the particular recognition-desired word by the word recognizing means.
8. A speech control system according to claim 7, further comprising:
rejection informing means for informing a user that the utterance received by the word receiving means is rejected in cases where the uttered word is rejected by the utterance rejecting means.
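How the control means, operation performing means, and rejection informing means of the speech control system might fit together can be sketched as below. Everything here is hypothetical and for illustration only: the class name `SpeechController`, the callback-based design, and the notification text are assumptions, and the recognition scores are taken as already computed, with higher meaning more similar.

```python
from typing import Callable, Dict, Optional


class SpeechController:
    """Hypothetical control means: runs an operation only on recognition."""

    def __init__(self,
                 operations: Dict[str, Callable[[], None]],
                 notify: Callable[[str], None]) -> None:
        # One operation per recognition-desired word (operation performing means).
        self.operations = operations
        # Callback standing in for the rejection informing means.
        self.notify = notify

    def handle(self,
               desired_scores: Dict[str, float],
               reception_scores: Dict[str, float]) -> Optional[str]:
        best_word = max(desired_scores, key=desired_scores.get)
        if desired_scores[best_word] > max(reception_scores.values()):
            # Recognized: perform the operation tied to the word.
            self.operations[best_word]()
            return best_word
        # Rejected: perform no operation and inform the user.
        self.notify("utterance rejected")
        return None
```

A caller would construct the controller with one callable per command word; a rejected utterance triggers only the notification and leaves the apparatus untouched.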
Specification