SPEECH RECOGNITION SYSTEM AND PROGRAM THEREOF
Abstract
Speech recognition is performed by matching, for each speech frame of the inputted speech, a characteristic quantity of the inputted speech against a composite HMM (hidden Markov model) obtained by synthesizing a speech HMM and a noise HMM.
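For illustration only, the frame-by-frame matching summarized above can be sketched in Python with NumPy. The sketch rests on assumptions the abstract does not state: each HMM state is a single diagonal-covariance Gaussian over log-spectral features, speech and noise means are combined with the log-add approximation, and the noise model is chosen per frame by maximum likelihood. The function names (`combine_log_add`, `frame_scores`, `viterbi`) are hypothetical and are not taken from the patent.

```python
import numpy as np

def combine_log_add(speech_mean, noise_mean):
    """Assumed synthesis step: log-add of log-spectral means (speech + noise)."""
    return np.logaddexp(speech_mean, noise_mean)

def diag_gauss_loglik(x, mean, var):
    """Log-density of a diagonal-covariance Gaussian evaluated at x."""
    return -0.5 * np.sum(np.log(2 * np.pi * var) + (x - mean) ** 2 / var)

def frame_scores(frames, speech_means, noise_means, var):
    """Score every frame against every composite (speech state x noise model).

    For each frame and speech state, the best-matching noise model is
    selected independently of the other frames.
    """
    T, S = len(frames), len(speech_means)
    scores = np.empty((T, S))
    chosen = np.empty((T, S), dtype=int)
    for t, x in enumerate(frames):
        for s, sm in enumerate(speech_means):
            ll = [diag_gauss_loglik(x, combine_log_add(sm, nm), var)
                  for nm in noise_means]
            chosen[t, s] = int(np.argmax(ll))
            scores[t, s] = max(ll)
    return scores, chosen

def viterbi(scores, log_trans, log_init):
    """Best speech-state path given per-frame emission log-scores."""
    T, S = scores.shape
    delta = log_init + scores[0]
    back = np.zeros((T, S), dtype=int)
    for t in range(1, T):
        cand = delta[:, None] + log_trans  # cand[i, j]: move from state i to j
        back[t] = np.argmax(cand, axis=0)
        delta = np.max(cand, axis=0) + scores[t]
    path = [int(np.argmax(delta))]
    for t in range(T - 1, 0, -1):
        path.append(int(back[t, path[-1]]))
    return path[::-1], float(np.max(delta))
```

Calling `frame_scores` and then `viterbi` yields a state path together with, via `chosen`, the noise model picked at each frame, so the selected noise condition may change from one frame to the next within a single utterance.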
17 Claims
1. (canceled)
2. A speech recognition apparatus comprising:
a characteristic quantity extraction unit for extracting a characteristic quantity of an inputted speech to be recognized, wherein said apparatus performs speech recognition by matching between a predetermined speech and a phoneme hidden Markov model of speech data previously recorded;
a composite model generation unit for generating a composite model by synthesizing the phoneme hidden Markov model of speech data and a hidden Markov model of noise data previously recorded; and
a speech recognition unit for recognizing the inputted speech by matching the characteristic quantity being extracted in the characteristic quantity extraction unit from the inputted speech to the composite model generated in the composite model generation unit,
wherein the speech recognition unit executes matching between the characteristic quantity of the inputted speech and the composite model for each of adequate segments defined by punctuating a speech sequence in the inputted speech, and
wherein the speech recognition unit selects the composite model to be matched to the characteristic quantity of the inputted speech independently of each speech frame thereof and executes matching between the characteristic quantity of the inputted speech and the composite model.
3. (canceled)
4. A speech recognition apparatus comprising:
a speech database storing speech data as models for speech recognition;
a noise database storing noise data assumed to generate under a predetermined noise environment;
a composite model generation unit for generating a composite model by synthesizing a speech model generated based on the speech data read out from the speech database and a noise model generated based on the noise data read out from the noise database; and
a speech recognition unit for performing speech recognition by matching between a characteristic quantity of an inputted speech to be recognized and the composite model generated in the composite model generation unit independently of each speech frame of the inputted speech.
5. (canceled)
6. (canceled)
7. (canceled)
8. (canceled)
9. (canceled)
10. A computer program product comprising a tangible storage medium readable by a processing circuit and storing computer-readable instructions for execution by the processing circuit for performing a method of speech recognition, the method comprising steps of:
extracting a characteristic quantity of an inputted speech to be recognized;
generating a composite model including synthesizing a phoneme hidden Markov model of speech data previously recorded and a hidden Markov model of noise data previously recorded;
recognizing the inputted speech including matching between the characteristic quantity of the inputted speech and the composite model for each of adequate segments defined by punctuating a speech sequence in the inputted speech; and
selecting the composite model to be matched to the characteristic quantity of the inputted speech independently of each speech frame thereof and executes matching between the characteristic quantity of the inputted speech and the composite model.
11. (canceled)
12. (canceled)
13. (canceled)
14. (canceled)
15. (canceled)
16. (canceled)
17. (canceled)
Specification