APPARATUS FOR SPEECH RECOGNITION USING MULTIPLE ACOUSTIC MODEL AND METHOD THEREOF
First Claim
1. An apparatus for recognizing voice using multiple acoustic models, the apparatus comprising:
a voice data database (DB) configured to store voice data collected in various noise environments;
a model generating means configured to perform classification for each speaker and environment based on the collected voice data, and to generate an acoustic model of a binary tree structure as the classification result; and
a voice recognizing means configured to extract feature data of voice data when the voice data is received from a user, to select multiple models from the generated acoustic model based on the extracted feature data, to parallel recognize the voice data based on the selected multiple models, and to output a word string corresponding to the voice data as the recognition result.
Abstract
Disclosed are an apparatus for recognizing voice using multiple acoustic models according to the present invention and a method thereof. An apparatus for recognizing voice using multiple acoustic models includes a voice data database (DB) configured to store voice data collected in various noise environments; a model generating means configured to perform classification for each speaker and environment based on the collected voice data, and to generate an acoustic model of a binary tree structure as the classification result; and a voice recognizing means configured to extract feature data of voice data when the voice data is received from a user, to select multiple models from the generated acoustic model based on the extracted feature data, to parallel recognize the voice data based on the selected multiple models, and to output a word string corresponding to the voice data as the recognition result.
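The model generating step described above can be illustrated with a minimal sketch. It is not the patented implementation: it assumes each utterance has already been reduced to a fixed-length feature vector, it uses a plain two-way k-means split as a stand-in for the speaker and environment classification, and it stores a centroid in each node where a real system would train an acoustic model. The names `ModelNode`, `split_in_two`, and `build_tree` are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence, Tuple

Vector = Sequence[float]


def mean(vectors: List[Vector]) -> List[float]:
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def sq_dist(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


@dataclass
class ModelNode:
    centroid: List[float]               # assumption: stand-in for a trained acoustic model
    size: int                           # number of utterances assigned to this node
    left: Optional["ModelNode"] = None
    right: Optional["ModelNode"] = None


def split_in_two(vectors: List[Vector], iterations: int = 10) -> Tuple[List[Vector], List[Vector]]:
    """Two-way k-means, seeded deterministically with the first vector
    and the vector farthest from it."""
    c0 = list(vectors[0])
    c1 = list(max(vectors, key=lambda v: sq_dist(v, c0)))
    a: List[Vector] = list(vectors)
    b: List[Vector] = []
    for _ in range(iterations):
        a = [v for v in vectors if sq_dist(v, c0) <= sq_dist(v, c1)]
        b = [v for v in vectors if sq_dist(v, c0) > sq_dist(v, c1)]
        if not a or not b:
            break  # data cannot be separated any further
        c0, c1 = mean(a), mean(b)
    return a, b


def build_tree(vectors: List[Vector], min_size: int = 2, max_depth: int = 3) -> ModelNode:
    """Recursively split the data so that every node holds one model
    and every internal node has exactly two children."""
    node = ModelNode(centroid=mean(vectors), size=len(vectors))
    if max_depth > 0 and len(vectors) >= 2 * min_size:
        a, b = split_in_two(vectors)
        if len(a) >= min_size and len(b) >= min_size:
            node.left = build_tree(a, min_size, max_depth - 1)
            node.right = build_tree(b, min_size, max_depth - 1)
    return node
```

In this sketch the root node covers all collected data and each level below it covers progressively narrower groups, so a recognizer can pick models at whatever granularity matches the incoming voice data.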
14 Claims
1. An apparatus for recognizing voice using multiple acoustic models, the apparatus comprising:
a voice data database (DB) configured to store voice data collected in various noise environments;
a model generating means configured to perform classification for each speaker and environment based on the collected voice data, and to generate an acoustic model of a binary tree structure as the classification result; and
a voice recognizing means configured to extract feature data of voice data when the voice data is received from a user, to select multiple models from the generated acoustic model based on the extracted feature data, to parallel recognize the voice data based on the selected multiple models, and to output a word string corresponding to the voice data as the recognition result.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A method of recognizing voice using multiple acoustic models, the method comprising:
storing voice data collected in various noise environments in voice data DB;
performing classification for each speaker and environment based on the collected voice data, and generating an acoustic model of a binary tree structure as the classification result; and
extracting feature data of voice data when the voice data is received from a user, selecting multiple models from the generated acoustic model based on the extracted feature data, parallel recognizing the voice data based on the selected multiple models, and outputting a word string corresponding to the voice data as the recognition result.
Dependent claims: 9, 10, 11, 12, 13, 14
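The recognition step of the method (select multiple models, recognize in parallel, output a word string) might be sketched as follows. Everything here is illustrative: `AcousticModel`, `toy_decoder`, `select_models`, and `recognize` are hypothetical names, the decoder is a toy that returns a fixed word string with a distance-based score, and keeping the highest-scoring hypothesis is an assumption, since the claim does not state how the final result is chosen.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

Vector = Sequence[float]
Hypothesis = Tuple[str, float]  # (word string, score; higher is better)


def sq_dist(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


@dataclass
class AcousticModel:
    name: str
    centroid: Vector                        # summary of the data the model was built from
    decode: Callable[[Vector], Hypothesis]  # stand-in for a real speech decoder


def toy_decoder(words: str, centroid: Vector) -> Callable[[Vector], Hypothesis]:
    """Toy decoder: always returns `words`, scored by closeness to `centroid`."""
    return lambda features: (words, -sq_dist(centroid, features))


def select_models(models: List[AcousticModel], features: Vector, k: int = 2) -> List[AcousticModel]:
    """Pick the k models whose centroids are closest to the extracted features."""
    return sorted(models, key=lambda m: sq_dist(m.centroid, features))[:k]


def recognize(models: List[AcousticModel], features: Vector, k: int = 2) -> str:
    """Decode with the selected models in parallel and return the best word string.
    Assumes `models` is non-empty and k >= 1."""
    selected = select_models(models, features, k)
    with ThreadPoolExecutor(max_workers=len(selected)) as pool:
        results = list(pool.map(lambda m: m.decode(features), selected))
    best_words, _ = max(results, key=lambda r: r[1])
    return best_words


models = [
    AcousticModel("quiet_room", [0.0, 0.0], toy_decoder("turn on the light", [0.0, 0.0])),
    AcousticModel("car_noise", [5.0, 5.0], toy_decoder("turn on the right", [5.0, 5.0])),
    AcousticModel("street", [9.0, 1.0], toy_decoder("burn on the light", [9.0, 1.0])),
]
result = recognize(models, [0.2, 0.1], k=2)
```

A thread pool is used only to keep the sketch short; a production recognizer would more likely run its decoders in separate processes or on dedicated hardware.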
Specification