Speech Recognition Device and Speech Recognition Method
First Claim
1. A voice recognition device for executing noise adaptation processing based on a noise model on an input voice signal to carry out voice recognition on the input voice signal is characterized by comprising:
- first storage means for calculating a first parameter representative of plural noise models contained in each of plural noise environmental categories in advance and storing the first parameter every noise environmental category;
second storage means for calculating a second parameter representing relative positional information between each of the plural noise models and the first parameter in advance and storing the second parameter;
estimating means for estimating, on the basis of the characteristic of an environmental noise superposed by the input voice signal, a noise environmental category to which the environmental noise concerned belongs;
selecting means for selecting and extracting the first parameter corresponding to a noise environmental category estimated by the estimating means from the first storage means; and
noise adaptation means for restoring a noise model adaptive to the environmental noise by using the first parameter extracted by the selecting means and the second parameter read out from the second storage means and executing noise adaptation processing on the input voice signal by using the noise model thus restored.
1 Assignment
0 Petitions
Accused Products
Abstract
There is provided a voice recognition device and a voice recognition method that enhance the function of noise adaptation processing in voice recognition processing and reduce the capacity of a memory being used. Acoustic models are subjected to clustering processing to calculate the centroid of each cluster and the differential vector between the centroid and each model, model composition between each kind of assumed noise model and the calculated centroid is carried out, and the centroid of each composition model and the differential vector are stored in a memory. In the actual recognition processing, the centroid optimal to the environment estimated by the utterance environmental estimation is extracted from the memory, model restoration is carried out on the extracted centroid by using the differential vector stored in the memory, and noise adaptation processing is executed on the basis of the restored model.
-
Citations
9 Claims
-
1. A voice recognition device for executing noise adaptation processing based on a noise model on an input voice signal to carry out voice recognition on the input voice signal is characterized by comprising:
-
first storage means for calculating a first parameter representative of plural noise models contained in each of plural noise environmental categories in advance and storing the first parameter every noise environmental category; second storage means for calculating a second parameter representing relative positional information between each of the plural noise models and the first parameter in advance and storing the second parameter; estimating means for estimating, on the basis of the characteristic of an environmental noise superposed by the input voice signal, a noise environmental category to which the environmental noise concerned belongs; selecting means for selecting and extracting the first parameter corresponding to a noise environmental category estimated by the estimating means from the first storage means; and noise adaptation means for restoring a noise model adaptive to the environmental noise by using the first parameter extracted by the selecting means and the second parameter read out from the second storage means and executing noise adaptation processing on the input voice signal by using the noise model thus restored. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A voice recognition method for executing noise adaptation processing based on a noise model on an input voice signal to carry out voice recognition on the input voice signal is characterized by comprising:
-
a step of calculating a first parameter representative of plural noise models contained in each of plural noise environmental categories in advance and storing the first parameter into a first memory every noise environmental category; a step of calculating a second parameter representing relative positional information between each of the plural noise models and the first parameter in advance and storing the second parameter into a second memory; a step of estimating, on the basis of the characteristic of an environmental noise superposed by the input voice signal, a noise environmental category to which the environmental noise concerned belongs; a step of selecting and extracting the first parameter corresponding to an estimated noise environmental category from the first memory; and a step of restoring a noise model adaptive to the environmental noise by using the selected and extracted first parameter and the second parameter read out from the second memory and executing noise adaptation processing on the input voice signal by using the noise model thus restored. - View Dependent Claims (9)
-
Specification