Method and system of estimating clean speech parameters from noisy speech parameters
First Claim
Patent Images
1. A method of estimating clean speech parameters from noisy speech parameters, said method comprising processor implemented steps of:
- acquiring speech signals using a speech acquisition module (202);
estimating noise from said acquired speech signals using a noise estimation module (204), wherein estimation of noise is performed through non-speech frames of the acquired speech signals;
computing noisy speech features from said estimated noise using a feature extraction module (206), wherein Mel-Frequency Cepstral Coefficients are used as said noisy speech features;
estimating noisy model parameters from said computed noisy speech features using a parameter estimation module (208); and
estimating clean parameters from said estimated noise and said estimated noisy model parameters using a clean parameter estimation module (210), wherein the estimation is performed using Reverse Psychoacoustic Compensation (RPC),wherein the RPC is performed iteratively by;
initializing a clean model mean to a certain value of a model mean;
performing Psychoacoustic Compensation on the initialized clean model mean to obtain a compensated clean model mean;
determining if value of the compensated clean model mean is within a certain range of a model mean value;
terminating the iteration in response to determining that the value of the compensated clean model mean is within a certain range of the model mean value; and
updating clean parameters of the clean model mean in response to determining that the value of the compensated clean model mean is outside a certain range of the model mean value.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system is provided for estimating clean speech parameters from noisy speech parameters. The method is performed by acquiring speech signals, estimating noise from the acquired speech signals, computing speech features from the acquired speech signals, estimating model parameters from the computed speech features and estimating clean parameters from the estimated noise and the estimated model parameters.
-
Citations
13 Claims
-
1. A method of estimating clean speech parameters from noisy speech parameters, said method comprising processor implemented steps of:
-
acquiring speech signals using a speech acquisition module (202); estimating noise from said acquired speech signals using a noise estimation module (204), wherein estimation of noise is performed through non-speech frames of the acquired speech signals; computing noisy speech features from said estimated noise using a feature extraction module (206), wherein Mel-Frequency Cepstral Coefficients are used as said noisy speech features; estimating noisy model parameters from said computed noisy speech features using a parameter estimation module (208); and estimating clean parameters from said estimated noise and said estimated noisy model parameters using a clean parameter estimation module (210), wherein the estimation is performed using Reverse Psychoacoustic Compensation (RPC), wherein the RPC is performed iteratively by; initializing a clean model mean to a certain value of a model mean; performing Psychoacoustic Compensation on the initialized clean model mean to obtain a compensated clean model mean; determining if value of the compensated clean model mean is within a certain range of a model mean value; terminating the iteration in response to determining that the value of the compensated clean model mean is within a certain range of the model mean value; and updating clean parameters of the clean model mean in response to determining that the value of the compensated clean model mean is outside a certain range of the model mean value. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A system of estimating clean speech parameters from noisy speech parameters, said system comprising:
-
a processor; a data bus coupled to said processor; and a computer-usable medium embodying computer code, said computer-usable medium being coupled to said data bus, said computer program code comprising instructions executable by said processor and configured for; acquiring speech signals; estimating noise from said acquired speech signals, wherein estimation of noise can be performed through non-speech frames of the acquired speech signals; computing noisy speech features from said estimated noise, wherein Mel-Frequency Cepstral Coefficients are used as said noisy speech features; estimating noisy model parameters from said computed noisy speech features; estimating clean parameters from said estimated noise and said estimated noisy model parameters, wherein the estimation is performed using reverse psychoacoustic compensation (RPC), wherein the RPC is performed iteratively by; initializing a clean model mean to a certain value of a model mean; performing Psychoacoustic Compensation on the initialized clean model mean to obtain a compensated clean model mean; determining if value of the compensated clean model mean is within a certain range of a model mean value; terminating the iteration in response to determining that the value of the compensated clean model mean is within a certain range of the model mean value; and updating clean parameters of the clean model mean in response to determining that the value of the compensated clean model mean is outside a certain range of the model mean value.
-
-
8. One or more non-transitory machine readable information storage mediums comprising one or more instructions which when executed by one or more hardware processors causes:
-
acquiring of speech signals using a speech acquisition module (202); estimating of noise from said acquired speech signals using a noise estimation module (204), wherein estimation of noise can be performed through non-speech frames of the acquired speech signals; computing of noisy speech features from said acquired speech signals using a feature extraction module (206), wherein Mel-Frequency Cepstral Coefficients are used as said noisy speech features; estimating of noisy model parameters from said computed noisy speech features using a parameter estimation module (208); estimating of clean parameters from said estimated noise and said estimated noisy model parameters using a clean parameter estimation module (210), wherein the estimation is performed using reverse psychoacoustic compensation (RPC), wherein the RPC is performed iteratively by; initializing a clean model mean to a certain value of a model mean; performing Psychoacoustic Compensation on the initialized clean model mean to obtain a compensated clean model mean; determining if value of the compensated clean model mean is within a certain range of a model mean value; terminating the iteration in response to determining that the value of the compensated clean model mean is within a certain range of the model mean value; and updating clean parameters of the clean model mean in response to determining that the value of the compensated clean model mean is outside a certain range of the model mean value. - View Dependent Claims (9, 10, 11, 12, 13)
-
Specification