Speech modeling and enhancement based on magnitude-normalized spectra
First Claim
Patent Images
1. A method comprising:
- converting a frame of a speech signal into the spectral domain to identify a plurality of frequency components;
determining an energy value for the frame;
dividing the plurality of frequency components of the speech signal by the energy value for the frame to form energy-normalized frequency components; and
constructing a model from the energy-normalized frequency components.
2 Assignments
0 Petitions
Accused Products
Abstract
A frame of a speech signal is converted into the spectral domain to identify a plurality of frequency components and an energy value for the frame is determined. The plurality of frequency components is divided by the energy value for the frame to form energy-normalized frequency components. A model is then constructed from the energy-normalized frequency components and can be used for speech recognition and speech enhancement.
67 Citations
20 Claims
-
1. A method comprising:
-
converting a frame of a speech signal into the spectral domain to identify a plurality of frequency components;
determining an energy value for the frame;
dividing the plurality of frequency components of the speech signal by the energy value for the frame to form energy-normalized frequency components; and
constructing a model from the energy-normalized frequency components. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer-readable medium having computer-executable instructions for performing steps comprising:
-
receiving values representing a noisy speech signal; and
using a model of energy-normalized clean-speech spectral values to estimate a noise-reduced value from the noisy speech signal. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A method comprising:
-
receiving an air conduction microphone signal;
receiving an alternative sensor signal;
using the air conduction microphone signal, the alternative sensor signal, and a model of energy-normalized clean speech spectral values to estimate a noise-reduced speech value. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification