System and method for the analysis and synthesis of periodic and non-periodic components of speech signals
First Claim
1. A voice coder comprising:
- a microphone for recording a speech signal from a user;
a frame generator configured to parse the speech signal into a plurality of speech frames;
a pitch detector configured to determine a fundamental period of each of the plurality of speech frames;
a Fourier Transform module configured to generate a spectra for each of the plurality of speech frames;
a sub-band generator configured to parse the spectra of each speech frame into a plurality of sub-bands;
a Hilbert Transform module configured to transform each of the plurality of sub-bands into a time-domain envelope signal;
a similarity module configured to generate a plurality of sub-band voicing factors, wherein each sub-band voicing factor indicates a harmonicity of one of the plurality of sub-bands, and each sub-band voicing factor is based on a periodicity of said time-domain envelope signals associated with one of the plurality of sub-bands;
a frame synthesizer configured to generate a plurality of recomposed frames, each recomposed frame being based on;
a) the spectra for one of said plurality of speech frames, andb) the sub-band voicing factors associated with the plurality of sub-bands for one of said plurality of speech frames; and
a waveform generator configured to generate a recomposed speech signal from the plurality of recomposed frames.
1 Assignment
0 Petitions
Accused Products
Abstract
A voice coder configured to resolve periodic and aperiodic components of spectra is disclosed. The method of voice coding includes parsing the speech signal into a plurality of speech frames; for each of the plurality of speech frames: (a) generating the spectra for the speech frame, (b) parsing the spectra of the speech frame into a plurality of sub-bands, (c) transforming each of the plurality of sub-bands into a time-domain envelope signal, and (d) generating a plurality of sub-band voicing factors, wherein each sub-band voicing factor indicates the harmonicity of one of the plurality of sub-bands, and each sub-band voicing factor is based on the periodicity of one of said time-domain envelope signals associated with one of the plurality of sub-bands. The voice coder may regenerate the speech signal by generating a plurality of recomposed frames, each recomposed frame being based on: (a) the spectra for one of said plurality of speech frames, and (b) the sub-band voicing factors associated with the plurality of sub-bands for one of said plurality of speech frames; and then generating a recomposed speech signal from the plurality of recomposed frames.
7 Citations
12 Claims
-
1. A voice coder comprising:
-
a microphone for recording a speech signal from a user; a frame generator configured to parse the speech signal into a plurality of speech frames; a pitch detector configured to determine a fundamental period of each of the plurality of speech frames; a Fourier Transform module configured to generate a spectra for each of the plurality of speech frames; a sub-band generator configured to parse the spectra of each speech frame into a plurality of sub-bands; a Hilbert Transform module configured to transform each of the plurality of sub-bands into a time-domain envelope signal; a similarity module configured to generate a plurality of sub-band voicing factors, wherein each sub-band voicing factor indicates a harmonicity of one of the plurality of sub-bands, and each sub-band voicing factor is based on a periodicity of said time-domain envelope signals associated with one of the plurality of sub-bands; a frame synthesizer configured to generate a plurality of recomposed frames, each recomposed frame being based on; a) the spectra for one of said plurality of speech frames, and b) the sub-band voicing factors associated with the plurality of sub-bands for one of said plurality of speech frames; and a waveform generator configured to generate a recomposed speech signal from the plurality of recomposed frames. - View Dependent Claims (2, 3, 4, 5, 7, 8, 9, 10, 11)
-
-
6. A method of voice coding, the method comprising:
-
recording a speech signal from a user with a microphone; parsing the speech signal into a plurality of speech frames; determining a fundamental period of each of the plurality of speech frames, generating a spectra for each of the plurality of speech frames; parsing the spectra of each speech frame into a plurality of sub-bands, transforming each of the plurality of sub-bands into a time-domain envelope signal; generating a plurality of sub-band voicing factors, wherein each sub-band voicing factor indicates a harmonicity of one of the plurality of sub-bands, and each sub-band voicing factor is based on a periodicity of said time-domain envelope signals associated with one of the plurality of sub-bands; generating a plurality of recomposed frames, each recomposed frame being based on; a) the spectra for one of said plurality of speech frames, and b) the sub-band voicing factors associated with the plurality of sub-bands for one of said plurality of speech frames, and generating a recomposed speech signal from the plurality of recomposed frames.
-
-
12. A method of filtering a speech signal with a voice coder, the method comprising:
-
recording a speech signal from a user with a microphone; parsing the speech signal into a plurality of speech frames; determining a fundamental period of each of the plurality of speech frames; for each of the plurality of speech frames; a) generating a spectra for the speech frame; b) parsing the spectra of the speech frame into a plurality of sub-bands, c) transforming each of the plurality of sub-bands into a time-domain envelope signal; and d) generating a plurality of sub-band voicing factors, wherein each sub-band voicing factor indicates a harmonicity of one of the plurality of sub-bands, and each sub-band voicing factor is based on a periodicity of said time-domain envelope signals associated with one of the plurality of sub-bands; generating a plurality of recomposed frames, each recomposed frame being based on; a) the spectra for one of said plurality of speech frames; and b) the plurality of sub-band voicing factors associated with the plurality of sub-bands for one of said plurality of speech frames; and generating a recomposed speech signal from the plurality of recomposed frames.
-
Specification