Method and system for low bit rate voice encoding and decoding applicable for any reduced bandwidth requirements including wireless
First Claim
1. A system for encoding and decoding a voice, comprising:
- a vocoder transmitter; and
a vocoder receiver;
wherein the transmitter further comprises;
an automatic gain control (AGC) module;
a first formant filter;
an excitation module operable to implement an excitation analysis;
a spectrum analyzer module adapted to provide a short term frequency spectrum;
an analog to digital converter coupled to the output of the spectrum analyzer module;
a synchronous data channel;
an asynchronous data channel;
a multiplexer operable to combine the outputs from the excitation module and the spectrum analyzer module into a single data stream that is clocked by at least one of;
the synchronous data channel or the asynchronous data channel.
3 Assignments
0 Petitions
Accused Products
Abstract
An implementation of the present invention comprises a voice encoder and decoder method and system that uses voice excitation, eliminating the voice/unvoiced pitch tracking, and the first formant up to 2400 Hertz for synchronous and up to 1600 Hertz for asynchronous, does not use pulse code modulation encoding, but uses the zero crossings only of the first formant, frequency dividing by two and sampling at the formant frequency. The resulting combination uses half or less of the bit rate for excitation and the remainder for short-term spectrum analysis. The spectrum could be updated each 20 milliseconds using 49 bits for the spectrum frame and 49 bits for excitation and one frame bit for synchronous Asynchronous operation could be update at 21.25 milliseconds using 49 bits for the spectrum information and 34 bits for excitation with one bit for frame synchronization. The decoder extracts the excitation, multiplies it by two and uses a Hanning modified sawtooth and spectral flattening to excite the spectrum generator. This waveform produces both even and odd harmonics for both periodic (voiced) and aperiodic (unvoiced) frequencies and gives naturalness to all languages and speakers.
17 Citations
20 Claims
-
1. A system for encoding and decoding a voice, comprising:
-
a vocoder transmitter; and a vocoder receiver; wherein the transmitter further comprises; an automatic gain control (AGC) module; a first formant filter; an excitation module operable to implement an excitation analysis; a spectrum analyzer module adapted to provide a short term frequency spectrum; an analog to digital converter coupled to the output of the spectrum analyzer module; a synchronous data channel; an asynchronous data channel; a multiplexer operable to combine the outputs from the excitation module and the spectrum analyzer module into a single data stream that is clocked by at least one of;
the synchronous data channel or the asynchronous data channel. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A system for encoding and decoding speech, comprising:
-
an encoder including; a first module adapted to generate and output zero crossings in response to voice excitation in a first formant; a second module for dividing the output by two and sampling at 2400 Hertz for synchronous such that a resulting combination uses half of a bit rate for excitation and a remainder for short term spectrum analysis; and means for updating the spectrum each 20 milliseconds using 49 bits for bits for the spectrum and 49 bits for the excitation with one synchronizing bit per frame; and a decoder including; a first module for extracting the excitation; a second module adapted to multiply the excitation by two; a third module adapted to use a Hanning modified sawtooth and spectral flattening to excite a spectrum generator; a fourth module for outputting a waveform that produces both even and odd harmonics for both periodic and aperiodic frequencies.
-
-
20. A system for encoding and decoding speech, comprising:
-
an encoder including; a first module adapted to generate and output zero crossings in response to voice excitation in a first formant; a second module for dividing the output by two and sampling at 1600 Hertz for asynchronous such that a resulting combination uses the 1600 Hertz for excitation and the remainder for short term spectrum analysis; means for updating the spectrum each 21.25 milliseconds using 49 bits for the spectrum and 34 bits and one bit for synchronization giving 84 bits per frame; and a decoder including; a first module for extracting the excitation; a second module adapted to multiply the excitation by two; a third module adapted to use a Hanning modified sawtooth and spectral flattening to excite the spectrum generator; and a fourth module for outputting a waveform that produces both even and odd harmonics for both periodic and aperiodic frequencies.
-
Specification