METHOD AND APPARATUS FOR SIGNAL BANDWIDTH COMPRESSION UTILIZING THE FOURIER TRANSFORM OF THE LOGARITHM OF THE FREQUENCY SPECTRUM MAGNITUDE
Abstract
A bandwidth compression system such as a digital vocoder including an analysis section employs a transducer to convert an input speech wave into an electrical signal which is then digitized by an analog to digital converter. The digitized signal is directed through a spectrum device where the magnitudes of the frequency spectrum of the input speech wave are obtained. These magnitudes are then directed to a logging circuit to obtain the logarithm of the frequency spectrum magnitudes of the input speech signal. The logged magnitudes of the frequency spectrum are then directed to a computer where the discrete Fourier transform of the logged spectrum magnitudes is obtained to form the Fourier transform of the logarithm of the frequency spectrum magnitude (FTLSM) of the input speech signal. An encoding unit selects and encodes certain ones of the FTLSM coefficients for transmission to a remote terminal for synthesis. The encoded signals include pitch data and vocal tract impulse data, both of which are derived from the FTLSM signals. The synthesis section of a vocoder terminal employs a decoding device which decodes the received data and separates it into pitch data and vocal tract impulse data. Connected to the decoding device is a computing device for computing the logarithm of the spectrum envelope of the vocal tract impulse response function using the discrete Fourier transform. The logged spectrum is directed through a delogging device to a fast Fourier transform (FFT) computer where the Fourier sine transform of the received spectrum signals (the impulse response) is obtained. A convolution unit then convolves the pitch data with the impulse response data to yield the desired synthesized speech signal.
37 Claims
-
1. A bandwidth compression system including an analysis section comprising:
- means for generating electrical signals representing the Fourier transform of the logarithm of the magnitudes of the spectrum of an input signal, said input signal having excitation and impulse response information included therein;
first detection means coupled to said means for generating electrical signals and being operative to provide from said electrical signals an output signal representing the excitation information of said input signal; and
second detection means coupled to said means for generating electrical signals and being operative to separate out a predetermined portion of said electrical signals, said predetermined portion representing the impulse response information of said input signal.
-
2. A processor according to claim 1 including a synthesis section comprising:
- impulse response means coupled to said second detection means and being operative in response to the predetermined portion of said electrical signals to generate an output signal corresponding to the impulse response information;
excitation means coupled to said first detection means and being operative in response to the output signal from said first detection means to generate an excitation carrier signal; and
convolution means having input connections from said impulse response means and from said excitation means and being operative to convolve the output signals from said impulse response means and from said excitation means to thereby synthesize the input signal.
-
3. A digital vocoder including an analysis section comprising:
- means for obtaining spectrum magnitude signals of an input speech signal having voicing and vocal tract information;
logging means coupled to said means for obtaining spectrum magnitude signals and being operative to generate output signals representing the logarithm of the spectrum magnitude of the input speech signal;
first Fourier transform means coupled to said logging means and being operative to generate output signals having magnitude and positions and representing the Fourier transform of the logarithm of spectrum magnitudes of the input speech signal;
pitch detection logic means coupled to said Fourier transform means and being operative to extract a pitch signal from the output signal of said first Fourier transform means, said pitch signal having a magnitude representing the voicing information of the input speech signal; and
selecting means coupled to said first Fourier transform means and being operative to select a predetermined number of the output signals of said first Fourier transform means, said predetermined number of output signals representing the vocal tract information of the input speech signal.
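The analysis chain of claim 3 (spectrum magnitude, logarithm, Fourier transform, pitch pick, coefficient selection) can be sketched in a few lines of Python. This is an illustrative sketch, not the claimed circuit: the function names `dft`, `ftlsm` and `analyze`, the parameters `num_tract`, `min_lag` and `floor`, the natural logarithm and the 1/N scaling are all assumptions that do not appear in the claims.

```python
import cmath
import math


def dft(x):
    """Naive O(N^2) discrete Fourier transform (adequate for short frames)."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)]


def ftlsm(frame, floor=1e-9):
    """Fourier transform of the log spectrum magnitude of one frame."""
    magnitudes = [abs(v) for v in dft(frame)]
    # Assumed guard: clamp tiny magnitudes so the logarithm stays finite.
    logged = [math.log(max(m, floor)) for m in magnitudes]
    # The log-magnitude spectrum of a real frame is real and even, so its
    # transform is real; the 1/N scaling is an assumed convention.
    return [v.real / len(frame) for v in dft(logged)]


def analyze(frame, num_tract=8, min_lag=20):
    """Return (vocal tract coefficients, pitch position, pitch magnitude)."""
    c = ftlsm(frame)
    half = len(c) // 2
    # Pitch pick: largest-magnitude coefficient above an assumed minimum lag.
    pitch_lag = max(range(min_lag, half), key=lambda q: abs(c[q]))
    # Selection: the first num_tract coefficients carry the envelope.
    return c[:num_tract], pitch_lag, c[pitch_lag]
```

For a unit pulse followed by a half-amplitude copy ten samples later, `analyze(frame, num_tract=4, min_lag=5)` reports the peak at position 10, which is how a repeating excitation shows up in the transformed log spectrum.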
-
4. A digital vocoder according to claim 3 including an encoding means coupled to said selecting means and being operative to quantize at a predetermined rate and scale by a predetermined factor each of the predetermined number of output signals of said Fourier transform means selected by said selecting means.
-
5. A digital vocoder according to claim 3 including a synthesis section comprising:
- second Fourier transform means being operative in response to the selected output signals of said first Fourier transform means to generate output signals representing the Fourier transform of said selected output signals of said first Fourier transform means;
delogging means coupled to said second Fourier transform means and being operative to generate output signals representing the antilogarithm of the output signals of said second Fourier transform means;
third Fourier transform means coupled to said delogging means and being operative to generate output signals representing the vocal tract information of the input speech signal;
pitch carrier generator coupled to said pitch detection logic means and being operative in response to said pitch signal to generate pitch carrier signals having predetermined rates; and
convolution unit coupled to said third Fourier transform means and to said pitch carrier generator and being operative to combine the output signals of said third Fourier transform means and the pitch carrier signals from said pitch carrier generator to thereby generate the synthesized version of the input speech signal.
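The synthesis chain of claim 5 can be sketched the same way: a second transform turns the selected coefficients into a log spectrum envelope, delogging recovers the envelope, a third transform yields an impulse response, and the result is convolved with pitch pulses. The names `log_envelope`, `impulse_response`, `convolve` and `synthesize` are hypothetical, and the sketch assumes the coefficients are scaled so that the envelope is the coefficient at index 0 plus twice a cosine series of the rest. The response produced here is zero phase and wraps around circularly; how the actual apparatus positions it is not stated in the claims.

```python
import math


def log_envelope(coeffs, n_bins):
    """Second transform: cosine series of the selected coefficients."""
    return [coeffs[0] + 2.0 * sum(c * math.cos(2 * math.pi * k * n / n_bins)
                                  for k, c in enumerate(coeffs) if k > 0)
            for n in range(n_bins)]


def impulse_response(coeffs, n_bins):
    """Delog the envelope, then transform it into an impulse response."""
    envelope = [math.exp(v) for v in log_envelope(coeffs, n_bins)]
    # The envelope is real and even, so a cosine transform is sufficient.
    return [sum(e * math.cos(2 * math.pi * n * t / n_bins)
                for n, e in enumerate(envelope)) / n_bins
            for t in range(n_bins)]


def convolve(excitation, response):
    """Direct linear convolution of two sequences."""
    out = [0.0] * (len(excitation) + len(response) - 1)
    for i, e in enumerate(excitation):
        if e:
            for j, r in enumerate(response):
                out[i + j] += e * r
    return out


def synthesize(coeffs, pulse_positions, n_bins, length):
    """Place unit pitch pulses and convolve them with the impulse response."""
    excitation = [0.0] * length
    for p in pulse_positions:
        excitation[p] = 1.0
    return convolve(excitation, impulse_response(coeffs, n_bins))[:length]
```

With a single zero coefficient the envelope is flat, the impulse response collapses to a unit sample, and `synthesize` simply returns the pulse train.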
-
6. A digital vocoder according to claim 3 wherein said means for obtaining the spectrum magnitude signals of an input speech signal includes:
- transducer means being operative to convert said input signal into an electrical input speech signal;
an analog to digital converter connected to said transducer means and being operative to convert said electrical input speech signal into a digital speech signal;
computer means coupled to said analog to digital converter and being operative to generate real and imaginary signals representing the spectrum of the digital speech signal; and
a magnitude computation circuit connected to said computer means and being operative to combine in a predetermined manner said real and imaginary signals to generate the spectrum magnitude signals of said input speech signal.
-
7. A digital vocoder according to claim 6 further including a normalization unit connected between said analog to digital converter means and said computer means and being operative to change the level of the input signals a predetermined factor to maintain the peak value of the digital speech signal to said computer means within a predetermined dynamic range.
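One way to picture the normalization of claim 7 is a gain applied to each frame so that its peak stays inside a fixed range. The sketch below assumes a power-of-two gain (suggested by the base-2 logarithm in claim 13, but not stated in claim 7); `normalization_gain`, `normalize` and `full_scale` are hypothetical names.

```python
def normalization_gain(frame, full_scale):
    """Largest power-of-two gain that keeps the frame peak within full_scale."""
    peak = max(abs(s) for s in frame)
    gain = 1
    # Double the gain while the scaled peak would still fit the range.
    while peak > 0 and peak * gain * 2 <= full_scale:
        gain *= 2
    return gain


def normalize(frame, full_scale):
    """Scale the frame and return it together with the gain that was used."""
    gain = normalization_gain(frame, full_scale)
    return [s * gain for s in frame], gain
```

For example, `normalize([3, -5, 2], 127)` returns `([48, -80, 32], 16)`; the gain is kept so that a later stage can undo it.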
-
8. A digital vocoder according to claim 6 further including a weighting function circuit connected between said analog to digital converter means and said computer means and being operative to weight the digital speech signal to obtain a smooth spectral signal from said computer means.
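The weighting of claim 8 corresponds to what is now usually called windowing. The claim does not name the weighting function; the sketch below assumes a Hann window purely for illustration, and `hann_weights` and `apply_weighting` are hypothetical names.

```python
import math


def hann_weights(n):
    """Hann window of length n (an assumed choice of weighting function)."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / (n - 1)) for i in range(n)]


def apply_weighting(frame):
    """Taper the frame ends so the computed spectrum is smoother."""
    return [s * w for s, w in zip(frame, hann_weights(len(frame)))]
```

The weights rise from zero at the frame edges to one at the center, which reduces the spectral leakage caused by cutting the speech wave into frames.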
-
9. A digital vocoder according to claim 3 wherein said pitch detection logic means includes:
- selection means having an input connection from said first Fourier transform means and being operative to select the output signal of said first Fourier transform means having the largest magnitude;
first comparator means having an input connection from said selection means and a first and second output connection, said first comparator means being operative to compare the magnitude of the selected output signal of said selection means to a predetermined threshold level and to generate an output signal at said first output connection if the magnitude of said selected output signal exceeds the predetermined threshold level and to generate a predetermined output signal at said second output connection if the magnitude of said selected output signal is less than the predetermined threshold level; and
buffer storage means having a first input connection connected to the common juncture of said selection means and said first comparator means, a second input connection connected to the first output connection of said first comparator means and an output terminal and being operative to store the output signal from said selection means and to shift the stored signal to the output terminal upon the receipt of a signal from said first comparator means, whereby an unvoiced speech signal is indicated when said first comparator means has an output signal at said second output connection and a voiced speech signal is indicated when the output signal of said first Fourier transform means is shifted to the output of said buffer storage means.
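The pitch detection logic of claim 9 reduces to a peak pick followed by a threshold test. A minimal sketch, assuming the coefficients arrive as a list and that an unvoiced frame is reported as `(False, None, 0.0)`; `detect_pitch`, `threshold` and `search_start` are hypothetical names.

```python
def detect_pitch(coeffs, threshold, search_start=0):
    """Return (voiced, position, magnitude) for one frame of coefficients."""
    # Selection means: the coefficient with the largest magnitude.
    position = max(range(search_start, len(coeffs)), key=lambda i: abs(coeffs[i]))
    magnitude = abs(coeffs[position])
    # First comparator: release the stored peak only if it exceeds the threshold.
    if magnitude > threshold:
        return True, position, magnitude
    # Otherwise emit a fixed "unvoiced" indication (assumed representation).
    return False, None, 0.0
```

For example, `detect_pitch([0.1, 0.05, 0.9, 0.2], 0.5)` returns `(True, 2, 0.9)`, while raising the threshold to 1.0 returns `(False, None, 0.0)`.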
-
10. A digital vocoder according to claim 9 further including means for determining voicing information having input connections connected to said means for obtaining spectrum magnitude signals and the first output connection of said first comparator means, a first output connection connected to the second input connection of said buffer storage means and a second output connection and being operative in response to the spectrum magnitude signals to provide an output at said first output connection when said spectrum magnitude signals include a voiced signal and to provide an output signal at said second output connection when said spectrum magnitude signals include an unvoiced signal.
-
11. A digital vocoder according to claim 10 wherein said means for determining voicing information comprises:
- means connected to said means for obtaining spectrum magnitude signals for computing a first output signal representing the low-band energy of the spectrum magnitude signals and a second output signal representing the high-band energy of the spectrum magnitude signals;
means for combining said first output signal representing the low-band energy with said second output signal representing the high-band energy to form a composite signal representing the ratio of said first and second output signals;
second comparator means having an input connection coupled to said means for computing, an output connection, and a predetermined threshold level and being operative to generate an output signal at its output connection when the output signal representing the low-band energy is greater than its predetermined threshold level;
third comparator means having an input connection coupled to said means for combining, an output connection and a predetermined threshold level and being operative to generate an output signal at its output connection when said composite signal representing the ratio of said first and second output signals is greater than its predetermined threshold level; and
fourth comparator means having a first input connection coupled to the output connection of said second comparator means, a second input connection coupled to the output connection of said third comparator means and a first output connection coupled to said buffer storage means and a second output connection and being operative to generate a signal at its first output connection when two predetermined signals are received at its first and second input connections, respectively, and to generate a signal at its second output connection when only one predetermined signal is received at either its first or second input connection.
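The voicing decision of claim 11 combines two tests: enough low-band energy, and a large enough ratio of low-band to high-band energy. The sketch below assumes energy is the sum of squared magnitudes and that the band boundary is a bin index `split`; both are assumptions, as are the function names. The claim describes the case where only one test passes; treating the case where neither passes as unvoiced is also an assumption.

```python
def band_energies(magnitudes, split):
    """Low-band and high-band energy of a magnitude spectrum."""
    low = sum(m * m for m in magnitudes[:split])
    high = sum(m * m for m in magnitudes[split:])
    return low, high


def is_voiced(magnitudes, split, energy_threshold, ratio_threshold):
    """Voiced only when both the energy test and the ratio test pass."""
    low, high = band_energies(magnitudes, split)
    ratio = low / high if high > 0 else float("inf")
    energy_ok = low > energy_threshold   # second comparator
    ratio_ok = ratio > ratio_threshold   # third comparator
    return energy_ok and ratio_ok        # fourth comparator
```

A spectrum such as `[4, 3, 1, 1]` split at bin 2 has a low-band energy of 25 and a ratio of 12.5, so it is classed as voiced for thresholds of 10 and 5; the mirrored spectrum `[1, 1, 3, 4]` fails both tests.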
-
12. A digital vocoder according to claim 7 further including a denormalizing unit coupled to said normalization unit and to said first Fourier transform means and being operative to alter the magnitude of the output signal of said first Fourier transform means in a predetermined manner related to the predetermined factor of said normalization unit.
-
13. A digital vocoder according to claim 12 wherein said denormalizing unit is a computer capable of solving the equation $C_o = C'_o - 16\sqrt{N}\,\log_2(G_N)$, where $C_o$ is the altered magnitude, $C'_o$ is the unaltered magnitude, $N$ is the selected predetermined number of output signals from said first Fourier transform means and $G_N$ is the predetermined factor of said normalization unit.
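As a worked example of the claim 13 correction, read here as the unaltered magnitude minus 16 times the square root of N times the base-2 logarithm of the normalization factor, the sketch below evaluates it for sample values. The function name `denormalize` and the sample numbers are illustrative assumptions only.

```python
import math


def denormalize(unaltered, n, gain):
    """Altered magnitude = unaltered - 16 * sqrt(N) * log2(G_N)."""
    return unaltered - 16 * math.sqrt(n) * math.log2(gain)
```

With N = 16 and a normalization factor of 4, the correction is 16 × 4 × 2 = 128, so `denormalize(300.0, 16, 4.0)` gives 172.0. A factor of 1 leaves the magnitude unchanged because its logarithm is zero.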
-
14. A digital vocoder according to claim 4 wherein said encoding means comprises:
- scaling factor storage means operative to store a predetermined scaling factor for each of the predetermined number of output signals of said first Fourier transform means;
scaling means coupled to said scaling factor storage means and to selecting means and being operative to add each of the predetermined scaling factors to a separate one of the predetermined number of output signals of said first Fourier transform means to eliminate negative values in said predetermined number of output signals;
ratio storage means operative to store a predetermined ratio signal for each of the predetermined number of output signals of said first Fourier transform means; and
multiplier means coupled to said scaling means and said ratio storage means and being operative to multiply each of the scaled output signals of said scaling means by a corresponding ratio signal stored in said ratio storage means to thereby quantize each of the predetermined numbers of output signals of said first Fourier transform means.
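The encoder of claim 14 adds a stored offset to each selected coefficient so that it cannot be negative, then multiplies by a stored ratio to map it onto the quantizer range. The sketch below assumes rounding to the nearest integer as the final step and adds a `decode` helper that simply inverts the two operations; neither detail is stated in the claim, and all names are hypothetical.

```python
def encode(coeffs, offsets, ratios):
    """Shift each coefficient to be non-negative, then scale and round."""
    codes = []
    for c, offset, ratio in zip(coeffs, offsets, ratios):
        shifted = c + offset                  # scaling means: remove negatives
        codes.append(round(shifted * ratio))  # multiplier means, rounding assumed
    return codes


def decode(codes, offsets, ratios):
    """Assumed inverse: undo the ratio, then undo the offset."""
    return [code / ratio - offset
            for code, offset, ratio in zip(codes, offsets, ratios)]
```

For example, `encode([-1.5, 0.25], [2.0, 2.0], [4.0, 8.0])` yields `[2, 18]`, and decoding those codes with the same tables recovers `[-1.5, 0.25]`. Using a separate offset and ratio per coefficient lets each coefficient get its own range and step size.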
-
15. A digital vocoder according to claim 14 further including gating means coupled to said multiplier means and being operable to gate certain ones of said predetermined number of output signals of said first Fourier transform means at a first predetermined rate and to gate the remainder of the output signals of said first Fourier transform means at a second predetermined rate.
-
16. A digital vocoder according to claim 5 wherein said second Fourier transform means is a Fourier transform computer means operable to solve the expression where Vn is the nth frequency sample of the selected output signals of said first Fourier transform means, Ck is the kth sample of the selected output signals of said first Fourier transform means and K and R are the limits of summation.
-
17. A digital vocoder according to claim 5 wherein said pitch carrier generator includes:
- first means responsive to said pitch signal from said pitch detection logic means for generating a first predetermined pitch carrier signal when the magnitude of the pitch signal indicates a voiced signal;
second means responsive to said pitch signal from said pitch detection logic means for generating a second predetermined pitch carrier signal when the magnitude of the pitch signal indicates an unvoiced signal; and
gating means coupled to said first and second means for generating and being operative to gate a first predetermined pitch carrier signal to said convolution means when the magnitude of the pitch signal is less than a predetermined magnitude and to gate a second predetermined pitch carrier signal to said convolution means when the magnitude of the pitch signal is greater than a predetermined magnitude.
-
18. A digital vocoder according to claim 17 wherein said first means for generating includes:
- third means for generating signals, the magnitudes of which describe a predetermined function;
fourth means for generating signals, the magnitudes of which describe the slope of a line connecting the magnitudes of two successive pitch signals from said pitch detection means; and
first comparator means having input connections coupled to said third and fourth means for generating and an output connection coupled to said gating means and being operative to generate a first predetermined pulse when the signals from said fourth means for generating are equal to or greater than the magnitude of the signal from said third means for generating.
-
19. A digital vocoder according to claim 18 including an inhibiting means responsive to said pitch signal from said pitch detection logic means to inhibit the second predetermined pitch carrier signal of said second means for generating.
-
20. A digital vocoder according to claim 18 wherein said third means for generating signals includes:
- first storage counter means having a first input connection and an output connection and being operative to store a first predetermined signal, to add to said first predetermined signal a second predetermined signal appearing at said first input connection and to supply the resultant signal to said output connection;
slope means for generating a third predetermined signal; and
first summation means having a first input connection coupled to the output connection of said first storage counter means, a second input connection coupled to said slope means and an output connection coupled to said first input connection of said storage counter means and to said gating means of said pitch carrier generators, said first summation means being operative to add the resultant signal of said first storage counter means to the third predetermined signal from said slope means to form said second predetermined signal and to direct said second predetermined signal simultaneously to said gating means of said pitch carrier generator and to said first storage counter means to update said first predetermined signal stored therein.
-
21. A digital vocoder according to claim 18 wherein said fourth means for generating signals includes:
- means for computing a slope signal $m$ wherein $m = (T_{p(n-1)} - T_{pn})/T$, where $T_{pn}$ is a first pitch signal received from said pitch detection logic at a first predetermined time, $T_{p(n-1)}$ is a second pitch signal received from said pitch detection logic at a second predetermined time and $T$ is the elapsed time between said first and second predetermined times;
second storage counter means having a first input connection and an output connection and being operative to store a first predetermined signal, to add to said first predetermined signal a second predetermined signal appearing at said first input connection and to supply the resultant signal to said output connection; and
second summation means having a first input connection coupled to the output connection of said second storage counter means, a second input connection coupled to said means for computing a slope signal and an output connection coupled to said first input connection of said second storage counter means and to said gating means of said pitch carrier generator, said second summation means being operative to add the resultant signal of said second storage counter means to the slope signal from said means for computing a slope signal to form said second predetermined signal and to direct said second predetermined signal to said gating means of said pitch carrier generator and to said second storage means to update said first predetermined signal stored therein.
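Claims 18 to 21 describe a voiced pitch carrier generator built from two accumulating signals and a comparator. One loose reading is a ramp that is compared against a pitch period interpolated along a straight line between two successive pitch values, with a pulse emitted at each crossing. The sketch below follows that reading only. It takes the slope as the difference between the pitch value at time n-1 and the pitch value at time n, divided by the elapsed time, and assumes the value at n-1 is the earlier one. The comparison direction, the unit ramp step, the reset after each pulse and the names `slope_signal` and `pitch_pulses` are all assumptions.

```python
def slope_signal(tp_prev, tp_now, elapsed):
    """m = (Tp(n-1) - Tpn) / T, as read from claim 21."""
    return (tp_prev - tp_now) / elapsed


def pitch_pulses(tp_prev, tp_now, frame_len):
    """Pulse positions in one frame for a linearly interpolated pitch period."""
    m = slope_signal(tp_prev, tp_now, frame_len)
    pulses = []
    ramp = 0.0
    for t in range(frame_len):
        target = tp_prev - m * t   # pitch period interpolated along the line
        ramp += 1.0                # ramp grows by a fixed step each sample
        if ramp >= target:         # comparator: emit a pulse at the crossing
            pulses.append(t)
            ramp = 0.0             # assumed reset after each pulse
    return pulses
```

With a constant period of 4 over 12 samples the pulses fall at `[3, 7, 11]`; when the period grows from 2 to 4 over 8 samples they fall at `[2, 6]`, so the spacing stretches gradually instead of jumping at the frame boundary.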
-
22. A digital vocoder according to claim 5 including a weighting circuit having an input connection coupled to said third Fourier transform means and an output connection coupled to said convolution means and being operative to apply weighting function signals to the output signals of said third Fourier transform means to thereby improve the quality of the synthesized version of the input speech signal.
-
23. A digital vocoder according to claim 22 wherein the weighting circuit includes:
- a masking circuit having an input connection coupled to said third Fourier transform means and being operative to select a predetermined number of the output signals of said third Fourier transform means;
weighting function storage means being operative to store a predetermined number of signals corresponding to the predetermined number of output signals selected by said masking circuit; and
multiplier means having input connections coupled to said masking circuit and to said weighting function storage means and an output connection coupled to said convolution means and being operative to multiply each of the predetermined number of output signals selected by said masking circuit by a different one of the predetermined number of signals stored in said weighting function storage means to thereby weight the vocal tract data being directed to said convolution means.
-
24. A digital vocoder according to claim 5 wherein said convolution unit includes:
- logic means having a first input connection coupled to said pitch carrier generator, a second input connection coupled to said third Fourier transform means, and first, second, third and fourth output connections, said logic means being operative in response to a first predetermined time period to provide a data path from said first and second input connections to said first and second output connections, respectively, and being operative in response to a second predetermined time period to provide a data path from said first and second input connections to said third and fourth output connections respectively;
first storage means having first and second input connections coupled respectively to said first and second output connections of said logic means and a plurality of output connections, said first storage means being operative to store the output signals representing the vocal tract information received from the third Fourier transform means via the data path established by said logic means during said first predetermined time period and to gate from a different one of said plurality of output connections a complete set of vocal tract signals upon the receipt of each signal from said pitch carrier generator during said first predetermined time period;
second storage means having first and second input connections coupled respectively to said third and fourth output connections of said logic means and a plurality of output connections, said second storage means being operative to store the output signals representing the vocal tract information received from the third Fourier transform means via the data path established by said logic means during said second predetermined time period and to gate from a different one of said plurality of output connections a complete set of vocal tract signals upon receipt of each signal from said pitch carrier generator during said second predetermined time period; and
summing means having a plurality of input connections each coupled to one of said plurality of output connections of said first and second storage means and being operative to add the vocal tract signals from said first and second storage means whereby a synthesized version of the input speech signal is obtained.
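Functionally, the convolution unit of claim 24 releases a complete copy of the current vocal tract response at every pitch pulse and sums the overlapping copies. The sketch below shows only that overlap-add behavior. It does not model the two alternating storage means; instead it assumes the caller supplies, for each pulse, the response in force at that time. `overlap_add` and its parameters are hypothetical names.

```python
def overlap_add(pulse_positions, responses, total_len):
    """Sum one impulse response per pitch pulse into the output signal."""
    out = [0.0] * total_len
    for pos, response in zip(pulse_positions, responses):
        for j, value in enumerate(response):
            if pos + j < total_len:
                # Overlapping tails from earlier pulses simply add.
                out[pos + j] += value
    return out
```

For example, `overlap_add([0, 2], [[1.0, 0.5, 0.25], [2.0, 1.0, 0.5]], 5)` returns `[1.0, 0.5, 2.25, 1.0, 0.5]`: the tail of the first response overlaps the start of the second at sample 2. Keeping two stores, as the claim does, lets a response that started in one time period finish while the next period's response is being loaded.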
-
25. A vocoder system for synthesizing a first speech signal and analyzing a second speech signal simultaneously, said first and second speech signals including voicing and vocal tract information, said digital vocoder comprising:
- means for generating a pitch carrier signal from said first speech signal;
means for obtaining the frequency spectrum magnitude signals of said first speech signal;
means coupled to said means for obtaining the frequency spectrum magnitudes of said first speech signal for converting the frequency spectrum magnitudes into signals having a first predetermined symmetry;
means for obtaining the frequency spectrum magnitudes of a second speech signal;
means coupled to said means for obtaining the frequency spectrum magnitudes of said second speech signal for generating signals having a second predetermined symmetry and representing the logarithm of the frequency spectrum magnitudes of said second speech signal;
summing means coupled to said means for converting and to said means for generating and being operative to sum said signals having a first predetermined symmetry and said signals having a second predetermined symmetry to form a composite signal;
computing means having an input connection coupled to said summing means and first and second output connections, said computing means being operative to compute a first and second set of signals representing the complex Fourier transform of said composite signal, said first set of signals having said first predetermined symmetry and being directed to said first output connection and said second set of signals having said second predetermined symmetry and being directed to said second output connection;
convolution means coupled to said means for generating a pitch carrier signal and to said first output connection of said computing means and being operative to combine in a predetermined manner said pitch carrier signal and said first set of signals having said first predetermined symmetry to thereby generate a synthesized version of said first speech signal;
pitch detection means coupled to said second output connection of said computing means and being operative to extract the voicing information of said second speech signal from said second set of signals having said second predetermined symmetry; and
selection means coupled to said second output connection of said computing means and being operative to select a predetermined number of said set of signals having said second predetermined symmetry, said selected signals representing the vocal tract information of said second speech signal.
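Claim 25 relies on a property of the Fourier transform that lets one complex transform serve two signals at once. If the two predetermined symmetries are taken to be even and odd (an assumption; the claim does not name them), then the transform of a real even sequence is purely real and the transform of a real odd sequence is purely imaginary, so the two can be summed, transformed together and separated afterwards. The names `dft` and `split_transform` are hypothetical.

```python
import cmath


def dft(x):
    """Naive O(N^2) discrete Fourier transform."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)]


def split_transform(even_part, odd_part):
    """Transform an even and an odd real sequence with a single DFT."""
    composite = [e + o for e, o in zip(even_part, odd_part)]
    spectrum = dft(composite)
    even_transform = [v.real for v in spectrum]  # real part: even sequence
    odd_transform = [v.imag for v in spectrum]   # imaginary part: odd sequence
    return even_transform, odd_transform
```

Here a sequence of length N counts as even when sample n equals sample N-n and odd when it equals the negative of sample N-n. Sharing one transform in this way is what would allow synthesis of one speech signal and analysis of another to run on the same computing means.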
-
26. A vocoder system for synthesizing a first speech signal and analyzing a second speech signal simultaneously with said first and second speech signals including voicing and vocal tract information, said digital vocoder system comprising:
- means for generating a pitch carrier signal from said first speech signal;
means for obtaining the frequency spectrum magnitudes of said first speech signal;
means coupled to said means for obtaining the frequency spectrum magnitude of said first speech signal for converting the frequency spectrum magnitudes into signals having a first predetermined symmetry;
computing means having first and second input ports and first, second, third and fourth output ports and being operable to compute simultaneously the Fourier transform of a set of first predetermined input signals at said first input port, said set of first predetermined signals having a composite symmetry of said first and second predetermined symmetries and the Fourier transform of a set of second predetermined input signals at said second input port, said set of second predetermined signals having first and second predetermined symmetries and operable to direct to said first, second, third and fourth output ports respectively a first set of output signals representing the Fourier transform of the portion of the set of first predetermined input signals having the second predetermined symmetry, a second set of output signals representing the Fourier transform of the portion of the set of first predetermined input signals having the first predetermined symmetry, a third set of output signals having the first predetermined symmetry and representing the Fourier transform of the portion of the set of second predetermined input signals at said second input port and a fourth set of output signals representing the Fourier transform of the portion of the set of second predetermined input signals having the second predetermined symmetry;
sampling means having an output connection coupled to said first input port of said computing means and being operable to sample said second speech signal over a first predetermined time interval, said first and second sets of output signals of said computing means representing the spectrum of said sampled second input speech signal;
magnitude means coupled to said first and second output ports of said computing means and being operative to combine in a predetermined manner said first and second sets of output signals of said computing means to generate signals representing the frequency spectrum magnitudes of said second speech signal;
means coupled to said magnitude means for generating output signals having a second predetermined symmetry and representing the logarithm of the frequency spectrum magnitudes of said second speech signal;
summing means having input connections coupled to said means for converting and to said means for generating and an output connection coupled to said second input port of said computing means and being operative to sum said signals having a first predetermined symmetry with said signals having said second predetermined symmetry to form said set of second predetermined input signals, whereby said third set of output signals of said computing means represents the vocal tract information of said first speech signal and said fourth set of output signals of said computing means is the Fourier transform of the logarithm of the spectrum magnitudes representing the voicing and vocal tract data of said second speech input signal;
pitch detection logic means coupled to the fourth output port of said computing means and being operative to extract a pitch signal from the fourth set of output signals of said computing means to thereby represent the voicing information of said second input speech signal;
selecting means coupled to the fourth output port of said computing means and being operative to select a predetermined number of the fourth set of output signals to represent the vocal tract information of said second input speech signal; and
convolution means coupled to said means for generating a pitch carrier signal from said first speech signal and to the third output port of said computing means and being operative to combine in a predetermined manner the pitch carrier signals with the third set of output signals of said computing means to synthesize the first speech signal.
27. A vocoder system according to claim 26 wherein said means for generating a pitch carrier signal includes:
- first means responsive to said first speech signal for generating a first predetermined pitch carrier signal when the magnitude of the pitch signal indicates a voiced signal;
second means responsive to said pitch signal from said pitch detection logic means for generating a second predetermined pitch carrier signal when the magnitude of the pitch signal indicates an unvoiced signal; and
gating means coupled to said first and second means for generating and being operative to gate a first predetermined pitch carrier signal to said convolution means when the magnitude of the pitch signal is less than a predetermined magnitude and to gate a second predetermined pitch carrier signal to said convolution means when the magnitude of the pitch signal is greater than a predetermined magnitude.
28. A vocoder system according to claim 27 wherein said first means for generating includes:
- third means for generating signals, the magnitudes of which describe a predetermined function;
fourth means for generating signals, the magnitudes of which describe the slope of a line connecting the magnitudes of the voiced information of two successive first input signals; and
first comparator means having input connections coupled to said third and fourth means for generating and an output connection coupled to said gating means of said means for generating a pitch carrier signal and being operative to generate a first predetermined pulse when the signals from said fourth means for generating are equal to or greater than the magnitude of the signal from said third means for generating.
29. A vocoder system according to claim 28 including an inhibiting means responsive to said voicing information of said first input speech signal to inhibit the second predetermined pitch carrier signal of said second means for generating when the voicing information exceeds a predetermined magnitude.
30. A vocoder system according to claim 29 wherein said third means for generating signals includes:
- first storage counter means having a first input connection and an output connection and being operative to store a first predetermined signal, to add to said first predetermined signal a second predetermined signal appearing at said first input connection and to supply the resultant signal to said output connection;
slope means for generating a third predetermined signal; and
first summation means having a first input connection coupled to the output connection of said first storage counter means, a second input connection coupled to said slope means and an output connection coupled to said first input connection of said storage counter means and to said gating means of said means for generating a carrier generator, said first summation means being operative to add the resultant signal of said first storage counter means to the third predetermined signal from said slope means to form said second predetermined signal and to direct said second predetermined signal simultaneously to said gating means of said means for generating a pitch carrier and to said first storage counter means to update said first predetermined signal stored therein.
31. A vocoder system according to claim 30 wherein said fourth means for generating signals includes:
- means for computing a slope signal m wherein $m = (T_{p(n-1)} - T_{pn})/T$, where $T_{pn}$ is a first voicing signal received from said first input speech signal at a first predetermined time, $T_{p(n-1)}$ is a voicing signal received from said first input speech signal at a second predetermined time and $T$ is the elapsed time between said first and second predetermined times;
second storage counter means having a first input connection and an output connection and being operative to store a first predetermined signal, to add to said first predetermined signal a second predetermined signal appearing at said first input connection and to supply the resultant signal to said output connection; and
second summation means having a first input connection coupled to the output connection of said second storage counter means, a second input connection coupled to said means for computing a slope signal and an output connection coupled to said first input connection of said second storage counter means and to said gating means of said means for generating a pitch carrier signal, said second summation means being operative to add the resultant signal of said second storage counter means to the slope signal from said means for computing a slope signal to form said second predetermined signal and to direct said second predetermined signal to said gating means of said means for generating a pitch carrier signal and to said second storage counter means to update said first predetermined signal stored therein.
32. A vocoder system according to claim 25 wherein said means for obtaining the frequency spectrum magnitude of said first speech signal includes:
- Fourier transform computer means operable to solve the expression where Vn is the nth frequency sample of said first speech signal, Ck is the kth sample of said first speech signal and k and R are predetermined limits of summation; and
delogging computer means operative to obtain the antilogarithm of said expression to yield the frequency spectrum magnitude of said first speech signal.
33. A vocoder system according to claim 26 wherein:
- said first and second input ports of said computing means are real and imaginary input ports respectively;
said first and second predetermined symmetries of said set of first predetermined input signals are even and odd symmetries respectively;
said set of first predetermined input signals includes 256 samples of said input speech signal;
said first set of output signals at said first output port of said computing means includes 128 samples having even symmetry and representing the Fourier transform of the even portion of the 256 samples at the real input port of said computing means;
said second set of output signals at said second output port of said computing means includes 128 samples representing the Fourier transform of the portion of the 256 input samples at said real input port having odd symmetry, said first and second sets of output signals representing, respectively, the real and imaginary parts of the frequency spectrum of the 256 samples of the second input speech signal at the real input port of said computing means;
said set of second predetermined input signals at said imaginary input port of said computing means includes 256 samples having even and odd symmetry associated therewith, said even symmetry portion representing the logarithm of the spectrum magnitudes of the second input speech signal and said odd symmetry portion representing the frequency spectrum of the first input speech signal;
said third set of output signals at the third output port of said computing means includes 128 samples having odd symmetry and representing the Fourier transform of the odd symmetry portion of 256 samples at said imaginary input port of said computing means, said 128 samples at said third output port of said computing means represents the vocal tract information of said first speech signal; and
said fourth set of output signals at the fourth output port of said computing means includes 128 samples having even symmetry and representing the Fourier transform of the logarithm of the spectrum magnitudes from which the vocal tract and the voicing information of the second input speech signal are derived.
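The arrangement in claim 33, in which one sequence is applied to the real input port and another to the imaginary input port and the outputs are then separated by symmetry, resembles the well-known technique of transforming two real sequences with a single complex Fourier transform. The following is a minimal sketch of that general technique only, not the claimed apparatus. The function names `dft` and `dft_two_real` are hypothetical, a naive transform stands in for whatever fast algorithm the computing means uses, and the sketch returns two complex spectra rather than four separate sets of 128 samples.

```python
import cmath
import math


def dft(x):
    """Naive O(N^2) discrete Fourier transform (stand-in for an FFT)."""
    n = len(x)
    return [
        sum(x[k] * cmath.exp(-2j * math.pi * i * k / n) for k in range(n))
        for i in range(n)
    ]


def dft_two_real(x, y):
    """Transform two real sequences of equal length with one complex DFT.

    x is placed on the real input and y on the imaginary input. Because the
    spectrum of a real sequence is conjugate-symmetric, the two spectra can
    be recovered from the symmetric and antisymmetric parts of the result.
    """
    n = len(x)
    z = dft([complex(a, b) for a, b in zip(x, y)])
    spec_x, spec_y = [], []
    for k in range(n):
        zk = z[k]
        zc = z[(n - k) % n].conjugate()
        spec_x.append((zk + zc) / 2)   # spectrum of the real-port sequence
        spec_y.append((zk - zc) / 2j)  # spectrum of the imaginary-port sequence
    return spec_x, spec_y
```

With 256 input samples, as recited in the claim, each recovered spectrum is fully described by its first half because of the same symmetry, which is consistent with the 128-sample output sets.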
34. A vocoder system according to claim 33 wherein said pitch detection logic means includes:
- selection means having an input connection coupled to said fourth output port of said computing means and being operative to select the sample of said fourth set of output signals having the largest magnitude;
first comparator means having an input connection coupled to said selection means and a first and second output connection, said first comparator means being operative to compare the magnitude of the selected output signal of said selection means to a predetermined threshold level and to generate an output signal at said first output connection if the magnitude of said selected sample exceeds the predetermined threshold level and to generate a predetermined output signal at said second output connection if the magnitude of said selected sample is less than the predetermined threshold level; and
buffer storage means having a first input connection connected to the common juncture of said selection means and said first comparator means, a second input connection connected to the first output connection of said first comparator means and an output terminal and being operative to store the selected sample from said selection means and to shift the stored sample to said output terminal upon receipt of a signal from said first comparator means, whereby an unvoiced second speech signal is indicated when said first comparator means has an output signal at said second output connection and a voiced signal is indicated when the fourth output signal of said computing means is shifted to the output of said buffer storage means.
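The decision logic of claim 34 (select the largest-magnitude sample, compare it to a threshold, and release it only when the threshold is exceeded) can be illustrated in software. This is a sketch under assumptions: the name `detect_pitch`, the return convention, and the treatment of a sample exactly equal to the threshold as unvoiced are illustrative choices that the claim does not specify.

```python
def detect_pitch(coeffs, threshold):
    """Voiced/unvoiced decision on a set of transform coefficients.

    Returns (True, index, value) when the largest-magnitude coefficient
    exceeds the threshold (voiced), otherwise (False, None, None).
    """
    # Selection step: find the sample with the largest magnitude.
    index = max(range(len(coeffs)), key=lambda i: abs(coeffs[i]))
    peak = coeffs[index]
    # Comparator step: a peak equal to the threshold is treated as
    # unvoiced here, which is an assumption of this sketch.
    if abs(peak) > threshold:
        # Buffer step: the stored sample is released to the output.
        return True, index, peak
    return False, None, None
```

The index of the released sample is what carries the pitch period in this sketch; the claim itself only recites that the selected sample is shifted to the output.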
35. A vocoder system according to claim 33 wherein said convolution unit includes:
- logic means having a first input connection coupled to said means for generating a pitch carrier signal, a second input connection coupled to the third output port of said computing means and first, second, third and fourth output connections, said logic means being operative in response to a first predetermined time period to provide a data path from said first and second input connections to said first and second output connections, respectively, and being operative in response to a second predetermined time period to provide a data path from said first and second input connections to said third and fourth output connections respectively;
first storage means having first and second input connections coupled respectively to said first and second output connections of said logic means and a plurality of output connections, said first storage means being operative to store the output signals representing the vocal tract information received from the third output port of said computing means via the data path established by said logic means during said first predetermined time period and to gate from a different one of said plurality of output connections a complete set of vocal tract signals upon the receipt of each signal from said means for generating a pitch carrier signal during said first predetermined time period;
second storage means having first and second input connections coupled respectively to said third and fourth output connections of said logic means and a plurality of output connections, said second storage means being operative to store the output signals representing the vocal tract information received from the third output port of said computing means via the data path established by said logic means during said second predetermined time period and to gate from a different one of said plurality of output connections a complete set of vocal tract signals upon receipt of each signal from said pitch carrier generator during said second predetermined time period; and
summing means having a plurality of input connections each coupled to one of said plurality of output connections of said first and second storage means and being operative to add the vocal tract signals from said first and second storage means whereby a synthesized version of the first input speech signal is obtained.
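In claim 35 each pitch carrier signal gates out a complete set of vocal tract signals, and the summing means adds the overlapping sets. In software terms this amounts to convolving a pulse train with an impulse response by overlap-add. The sketch below shows only that summation. The name `synthesize` and its parameters are hypothetical, and the alternation between the first and second storage means over successive time periods is omitted.

```python
def synthesize(pulse_positions, impulse_response, length):
    """Overlap-add one copy of the impulse response at each pitch pulse."""
    out = [0.0] * length
    for start in pulse_positions:
        # Each pulse releases a complete set of vocal tract samples.
        for offset, sample in enumerate(impulse_response):
            position = start + offset
            if 0 <= position < length:
                # Summing step: overlapping responses are added.
                out[position] += sample
    return out
```

Responses that run past the end of the output buffer are truncated in this sketch; a running implementation would carry them into the next frame.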
36. A method of compressing the bandwidth of an input signal having an excitation portion and an impulse response portion comprising the steps of:
- generating a time variant electrical signal representing the Fourier transform of the logarithm of the spectrum magnitude of the input signal;
separating out a first time interval signal of said time variant electrical signal to represent the impulse response portion of the input signal; and
separating out a second time interval signal of said time variant electrical signal to represent the excitation portion of the input signal, said first and second time interval signals of said time variant electrical signal having a reduced bandwidth.
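The three steps of claim 36 can be sketched numerically as follows. This is an illustration under stated assumptions rather than the patented implementation: the names `dft`, `ftlsm` and `split_ftlsm` are hypothetical, a naive transform is used for brevity, a small floor guards the logarithm against zero magnitudes, and the cutoff index that separates the two intervals is a free parameter not given in the claim.

```python
import cmath
import math


def dft(x):
    """Naive O(N^2) discrete Fourier transform (stand-in for an FFT)."""
    n = len(x)
    return [
        sum(x[k] * cmath.exp(-2j * math.pi * i * k / n) for k in range(n))
        for i in range(n)
    ]


def ftlsm(frame, floor=1e-12):
    """Fourier transform of the logarithm of the spectrum magnitude."""
    magnitudes = [abs(v) for v in dft(frame)]
    # The floor (an assumption of this sketch) avoids log(0).
    logged = [math.log(max(m, floor)) for m in magnitudes]
    # For a real frame the logged magnitudes are real and even, so their
    # transform is real up to rounding; keep the real part only.
    return [v.real for v in dft(logged)]


def split_ftlsm(coeffs, cutoff):
    """Split coefficients into a low-index and a higher-index interval.

    The low-index interval stands for the impulse response portion and
    the higher-index interval (up to the midpoint) for the excitation.
    """
    half = len(coeffs) // 2 + 1
    return coeffs[:cutoff], coeffs[cutoff:half]


# Example input: a unit pulse followed by a half-amplitude repeat 16 samples later.
frame = [0.0] * 64
frame[0] = 1.0
frame[16] = 0.5
impulse_part, excitation_part = split_ftlsm(ftlsm(frame), cutoff=8)
```

For this example frame the largest excitation coefficient falls at index 16, the spacing of the two pulses, which is how the excitation interval carries pitch information.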
37. A method of simultaneously synthesizing a first speech signal and analyzing a second speech signal, said first and second speech signals including voicing and vocal tract data, said method comprising the steps of:
- generating a pitch carrier signal from said first speech signal;
generating the frequency spectrum magnitude signals of said first speech signal;
converting the frequency spectrum magnitude signals into signals having a first predetermined symmetry;
generating the frequency spectrum magnitude signals of the second speech signal;
converting the frequency spectrum magnitude signals of the second speech signal into a series of signals having a second predetermined symmetry and representing the logarithm of the frequency spectrum magnitudes of said second speech signal;
combining the signals having the first predetermined symmetry with the series of signals having the second predetermined symmetry to generate a series of composite signals;
generating from said series of composite signals first and second sets of signals representing the complex Fourier transform of the composite signal, said first set of signals having said first predetermined symmetry and said second set of signals having said second predetermined symmetry;
combining the pitch carrier signal from said first speech signal and said first set of signals having said first predetermined symmetry to thereby generate a synthesized version of the first speech signal;
selecting a predetermined number of said second set of signals to represent the vocal tract data of said second input speech signal; and
selecting a predetermined number of the remaining signals of said second set of signals to represent the voicing information of said second input speech signal.
Specification