Harmonic speech coding arrangement where a set of parameters for a continuous magnitude spectrum is determined by a speech analyzer and the parameters are used by a synthesizer to determine a spectrum which is used to determine sinusoids for synthesis
First Claim
1. In a harmonic speech coding arrangement, a method of processing speech signals, said speech signals comprising frames of speech, said method comprising determining from a present one of said frames a magnitude spectrum having a plurality of spectrum points, the frequency of each of said spectrum points being independent of said speech signals, calculating a set of parameters for a continuous magnitude spectrum that models said determined magnitude spectrum at each of said spectrum points, the number of parameters of said set being less than the number of said spectrum points, said continuous magnitude spectrum comprising a sum of a plurality of functions, one of said functions being a magnitude spectrum for a previous one of said frames, encoding said set of parameters as a set of parameter signals representing said speech signals, communicating said set of parameter signals representing said speech signals for use in speech synthesis, and synthesizing speech based on said communicated set of parameter signals.
Abstract
A harmonic coding arrangement where the magnitude spectrum of the input speech is modeled at the analyzer by a relatively small set of parameters and, significantly, as a continuous rather than only a line magnitude spectrum. The synthesizer, rather than the analyzer, determines the magnitude, frequency, and phase of a large number of sinusoids which are summed to generate synthetic speech. Rather than receiving information explicitly defining the sinusoids from the analyzer, the synthesizer receives the small set of parameters and uses those parameters to determine a spectrum, which, in turn, is used by the synthesizer to determine the sinusoids for synthesis.
149 Citations
38 Claims
-
1. In a harmonic speech coding arrangement, a method of processing speech signals, said speech signals comprising frames of speech, said method comprising
determining from a present one of said frames a magnitude spectrum having a plurality of spectrum points, the frequency of each of said spectrum points being independent of said speech signals, calculating a set of parameters for a continuous magnitude spectrum that models said determined magnitude spectrum at each of said spectrum points, the number of parameters of said set being less than the number of said spectrum points, said continuous magnitude spectrum comprising a sum of a plurality of functions, one of said functions being a magnitude spectrum for a previous one of said frames, encoding said set of parameters as a set of parameter signals representing said speech signals, communicating said set of parameter signals representing said speech signals for use in speech synthesis, and synthesizing speech based on said communicated set of parameter signals.
-
12. In a harmonic speech coding arrangement, a method of processing speech signals comprising
determining from said speech signals a magnitude spectrum having a plurality of spectrum points, the frequency of each of said spectrum points being independent of said speech signals, calculating a set of parameters for a continuous magnitude spectrum that models said determined magnitude spectrum at each of said spectrum points, the number of parameters of said set being less than the number of said spectrum points, encoding said set of parameters as a set of parameter signals representing said speech signals, communicating said set of parameter signals representing said speech signals for use in speech synthesis, and synthesizing speech based on said communicated set of parameter signals; wherein said calculating comprises
calculating said parameter set to fit said continuous magnitude spectrum to said determined magnitude spectrum in accordance with a minimum mean squared error criterion.
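The minimum mean squared error fit recited in claim 12 can be illustrated with a small least-squares sketch. This is not the patent's implementation: the two basis functions (a flat term and a Gaussian bump), the 64-point grid, and the names `fit_weights`, `flat`, `bump`, and `measured` are all assumptions chosen only to show a few parameters being fitted to many spectrum points.

```python
import math


def fit_weights(target, basis):
    """Return weights w minimizing sum_k (target[k] - sum_i w[i] * basis[i][k]) ** 2.

    Solves the normal equations by Gauss-Jordan elimination with partial
    pivoting. Assumes the basis functions are linearly independent.
    """
    n = len(basis)
    points = range(len(target))
    # Normal equations: (B B^T) w = B target
    a = [[sum(bi[k] * bj[k] for k in points) for bj in basis] for bi in basis]
    b = [sum(bi[k] * target[k] for k in points) for bi in basis]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        b[col], b[pivot] = b[pivot], b[col]
        for row in range(n):
            if row != col:
                factor = a[row][col] / a[col][col]
                a[row] = [x - factor * y for x, y in zip(a[row], a[col])]
                b[row] -= factor * b[col]
    return [b[i] / a[i][i] for i in range(n)]


# Hypothetical data: 64 spectrum points modeled by only 2 parameters.
NUM_POINTS = 64
flat = [1.0] * NUM_POINTS
bump = [math.exp(-((k - 20) / 6.0) ** 2) for k in range(NUM_POINTS)]
measured = [2.0 * f + 0.5 * g for f, g in zip(flat, bump)]

weights = fit_weights(measured, [flat, bump])
print(weights)
```

Because `measured` is built exactly from the two basis functions, the fit should recover weights close to 2.0 and 0.5; with a real spectrum the weights would only approximate it.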
-
13. In a harmonic speech coding arrangement, a method of processing speech signals comprising
determining from said speech signals a magnitude spectrum having a plurality of spectrum points, the frequency of each of said spectrum points being independent of said speech signals, calculating a set of parameters for a continuous magnitude spectrum that models said determined magnitude spectrum at each of said spectrum points, the number of parameters of said set being less than the number of said spectrum points, encoding said set of parameters as a set of parameter signals representing said speech signals, communicating said set of parameter signals representing said speech signals for use in speech synthesis, determining a phase spectrum from said speech signals, calculating a second set of parameters modeling said determined phase spectrum, encoding said second set of parameters as a second set of parameter signals representing said speech signals, communicating said second set of parameter signals representing said speech signals for use in speech synthesis, and synthesizing speech based on said communicated sets of parameter signals.
-
19. In a harmonic speech coding arrangement, a method of processing speech signals comprising
determining from said speech signals a magnitude spectrum having a plurality of spectrum points, the frequency of each of said spectrum points being independent of said speech signals, calculating a set of parameters for a continuous magnitude spectrum that models said determined magnitude spectrum at each of said spectrum points, the number of parameters of said set being less than the number of said spectrum points, encoding said set of parameters as a set of parameter signals representing said speech signals, communicating said set of parameter signals representing said speech signals for use in speech synthesis, determining a plurality of sinusoids from said determined magnitude spectrum, determining a phase spectrum from said speech signals, calculating a second set of parameters modeling said determined phase spectrum at the frequencies of said determined sinusoids, and encoding said second set of parameters as a second set of parameter signals representing said speech signals, communicating said second set of parameter signals representing said speech signals for use in speech synthesis, and synthesizing speech based on said communicated sets of parameter signals.
-
20. In a harmonic speech coding arrangement, a method of synthesizing speech comprising
receiving a set of parameters corresponding to input speech comprising frames of input speech, determining a spectrum from said parameter set, said spectrum having amplitude values for a range of frequencies, said determining a spectrum comprising determining an estimated magnitude spectrum for a present one of said frames as a sum of a plurality of functions, one of said functions being an estimated magnitude spectrum for a previous one of said frames, said method further comprising determining a plurality of sinusoids from said spectrum, the sinusoidal frequency of at least one of said sinusoids being determined based on amplitude values of said spectrum, and synthesizing speech as a sum of said sinusoids.
-
29. In a harmonic speech coding arrangement, a method of synthesizing speech comprising
receiving a set of parameters, determining a spectrum having amplitude values for a range of frequencies from said parameter set by estimating a magnitude spectrum as a sum of a plurality of functions, wherein one of said functions is a vector from a codebook, said vector being identified by an index defined by said received parameter set, determining a plurality of sinusoids from said spectrum, the sinusoidal frequency of at least one of said sinusoids being determined based on amplitude values of said spectrum, and synthesizing speech as a sum of said sinusoids.
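As a rough illustration of the codebook step in claim 29, the sketch below rebuilds a magnitude spectrum as a sum of two functions, one of which is a codebook vector selected by a received index. The codebook contents, the parameter names (`index`, `gain`, `offset`), and the choice of a constant as the second function are hypothetical; the claim only requires that one summed function be an indexed codebook vector.

```python
# Hypothetical codebook shared by analyzer and synthesizer.
CODEBOOK = [
    [1.0, 1.0, 1.0, 1.0],
    [0.0, 2.0, 4.0, 2.0],
    [4.0, 2.0, 0.0, 0.0],
]


def estimate_magnitude(params, codebook):
    """Estimate a magnitude spectrum as gain * codebook[index] + constant offset."""
    vector = codebook[params["index"]]
    return [params["gain"] * v + params["offset"] for v in vector]


# Hypothetical received parameter set: three numbers instead of a full spectrum.
received = {"index": 1, "gain": 0.5, "offset": 1.0}
estimate = estimate_magnitude(received, CODEBOOK)
print(estimate)  # [1.0, 2.0, 3.0, 2.0]
```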
-
30. In a harmonic speech coding arrangement, a method of synthesizing speech comprising
receiving a set of parameters, determining a spectrum from said parameter set, said spectrum having amplitude values for a range of frequencies, determining a plurality of sinusoids from said spectrum, the sinusoidal frequency of at least one of said sinusoids being determined based on amplitude values of said spectrum, and synthesizing speech as a sum of said sinusoids;
wherein said determining a spectrum comprises determining an estimated phase spectrum as a sum of a plurality of functions.
-
-
32. In a harmonic speech coding arrangement, a method of synthesizing speech comprising
receiving a set of parameters, determining a spectrum from said parameter set, said spectrum having amplitude values for a range of frequencies, determining a plurality of sinusoids from said spectrum, the sinusoidal frequency of at least one of said sinusoids being determined based on amplitude values of said spectrum, and synthesizing speech as a sum of said sinusoids;
wherein said determining a spectrum comprises determining an estimated phase spectrum using a pole-zero model and said received parameter set.
-
-
33. In a harmonic speech coding arrangement, a method of synthesizing speech comprising
receiving a set of parameters, determining a spectrum from said parameter set, said spectrum having amplitude values for a range of frequencies, determining a plurality of sinusoids from said spectrum, the sinusoidal frequency of at least one of said sinusoids being determined based on amplitude values of said spectrum, and synthesizing speech as a sum of said sinusoids;
wherein said determining a spectrum comprises determining an estimated magnitude spectrum, wherein said determining a plurality of sinusoids comprises finding a peak in said estimated magnitude spectrum, subtracting from said estimated magnitude spectrum a spectral component for a sinusoid with the frequency and amplitude of said peak, and repeating said finding and said subtracting until the estimated magnitude spectrum is below a threshold for all frequencies.
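The find-and-subtract loop of claim 33 can be sketched as follows. The triangular lobe used as the "spectral component" of a sinusoid, its half-width, the bin-indexed spectrum, and the names `lobe_shape` and `pick_sinusoids` are assumptions made for illustration; an actual coder would presumably use the spectrum of its analysis window instead.

```python
def lobe_shape(offset, halfwidth):
    """Hypothetical triangular lobe: 1.0 at the peak, tapering toward the edges."""
    return 1.0 - abs(offset) / (halfwidth + 1)


def pick_sinusoids(spectrum, threshold, halfwidth=2):
    """Repeatedly find the largest peak and subtract its lobe.

    Stops once every bin of the residual is below `threshold` (assumed > 0).
    Returns a list of (bin_index, amplitude) pairs in the order found.
    """
    residual = list(spectrum)
    sinusoids = []
    # Each pass zeroes one bin and never raises any bin, so len(residual)
    # passes are always enough.
    for _ in range(len(residual)):
        peak_bin = max(range(len(residual)), key=residual.__getitem__)
        peak_amp = residual[peak_bin]
        if peak_amp < threshold:
            break
        sinusoids.append((peak_bin, peak_amp))
        for offset in range(-halfwidth, halfwidth + 1):
            k = peak_bin + offset
            if 0 <= k < len(residual):
                residual[k] -= peak_amp * lobe_shape(offset, halfwidth)
    return sinusoids


# Hypothetical estimated magnitude spectrum containing two lobes.
spectrum = [0.0] * 20
for center, amp in [(5, 1.0), (12, 0.5)]:
    for offset in range(-2, 3):
        spectrum[center + offset] += amp * lobe_shape(offset, 2)

found = pick_sinusoids(spectrum, threshold=0.05)
print(found)  # [(5, 1.0), (12, 0.5)]
```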
-
-
35. In a harmonic speech coding arrangement, a method of synthesizing speech comprising
receiving a set of parameters, determining a spectrum from said parameter set, said spectrum having amplitude values for a range of frequencies, determining a plurality of sinusoids from said spectrum, the sinusoidal frequency of at least one of said sinusoids being determined based on amplitude values of said spectrum, and synthesizing speech as a sum of said sinusoids;
wherein said determining a spectrum comprises determining an estimated magnitude spectrum, and determining an estimated phase spectrum, wherein said determining a plurality of sinusoids comprises determining sinusoidal amplitude and frequency for each of said sinusoids based on said estimated magnitude spectrum, and determining sinusoidal phase for each of said sinusoids based on said estimated phase spectrum.
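A minimal sketch of the synthesis step in claim 35 follows: amplitude and frequency are read from an estimated magnitude spectrum, phase from an estimated phase spectrum, and the frame is generated as a sum of sinusoids. The sample rate, the 16-point transform size, the peak bins, and the function names are hypothetical, and the sketch ignores frame-to-frame interpolation, which a practical synthesizer would need.

```python
import math


def sinusoids_from_spectra(peak_bins, magnitude, phase, sample_rate, fft_size):
    """Return (amplitude, frequency_hz, phase_rad) for each chosen spectrum bin."""
    return [(magnitude[k], k * sample_rate / fft_size, phase[k]) for k in peak_bins]


def synthesize(sinusoids, num_samples, sample_rate):
    """Generate samples as a sum of cosines with the given amplitude, frequency, phase."""
    return [
        sum(a * math.cos(2.0 * math.pi * f * n / sample_rate + p)
            for a, f, p in sinusoids)
        for n in range(num_samples)
    ]


# Hypothetical estimated spectra on a 16-point grid (bins 0..8), 8 kHz sampling.
magnitude = [0.0] * 9
phase = [0.0] * 9
magnitude[1], magnitude[2] = 1.0, 0.5
phase[2] = math.pi / 2

sinusoids = sinusoids_from_spectra([1, 2], magnitude, phase, 8000.0, 16)
frame = synthesize(sinusoids, 16, 8000.0)
print(sinusoids[0], len(frame))
```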
-
-
36. In a harmonic speech coding arrangement, a method of processing speech, said speech comprising frames of speech, said method comprising
determining from said speech a magnitude spectrum having a plurality of spectrum points, the frequency of each of said spectrum points being independent of said speech, said magnitude of spectrum having a plurality of points being determined from a present one of said frames, calculating a set of parameters for a continuous magnitude spectrum that models said determined magnitude spectrum at each of said spectrum points, the number of parameters of said set being less than the number of said spectrum points, said continuous magnitude spectrum comprising a sum of a plurality of functions, one of said functions being a magnitude spectrum for a previous one of said frames, communicating said parameter set, receiving said communicated parameter set, determining a spectrum from said received parameter set, determining a plurality of sinusoids from said spectrum determined from said received parameter set, and synthesizing speech as a sum of said sinusoids.
-
37. In a harmonic speech coding arrangement, apparatus comprising
means responsive to speech signals for determining a magnitude spectrum having a plurality of spectrum points, said speech signals comprising frames of speech, said determining means determining said magnitude spectrum having a plurality of spectrum points from a present one of said frames, means responsive to said determining means for calculating a set of parameters for a continuous magnitude spectrum that models said determined magnitude spectrum at each of said spectrum points, the number of parameters of said set being less than the number of said spectrum points, said continuous magnitude spectrum comprising a sum of a plurality of functions, one of said functions being a magnitude spectrum for a previous one of said frames, means for encoding said set of parameters as a set of parameter signals representing said speech signals, means for communicating said set of parameter signals representing said speech signals for use in speech synthesis, and means for synthesizing speech based on said set of parameter signals communicated by said communicating means.
-
38. In a harmonic speech coding arrangement, a speech synthesizer comprising
means responsive to receipt of a set of parameters corresponding to input speech comprising frames of input speech for determining a spectrum, said spectrum having amplitude values for a range of frequencies, said determining means including means for developing an estimated magnitude spectrum for a present one of said frames as a sum of a plurality of functions, one of said functions being an estimated magnitude spectrum for a previous one of said frames, means for determining a plurality of sinusoids from said spectrum, the sinusoidal frequency of at least one of said sinusoids being determined based on amplitude values of said spectrum, and means for synthesizing speech as a sum of said sinusoids.
Specification