Multipulse processing with freedom given to multipulse positions of a speech signal
First Claim
1. A multipulse processing method of multipulse encoding an input speech signal on an analyzing side into an encoded speech signal for multipulse synthesis of said encoded speech signal on a synthesizing side into a synthesized speech signal equivalent to said input speech signal, said multipulse processing method comprising on said analyzing side the steps of sampling said input speech signal into a sampled speech signal at a predetermined sampling frequency defining successive analysis frames, linear predictive coding (LPC) analyzing said sampled speech signal of each analysis frame to extract LPC coefficients and to produce original spectrum envelope information of said input speech signal based on said LPC coefficients, multipulse analyzing said LPC coefficients into a sequence of original multipulses having appearance time instants at which said original multipulses appear and multipulse amplitudes in correspondence in each analysis frame to features of excitation source information representative of speech information of said input speech signal in combination with said spectrum envelope information, and encoding said sequence of original multipulses and said spectrum envelope information into an encoded sequence of original multipulses and encoded spectrum envelope information for use in combination as said encoded speech signal, wherein said multipulse analyzing step comprises the step of giving a degree of freedom to said appearance time instants relative to sampling instants of said sampled speech signal, so that said appearance time instants are modified without increasing said predetermined sampling frequency, to modify said original multipulses into modified multipulses to make said encoded sequence comprise said modified multipulses in place of said original multipulses.
1 Assignment
0 Petitions
Accused Products
Abstract
In a multipulse processing device for achieving a high encoding efficiency without using a high sampling frequency for an input signal and with a great degree of freedom given to positions of multipulses, an input speech signal is subjected by an LPC analyzer/processor 3 to LPC analysis of each analysis frame for extraction of LPC coefficients after sampled by an A/D converter 2. Multipulses are retrieved as a result of decision by a multipulse analyzer 20 with a degree of freedom given relative to sampling points of a sampled speech signal supplied through an auditorily weighting filter 4. Encoded by an encoder 41 and together with k parameters used as an example of the LPC coefficients, retrieved multipulses are multiplexed by a multiplexer 42 for delivery to a synthesis side. A multipulse waveform synthesizer 45 synthesizes a waveform by using decoded multipulse data and the LPC coefficients.
24 Citations
36 Claims
- 1. A multipulse processing method of multipulse encoding an input speech signal on an analyzing side into an encoded speech signal for multipulse synthesis of said encoded speech signal on a synthesizing side into a synthesized speech signal equivalent to said input speech signal, said multipulse processing method comprising on said analyzing side the steps of sampling said input speech signal into a sampled speech signal at a predetermined sampling frequency defining successive analysis frames, linear predictive coding (LPC) analyzing said sampled speech signal of each analysis frame to extract LPC coefficients and to produce original spectrum envelope information of said input speech signal based on said LPC coefficients, multipulse analyzing said LPC coefficients into a sequence of original multipulses having appearance time instants at which said original multipulses appear and multipulse amplitudes in correspondence in each analysis frame to features of excitation source information representative of speech information of said input speech signal in combination with said spectrum envelope information, and encoding said sequence of original multipulses and said spectrum envelope information into an encoded sequence of original multipulses and encoded spectrum envelope information for use in combination as said encoded speech signal, wherein said multipulse analyzing step comprises the step of giving a degree of freedom to said appearance time instants relative to sampling instants of said sampled speech signal, so that said appearance time instants are modified without increasing said predetermined sampling frequency, to modify said original multipulses into modified multipulses to make said encoded sequence comprise said modified multipulses in place of said original multipulses.
- 11. A multipulse encoding device comprising sampling means for sampling an input speech signal into a sampled speech signal at a predetermined sampling frequency defining successive analysis frames, linear predictive coding (LPC) analyzing means for LPC analyzing said sampled speech signal of each analysis frame to extract LPC coefficients and to produce spectrum envelope information of said input speech signal based on said LPC coefficients, multipulse analyzing means for multipulse analyzing said LPC coefficients into a multipulse sequence of multipulses having appearance time instants at which said original multipulses appear and multipulse amplitudes in correspondence in each analysis frame to features of excitation source information representative of speech information of said input speech signal in combination with said spectrum envelope information, and encoding means for encoding said excitation source information into an encoded sequence to produce said encoded signal and said spectrum envelope information as an encoded speech signal, wherein said multipulse analyzing means comprises freedom giving means for giving a degree of freedom to said appearance time instants relative to sampling time instants of said sampled speech signal, so that said appearance time instants are modified without increasing said predetermined sampling frequency, to make said encoding means use the excitation source information in which the appearance time instants of said multipulses are given said degree of freedom.
-
20. A multipulse decoding device for decoding an encoded speech signal produced by a multipulse encoder as a combination of an encoded sequence of modified multipulses and encoded spectrum envelope information by sampling an original speech signal into a sampled speech signal at predetermined sampling frequency defining successive analysis frames, by linear predictive coding (LPC) analyzing the sampled speech signal of each analysis frame for extraction of LPC coefficients and for production of original spectrum envelope information of said original speech signal based on said LPC coefficients, by multipulse analyzing said LPC coefficients into original multipulses having appearance time instants at which said original multipulses appear and multipulse amplitudes in correspondence in each analysis frame to features of excitation source information representative of speech information of said original speech signal in combination with said original spectrum envelope information, by modifying said original multipulses into modified multipulses of a sequence with said appearance time instants given a degree of freedom which allows said appearance time instants to be modified without increasing said predetermined sampling frequency, and by encoding said modified multipulses into said encoded sequence of modified multipulses and said original spectrum envelope information into said encoded spectrum envelope information, said multipulse decoding device comprising:
-
decoding means for decoding said encoded sequence into a decoded sequence of modified multipulses and said encoded spectrum envelope information into decoded spectrum envelope information; and multipulse waveform synthesizing means for synthesizing said decoded sequence of modified multipulses and said decoded spectrum envelope information into a synthesized speech signal equivalent to said original speech signal. - View Dependent Claims (21, 22, 23)
-
- 24. A multipulse analyzer comprising sampling means for sampling an input speech signal into a sampled speech signal at a predetermined sampling frequency defining successive analysis frames, linear predictive coding (LPC) analyzing means for LPC analyzing said sampled speech signal of each analysis frame to extract LPC coefficients and to produce spectrum envelope information based on said LPC coefficients, and multipulse analyzing means for multipulse analyzing said LPC coefficients into a multipulse sequence of multipulses having appearance time instants at which said original multipulses appear and multipulse amplitude in correspondence in each analysis frame to features of excitation source information representative of speech information of said input speech signal in combination with said spectrum envelope information, wherein said multipulse analyzing means comprises freedom giving means for giving a degree of freedom to said appearance time instants, so that said appearance time instants are modified without increasing said predetermined sampling frequency, to modify said multipulses into modified multipulses relative to sampling instants of said sampled speech signal with the appearance time instants given said degree of freedom and with said multipulse amplitudes as they are.
- 33. A multipulse synthesizer for multipulse synthesizing a sequence of modified multipulses and spectrum envelope information produced by a multipulse analyzer by sampling an original speech signal into a sampled speech signal at a predetermined sampling frequency defining successive analysis frames, by linear predictive coding (LPC) analyzing the sampled speech signal of each analysis frame for extraction of LPC coefficients and for production of said spectrum envelope information based on said LPC coefficients, by multipulse analyzing said LPC coefficients into original multipulses having appearance time instants at which said original multipulses appear and multipulse amplitudes in correspondence in each analysis frame to features of excitation source information representative of speech information of said original speech signal in combination with said spectrum envelope information, and by modifying said original multipulses into the modified multipulses of said sequence with said appearance time instants given a degree of freedom which allows said appearance time instants to be modified without increasing said predetermined sampling frequency, said multipulse synthesizer comprising multipulse waveform synthesizing means for synthesizing said sequence of modified multipulses and said spectrum envelope information into a synthesized speech signal equivalent to said original speech signal.
Specification