Method for encoding speech wherein pitch periods are changed based upon input speech signal
First Claim
1. A method for encoding speech comprising the steps of:
- obtaining first pitch periods of an input speech signal;
changing the pitch periods according to condition of the input speech signal to obtain second pitch periods;
determining encoding sections corresponding to said second pitch periods, respectively;
generating an excitation signal by which distortion of a synthesized speech signal is minimized for each of said encoding sections, the synthesized speech signal being generated by subjecting the excitation signal to synthesis filtering; and
outputting at least information representing said changed pitch periods and information on said synthesized speech signal as encoded data.
0 Assignments
0 Petitions
Accused Products
Abstract
A method for encoding speech wherein an input speech signal is separated by a component separator into a first component mainly constituted by speech and a second component mainly constituted by a background noise at each predetermined unit of time, a bit allocation selector selects bit allocation for each component based on the first and second components from among a plurality of predetermined candidates for bit allocation, a speech encoder and a noise encoder encode the first and second components from the component separator based on the bit allocation according to predetermined different methods for encoding, and a multiplexer multiplexes encoded data of the first and second components and information on the bit allocation and outputs them as transmitted encoded data.
53 Citations
10 Claims
-
1. A method for encoding speech comprising the steps of:
-
obtaining first pitch periods of an input speech signal;
changing the pitch periods according to condition of the input speech signal to obtain second pitch periods;
determining encoding sections corresponding to said second pitch periods, respectively;
generating an excitation signal by which distortion of a synthesized speech signal is minimized for each of said encoding sections, the synthesized speech signal being generated by subjecting the excitation signal to synthesis filtering; and
outputting at least information representing said changed pitch periods and information on said synthesized speech signal as encoded data. - View Dependent Claims (2)
the steps of determining encoding sections determines encoding sections based on said concatenated pitch periods as well as said changed pitch periods, said step of outputting encoded data comprising the step of outputting information representing said local concatenated pitch periods as well as the information representing said changed pitch periods and the information on said synthesized speech signal as encoded data.
-
-
3. A method for encoding speech comprising the steps of:
-
obtaining synthesis filter characteristic information representing the transfer characteristics of a synthesis filter which receives an excitation signal and generates a synthesized speech signal;
obtaining first pitch periods of an input speech signal;
changing the pitch periods according to condition of the input speech signal to obtain second pitch periods;
determining encoding sections corresponding to said second pitch periods, respectively;
generating said excitation signal by which distortion of said synthesized speech signal is minimized for each of said encoding sections; and
outputting at least said synthesis filter characteristic information, information representing said second pitch periods and information representing said excitation signal as encoded data. - View Dependent Claims (4)
the steps of determining encoding sections determines encoding sections based on said concatenated pitch periods as well as said changed pitch periods, said step of outputting encoded data comprising the step of outputting information representing said concatenated pitch periods as well as said synthesis filter characteristic information, information representing said second pitch periods and information representing said excitation signal as encoded data.
-
-
5. A method for encoding speech comprises:
-
setting a plurality of pitch marks in each frame of an input speech signal, each of the pitch marks indicating a position in the frame at which a pitch wave form is to be put;
obtaining a plurality of pitch periods corresponding pitch marks, respectively, the pitch periods being changed according to condition of the input speech signal;
generating an excitation signal by which distortion of a synthesized speech signal is minimized, for each of said pitch periods, the synthesized speech signal being generated by subjecting the excitation signal to synthesis filtering; and
outputting at least information representing said pitch periods and information on said synthesized speech signal as encoded data. - View Dependent Claims (6, 7, 8)
-
-
9. A method for encoding speech comprising the steps of:
-
obtaining local pitch periods representing time lengths of one-pitch waveforms of an input speech signal from said input speech signal;
determining encoding sections based on said local pitch periods;
generating a synthesized speech signal for which distortion from said input speech signal is minimized in each of said encoding sections;
outputting at least information representing said local pitch periods and information on said synthesized speech signal as encoded data; and
concatenating said local pitch periods which are at least partially adjacent to each other to obtain local concatenated pitch periods, said step of generating a synthesized speech signal comprising the steps of determining encoding sections based on said local pitch periods and said local concatenated pitch periods and generating a synthesized speech signal for which distortion from said input speech signal is minimized in each of said encoding sections, said step of outputting encoded data comprising the step of outputting at least information representing said local pitch periods, information representing said local concatenated pitch periods and information on said synthesized speech signal as encoded data.
-
-
10. A method for encoding speech comprising the steps of:
-
obtaining synthesis filter characteristic information representing the transfer characteristics of a synthesis filter which receives the input of an excitation signal and generates a synthesized speech signal and obtaining local pitch periods representing time lengths of one-pitch waveforms of an input speech signal from said input speech signal;
determining encoding sections based on said local pitch periods;
generating said excitation signal for which distortion of said synthesized speech signal is minimized in each of said encoding sections;
outputting at least said synthesis filter characteristic information, information representing said local pitch periods and information representing said excitation signal as encoded data;
concatenating said local pitch periods which are at least partially adjacent to each other to obtain local concatenated pitch periods, said step of generating an excitation signal comprising the steps of determining encoding sections based on said local pitch periods and said local concatenated pitch periods and generating said excitation signal for which distortion of said synthesized speech signal is minimized in each of said encoding sections, said step of outputting encoded data comprising the step of outputting at least said synthesis filter characteristic information, information representing said local pitch periods, information representing said local concatenated pitch periods and information representing said excitation signal as encoded data.
-
Specification