Speech coding method, device, coding module, system and software program product for pre-processing the phase structure of a to be encoded speech signal to match the phase structure of the decoded signal
First Claim
1. A method for use in speech coding, said method comprising:
- pre-processing a to be encoded speech based signal on a frame-by-frame basis such that a phase structure of said to be encoded speech based signal is approached to a phase structure which would be obtained if said to be encoded speech based signal was encoded and decoded; and
applying an encoding to said pre-processed to be encoded speech based signal;
wherein pre-processing said to be encoded speech based signal comprises for a respective frame of said to be encoded speech signal;
estimating a pitch for said frame;
determining a synthetic phase contour over said frame based on said pitch estimate and a pitch estimate for a preceding frame;
locating at least one pitch pulse position in said determined synthetic phase contour;
locating at least one pitch pulse position in said frame of said to be encoded speech based signal; and
modifying said to be encoded speech based signal in said frame such that the at least one pitch pulse position is shifted to the at least one pitch pulse position of said synthetic phase contour.
1 Assignment
0 Petitions
Accused Products
Abstract
The invention relates to a method for use in parametric speech coding. In order to enable an improved parametric coding of speech signals, the method comprises a first step of pre-processing a to be encoded speech based signal such that a phase structure of the to be encoded speech based signal is approached to a phase structure which is obtained when the to be encoded speech based signal is parametrically encoded and decoded again. Only in a second step, a parametric encoding is applied to this pre-processed to be encoded speech based signal. The invention relates equally to a corresponding device, to a corresponding coding module, to a corresponding system and to a corresponding software program product.
-
Citations
22 Claims
-
1. A method for use in speech coding, said method comprising:
-
pre-processing a to be encoded speech based signal on a frame-by-frame basis such that a phase structure of said to be encoded speech based signal is approached to a phase structure which would be obtained if said to be encoded speech based signal was encoded and decoded; and applying an encoding to said pre-processed to be encoded speech based signal; wherein pre-processing said to be encoded speech based signal comprises for a respective frame of said to be encoded speech signal; estimating a pitch for said frame; determining a synthetic phase contour over said frame based on said pitch estimate and a pitch estimate for a preceding frame; locating at least one pitch pulse position in said determined synthetic phase contour; locating at least one pitch pulse position in said frame of said to be encoded speech based signal; and modifying said to be encoded speech based signal in said frame such that the at least one pitch pulse position is shifted to the at least one pitch pulse position of said synthetic phase contour. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A device for performing a speech coding, said device comprising:
-
a pre-processing portion adapted to pre-process a to be encoded speech based signal on a frame-by-frame basis such that a phase structure of said to be encoded speech based signal is approached to a phase structure which would be obtained if said to be encoded speech based signal was encoded and decoded; and a coding portion which is adapted to apply an encoding to a to be encoded speech based signal; wherein said pre-processing by said pre-processing portion comprises for a respective frame of a to be encoded speech signal; estimating a pitch for said frame; determining a synthetic phase contour over said frame based on said pitch estimate and a pitch estimate for a preceding frame; locating at least one pitch pulse position in said determined synthetic phase contour; locating at least one pitch pulse position in said frame of said to be encoded speech based signal; and modifying said to be encoded speech based signal in said frame such that the at least one pitch pulse position is shifted to the at least one pitch pulse position of said synthetic phase contour. - View Dependent Claims (10, 11, 12)
-
-
13. A coding module for performing a speech coding, said coding module comprising:
-
a pre-processing portion adapted to pre-process a to be encoded speech based signal on a frame-by-frame basis such that a phase structure of said to be encoded speech based signal is approached to a phase structure which would be obtained if said to be encoded speech based signal was encoded and decoded; and a coding portion which is adapted to apply an encoding to a to be encoded speech based signal; wherein said pre-processing by said pre-processing portion comprises for a respective frame of a to be encoded speech signal; estimating a pitch for said frame; determining a synthetic phase contour over said frame based on said pitch estimate and a pitch estimate for a preceding frame; locating at least one pitch pulse position in said determined synthetic phase contour; locating at least one pitch pulse position in said frame of said to be encoded speech based signal; and modifying said to be encoded speech based signal in said frame such that the at least one pitch pulse position is shifted to the at least one pitch pulse position of said synthetic phase contour. - View Dependent Claims (14, 15)
-
-
16. A system comprising at least one device for performing a speech coding, said at least one device comprising:
-
a pre-processing portion adapted to pre-process a to be encoded speech based signal on a frame-by-frame basis such that a phase structure of said to be encoded speech based signal is approached to a phase structure which would be obtained if said to be encoded speech based signal was encoded and decoded; and a coding portion which is adapted to apply an encoding to a to be encoded speech based signal; wherein said pre-processing by said pre-processing portion of said at least one device comprises for a respective frame of a to be encoded speech signal; estimating a pitch for said frame; determining a synthetic phase contour over said frame based on said pitch estimate and a pitch estimate for a preceding frame; locating at least one pitch pulse position in said determined synthetic phase contour; locating at least one pitch pulse position in said frame of said to be encoded speech based signal; and modifying said to be encoded speech based signal in said frame such that the at least one pitch pulse position is shifted to the at least one pitch pulse position of said synthetic phase contour. - View Dependent Claims (17, 18, 19)
-
-
20. A coding module in which a software code for use in speech coding is stored, said software code realizing the following steps when running in a processing unit:
-
pre-processing a to be encoded speech based signal on a frame-by-frame basis such that a phase structure of said to be encoded speech based signal is approached to a phase structure which would be obtained if said to be encoded speech based signal was encoded and decoded; and applying an encoding to said pre-processed to be encoded speech based signal; wherein pre-processing said to be encoded speech based signal comprises for a respective frame of said to be encoded speech signal; estimating a pitch for said frame; determining a synthetic phase contour over said frame based on said pitch estimate and a pitch estimate for a preceding frame; locating at least one pitch pulse position in said determined synthetic phase contour; locating at least one pitch pulse position in said frame of said to be encoded speech based signal; and modifying said to be encoded speech based signal in said frame such that the at least one pitch pulse position is shifted to the at least one pitch pulse position of said synthetic phase contour. - View Dependent Claims (21, 22)
-
Specification