Speech synthesizer employing post-processing for enhancing the quality of the synthesized speech
DCFirst Claim
1. A Linear Predictive Coding (LPC) synthesiser for speech synthesis, comprising:
- an excitation source; and
a LPC decoder comprising post-processing means coupled to an output of said excitation source for operating on a first signal including speech periodicity information derived from said excitation source, wherein the post-processing means modifies the speech periodicity information content of the first signal in accordance with a second signal derivable from said excitation source in order to produce an enhanced synthesised speech signal.
2 Assignments
Litigations
0 Petitions
Accused Products
Abstract
A post-processor 317 and method substantially for enhancing synthesised speech is disclosed. The post-processor 317 operates on a signal ex(n) derived from an excitation generator 211 typically comprising a fixed code book 203 and an adaptive code book 204, the signal ex(n) being formed from the addition of scaled outputs from the fixed code book 203 and adaptive code book 204. The post-processor operates on ex(n) by adding to it a scaled signal pv(n) derived from the adaptive code book 204. A gain or scale factor p is determined by the speech coefficients input to the excitation generator 211. The combined signal ex(n)+pv(n) is normalised by unit 316 and input to an LPC or speech synthesis filter 208, prior to being input to an audio processing unit 209.
68 Citations
46 Claims
-
1. A Linear Predictive Coding (LPC) synthesiser for speech synthesis, comprising:
-
an excitation source; and a LPC decoder comprising post-processing means coupled to an output of said excitation source for operating on a first signal including speech periodicity information derived from said excitation source, wherein the post-processing means modifies the speech periodicity information content of the first signal in accordance with a second signal derivable from said excitation source in order to produce an enhanced synthesised speech signal. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A method for use with Linear Predictive Coding (LPC) for enhancing synthesised speech, comprising steps of:
-
deriving a first signal including speech periodicity information from an excitation source, deriving a second signal from the excitation source, and modifying in a LPC decoder the speech periodicity information content of the first signal in accordance with the second signal in order to produce an enhanced synthesised speech signal. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29)
-
-
30. A method for use with Linear Predictive Coding (LPC) for enhancing synthesised speech, comprising steps of:
-
deriving a first signal including speech periodicity information from an excitation source, comprising a fixed code book and an adaptive code book, the first signal comprising a combination of first and second partial excitation signals respectively originating from the fixed and adaptive code books, deriving a second signal from the excitation source, and modifying in a LPC decoder the speech periodicity information content of the first signal in accordance with the second signal in order to produce an enhanced synthesised speech signal, the second signal being substantially the same as the second partial excitation signal and originating from the adaptive code book, the first signal being modified by combining the second signal with the first signal, and a first scaling factor (p) being derivable from an adaptive code book scaling factor (b) in accordance with the following relationship, ##EQU18## where TH represents threshold values, aenh is a linear scaler and f(b) is a function of b.
-
-
31. A method for use with Linear Predictive Coding (LPC) for enhancing synthesised speech, comprising steps of:
-
deriving a first signal including speech periodicity information from an excitation source, comprising a fixed code book and an adaptive code book, the first signal comprising a combination of first and second partial excitation signals respectively originating from the fixed and adaptive code books, deriving a second signal from the excitation source, and modifying in a LPC decoder the speech periodicity information content of the first signal in accordance with the second signal in order to produce an enhanced synthesised speech signal, the second signal being substantially the same as the first partial excitation signal and originating from the fixed code book, the first signal being modified by combining the second signal with the first signal, and a first scaling factor (p) being derivable from an adaptive code book scaling factor (b) in accordance with the following relationship, ##EQU19## where TH represents threshold values, aenh is a linear scaler and f(b) is a function of b.
-
- 32. A Linear Predictive Coding (LPC) synthesiser for speech synthesis, comprising first and second excitation sources for respectively generating first and second excitation signals, and a LPC decoder comprising modifying means for modifying the first excitation signal in accordance with a scaling factor derivable from pitch information associated with the first excitation signal in order to produce an enhanced synthesised speech signal.
- 38. A Linear Predictive Coding (LPC) synthesiser for speech synthesis, comprising first and second excitation sources for respectively generating first and second excitation signals, and a LPC decoder comprising modifying means for modifying the second excitation signal in accordance with a scaling factor derivable from pitch information associated with the first excitation signal in order to produce an enhanced synthesised speech signal.
-
42. A method for use with Linear Predictive Coding (LPC) for speech synthesis, comprising steps of:
-
generating first and second excitation signals, modifying in a LPC decoder the first excitation signal in accordance with a gain factor associated therewith, and further modifying in the LPC decoder the first excitation signal in accordance with a scaling factor derivable from pitch information associated with the first excitation signal in order to produce an enhanced synthesised speech signal.
-
-
43. A method for use with Linear Predictive Coding (LPC) for speech synthesis, comprising steps of:
-
generating first and second excitation signals, modifying in a LPC decoder the first excitation signal in accordance with a gain factor associated therewith, and modifying in the LPC decoder the second excitation signal in accordance with a scaling factor derivable from pitch information associated with the first excitation signal in order to produce an enhanced synthesised speech signal.
-
-
44. A time domain speech synthesiser, comprising:
-
an excitation source providing first and second partial excitation signals having a speech periodicity information content; and a speech quality enhancement post-processor coupled to said excitation source for operating on one of said first and second partial excitation signals, said post-processor modifying the speech periodicity information content of the operated on partial excitation signal in accordance with a signal derivable from at least one of said first and second partial excitation signals.
-
-
45. A synthesiser for speech synthesis, comprising:
-
an input unit for inputting a signal and for extracting coded information from said signal, the coded information comprising fixed codebook and adaptive codebook parameters, including an adaptive codebook gain factor; an excitation source comprising a fixed codebook and an adaptive codebook and having inputs coupled to outputs of said input unit for receiving extracted coded information therefrom, said excitation source being responsive to the received extracted coded information for outputting a first partial excitation signal from said fixed codebook and a second partial excitation signal from said adaptive codebook, said excitation source further comprising means for combining said first and second partial excitation signals into a composite excitation signal; and a perceptual enhancement post-processor coupled to said excitation source for operating on said composite excitation signal by combining said composite excitation signal with a scaled version of said second partial excitation signal, wherein an amount of scaling of said second partial excitation signal is controlled by a scaling factor having a value that is function of a value of said adaptive codebook gain factor. - View Dependent Claims (46)
-
Specification