Code excited linear prediction coder with a short-length codebook for modeling speech having local peak
First Claim
1. A speech coding apparatus, comprising:
- a first code book for storing a plurality of first sound source signals respectively having a first length;
a short-length signal code book for storing a plurality of short-length sound source signals respectively having a second length shorter than the first length;
function detecting means for analyzing a current input speech signal to detect whether or not a function of the first code book is depressed;
selecting means for selecting the first code book to take out a first sound source signal from the first code book in cases where it is detected by the function detecting means that the function of the first code book is not depressed and selecting the short-length signal code book to take out a plurality of short-length sound source signals from the short-length signal code book in cases where it is detected by the function detecting means that the function of the first code book is depressed, a total length of the short-length sound source signal being equal to the first length;
a synthesis filter for generating a synthesized speech signal from the first sound source signal or the short-length sound source signals which are taken out from the first code book or the short-length signal code book selected by the selecting means; and
controlling means for controlling the first sound source signal or the short-length sound source signals which are taken out from the first code book or the short-length signal code book selected by the selecting means to reduce a difference between the current input speech signal and the synthesized speech signal generated by the synthesis filter.
1 Assignment
0 Petitions
Accused Products
Abstract
A predicted residual signal is calculated from a current input speech signal and a past input speech signal, and a cross-correlation between the predicted residual signal and the past input speech signal having one speech sub-frame length stored in a first code book is calculated. In cases where the current input speech signal has no local peak, the cross-correlation becomes high, so that a synthesized speech signal is generated from the past input speech signal stored in the first code book or a predetermined sound source signal having one speech sub-frame length stored in the second code book. In contrast, in cases where the current input speech signal has a local peak, the cross-correlation becomes low, so that it is judged that a function of the first code book is depressed. In this case, a synthesized speech signal is generated from a group of short-length sound source signals having a total length equal to one speech sub-frame length stored in a short-length signal code book. Therefore, even though the current input speech signal suddenly has a local peak, because the synthesized speech signal is generated from the short-length sound source signals respectively having a speech length lower than one speech sub-frame length, the local peak can be expressed by the short-length sound source signals, an appropriate exciting sound source signal similar to the current input speech signal can be determined, and the synthesized speech signal can be adequately obtained.
14 Citations
10 Claims
-
1. A speech coding apparatus, comprising:
-
a first code book for storing a plurality of first sound source signals respectively having a first length; a short-length signal code book for storing a plurality of short-length sound source signals respectively having a second length shorter than the first length; function detecting means for analyzing a current input speech signal to detect whether or not a function of the first code book is depressed; selecting means for selecting the first code book to take out a first sound source signal from the first code book in cases where it is detected by the function detecting means that the function of the first code book is not depressed and selecting the short-length signal code book to take out a plurality of short-length sound source signals from the short-length signal code book in cases where it is detected by the function detecting means that the function of the first code book is depressed, a total length of the short-length sound source signal being equal to the first length; a synthesis filter for generating a synthesized speech signal from the first sound source signal or the short-length sound source signals which are taken out from the first code book or the short-length signal code book selected by the selecting means; and controlling means for controlling the first sound source signal or the short-length sound source signals which are taken out from the first code book or the short-length signal code book selected by the selecting means to reduce a difference between the current input speech signal and the synthesized speech signal generated by the synthesis filter. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A speech coding apparatus, comprising:
-
a first code book for storing a plurality of past sound source signals respectively having a first length of one speech sub-frame, the past sound source signals being formed of a past input speech signal preceding to a current input speech signal currently input; a second code book for storing a plurality of predetermined sound source signals respectively having the first length of one speech sub-frame length; a short-length signal code book for storing a plurality of short-length second source signals respectively having a second length of one micro-frame shorter than the first length, a plurality of a leech micro-frames being formed by dividing one speech sub-frame; linear prediction analyzing means for analyzing the past input speech signal and the current input speech signal to calculate a plurality of linear prediction coefficients; prediction residual signal calculating means for calculating a predicted residual signal indicating a predicted residual between the current input speech signal and a predicted input speech signal which is obtained by using the linear prediction coefficients calculated by the linear prediction analyzing means; cross-correlation calculating means for calculating a cross-correlation between a past sound source signal taken out from the first code book and the predicted residual signal calculated by the prediction residual signal calculating means to detect a depression of a function of the first code book according to a degree of the cross-correlation; adding means for linearly adding the past sound source signal taken out from the first code book and a predetermined sound source signal taken out from the second code book to form a first excitation sound source signal, a total length of the first excitation sound source signal being equal to the first length; short-length signal connecting means for connecting a plurality of short-length sound source signals taken out from the short-length signal code book in series to form a second excitation sound source signal, a total length of the second excitation sound source signal being equal to the first length; selecting leans for selecting the first excitation sound source signal obtained in the adding means in cases where it is detected by the cross-correlation calculating means that the function of the first code book is not depressed and selecting the second excitation sound source signal obtained in the short-length signal connecting means in cases where it is detected by the cross-correlation calculating means that the function of the first code book is depressed; a synthesized filter for generating a synthesized speech signal from the first excitation sound source signal or the second excitation sound source signal selected by the selecting means according to the linear prediction coefficients calculated by the linear prediction analyzing means; and controlling means for controlling the past sound source signal taken out from the first code book to the adding means and the short-length sound source signals taken out from the short-length signal code book to reduce a difference between the current input speech signal and the synthesis speech signal generated by the synthesized speech signal generated by the synthesis filter. - View Dependent Claims (10)
-
Specification