Code excited linear prediction coder with a short-length codebook for modeling speech having local peak

US 5,699,483 A
Filed: 06/14/1995
Issued: 12/16/1997
Est. Priority Date: 06/14/1994
Status: Expired due to Fees

First Claim

Patent Images

1. A speech coding apparatus, comprising:

a first code book for storing a plurality of first sound source signals respectively having a first length;

a short-length signal code book for storing a plurality of short-length sound source signals respectively having a second length shorter than the first length;

function detecting means for analyzing a current input speech signal to detect whether or not a function of the first code book is depressed;

selecting means for selecting the first code book to take out a first sound source signal from the first code book in cases where it is detected by the function detecting means that the function of the first code book is not depressed and selecting the short-length signal code book to take out a plurality of short-length sound source signals from the short-length signal code book in cases where it is detected by the function detecting means that the function of the first code book is depressed, a total length of the short-length sound source signal being equal to the first length;

a synthesis filter for generating a synthesized speech signal from the first sound source signal or the short-length sound source signals which are taken out from the first code book or the short-length signal code book selected by the selecting means; and

controlling means for controlling the first sound source signal or the short-length sound source signals which are taken out from the first code book or the short-length signal code book selected by the selecting means to reduce a difference between the current input speech signal and the synthesized speech signal generated by the synthesis filter.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A predicted residual signal is calculated from a current input speech signal and a past input speech signal, and a cross-correlation between the predicted residual signal and the past input speech signal having one speech sub-frame length stored in a first code book is calculated. In cases where the current input speech signal has no local peak, the cross-correlation becomes high, so that a synthesized speech signal is generated from the past input speech signal stored in the first code book or a predetermined sound source signal having one speech sub-frame length stored in the second code book. In contrast, in cases where the current input speech signal has a local peak, the cross-correlation becomes low, so that it is judged that a function of the first code book is depressed. In this case, a synthesized speech signal is generated from a group of short-length sound source signals having a total length equal to one speech sub-frame length stored in a short-length signal code book. Therefore, even though the current input speech signal suddenly has a local peak, because the synthesized speech signal is generated from the short-length sound source signals respectively having a speech length lower than one speech sub-frame length, the local peak can be expressed by the short-length sound source signals, an appropriate exciting sound source signal similar to the current input speech signal can be determined, and the synthesized speech signal can be adequately obtained.

14 Citations

View as Search Results

10 Claims

1. A speech coding apparatus, comprising:
- a first code book for storing a plurality of first sound source signals respectively having a first length;
  
  a short-length signal code book for storing a plurality of short-length sound source signals respectively having a second length shorter than the first length;
  
  function detecting means for analyzing a current input speech signal to detect whether or not a function of the first code book is depressed;
  
  selecting means for selecting the first code book to take out a first sound source signal from the first code book in cases where it is detected by the function detecting means that the function of the first code book is not depressed and selecting the short-length signal code book to take out a plurality of short-length sound source signals from the short-length signal code book in cases where it is detected by the function detecting means that the function of the first code book is depressed, a total length of the short-length sound source signal being equal to the first length;
  
  a synthesis filter for generating a synthesized speech signal from the first sound source signal or the short-length sound source signals which are taken out from the first code book or the short-length signal code book selected by the selecting means; and
  
  controlling means for controlling the first sound source signal or the short-length sound source signals which are taken out from the first code book or the short-length signal code book selected by the selecting means to reduce a difference between the current input speech signal and the synthesized speech signal generated by the synthesis filter.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
- - 2. A speech coding apparatus according to claim 1 in which the first sound source signals stored in the first code book are formed of a past input speech signal preceding to the current input speech signal.
  - 3. A speech coding apparatus according to claim 1 in which the first length of the first sound source signal stored in the first code book is equal to a length of one speech sub-frame, and the second length of the short-length sound source signal stored in the short-length signal code book is equal to a length of one speech micro-frame obtained by dividing the speech sub-frame.
  - 4. A speech coding apparatus according to claim 1 in which the function detecting means comprises:
    - prediction residual signal calculating means for calculating a predicted residual signal indicating a predicted residual between the current input speech signal and a predicted input speech signal; and
      
      cross-correlation calculating means for calculating a cross-correlation between the first sound source signal taken out from the first code book and the predicted residual signal calculated by the prediction residual signal calculating means to detect the depression of the function of the first code book according to a degree of the cross-correlation.
  - 5. A speech coding apparatus according to claim 4, further including:
    - linear prediction analyzing means for analyzing the current input speech signal and a past input speech signal preceding to the current input speech signal to calculate a plurality of linear prediction coefficients, the predicted input speech signal used in the prediction residual signal calculating means being predicted by using the linear prediction coefficients.
  - 6. A speech coding apparatus according to claim 1, further including:
    - sound source signal connecting means for connecting the short-length sound source signals taken out from the short-length signal code book in series, the short-length sound source signals connected in series being changed to the synthesized speech signal in the synthesis filter.
  - 7. A speech coding apparatus according to claim 1, further including:
    - a second code book for storing a plurality of predetermined sound source signals respectively having the first length; and
      
      adding means for linearly adding the first sound source signal taken out from the first code book and a predetermined sound source signal taken out from the second code book to form an excitation sound source signal, the synthesized speech signal being generated from the excitation sound source signal in the synthesis filter.
  - 8. A speech coding apparatus according to claim 1 in which the controlling means comprises:
    - framing means for dividing the current input sound source signal having he first length into a plurality of divided input sound source signals respectively having the second length; and
      
      short-length sound source signal selecting means having a plurality of signal selectors arranged in stages ST₁ to ST_n for receiving the divided input sound source signals divided by the framing means in the signal selectors in one-to-one correspondence, calculating a plurality of signal errors between the divided input sound source signal and a plurality of synthesized speech signals generated from the short-length sound source signals of the short-length signal code book in the synthesis filter in each of the signal selectors, calculating a plurality of accumulated signal errors in each of the signal selectors ST_k (k=2 to n) by adding a limited number of particular accumulated signal errors which are lower than other accumulated signal errors in a signal selector ST_k-1 and the signal errors calculated in the signal selector ST_k to select the limited number of particular accumulated signal errors which are lower than the other accumulated signal errors in the signals selector ST_k, determining a selected accumulated signal error having the lowest value among the particular accumulated signal errors in a final stage ST_n, and selecting a particular short-length sound source signal relating to the selected accumulated signal error from among the short-length sound source signals of the short-length signal code book in each of the signal electors ST₁ to ST_n, the synthesized speech signal being generated from the particular short-length sound source signals selected in the signal selectors ST₁ to ST_n.

9. A speech coding apparatus, comprising:
- a first code book for storing a plurality of past sound source signals respectively having a first length of one speech sub-frame, the past sound source signals being formed of a past input speech signal preceding to a current input speech signal currently input;
  
  a second code book for storing a plurality of predetermined sound source signals respectively having the first length of one speech sub-frame length;
  
  a short-length signal code book for storing a plurality of short-length second source signals respectively having a second length of one micro-frame shorter than the first length, a plurality of a leech micro-frames being formed by dividing one speech sub-frame;
  
  linear prediction analyzing means for analyzing the past input speech signal and the current input speech signal to calculate a plurality of linear prediction coefficients;
  
  prediction residual signal calculating means for calculating a predicted residual signal indicating a predicted residual between the current input speech signal and a predicted input speech signal which is obtained by using the linear prediction coefficients calculated by the linear prediction analyzing means;
  
  cross-correlation calculating means for calculating a cross-correlation between a past sound source signal taken out from the first code book and the predicted residual signal calculated by the prediction residual signal calculating means to detect a depression of a function of the first code book according to a degree of the cross-correlation;
  
  adding means for linearly adding the past sound source signal taken out from the first code book and a predetermined sound source signal taken out from the second code book to form a first excitation sound source signal, a total length of the first excitation sound source signal being equal to the first length;
  
  short-length signal connecting means for connecting a plurality of short-length sound source signals taken out from the short-length signal code book in series to form a second excitation sound source signal, a total length of the second excitation sound source signal being equal to the first length;
  
  selecting leans for selecting the first excitation sound source signal obtained in the adding means in cases where it is detected by the cross-correlation calculating means that the function of the first code book is not depressed and selecting the second excitation sound source signal obtained in the short-length signal connecting means in cases where it is detected by the cross-correlation calculating means that the function of the first code book is depressed;
  
  a synthesized filter for generating a synthesized speech signal from the first excitation sound source signal or the second excitation sound source signal selected by the selecting means according to the linear prediction coefficients calculated by the linear prediction analyzing means; and
  
  controlling means for controlling the past sound source signal taken out from the first code book to the adding means and the short-length sound source signals taken out from the short-length signal code book to reduce a difference between the current input speech signal and the synthesis speech signal generated by the synthesized speech signal generated by the synthesis filter.
- View Dependent Claims (10)
- - 10. A speech coding apparatus according to claim 9 in which the controlling means comprises;
    - framing means for dividing the current input sound source signal having a first length into a plurality of divided input sound source signals respectively having the second length; and
      
      short-length sound source signal selecting means having a plurality of signal selectors arranged in stages ST₁ to ST_n for receiving the divided input sound source signals divided by the framing means in the signal selectors in one-to-one correspondence, calculating a plurality of signal errors between the divided input sound source signal and a plurality of synthesized speech signals generated from the short-length sound source signals of the short-length signal code book in the synthesis filter in each of the signal selectors, calculating a plurality of accumulated signal errors in each of the signal selectors ST_k (k=2 to n) by adding a limited number of particular accumulated signal errors which are lower than other accumulated signal errors in a signal selector ST_k-1 and the signal errors calculated in the signal selector ST_k to select the limited number of particular accumulated signal errors which are lower than the other accumulated signal errors in the signals selector ST_k, determining a selected accumulated signal error having the lowest value among the particular accumulated signal errors in a final stage ST_n, and selecting a particular short-length sound source signal relating to the selected accumulated signal error from among the short-length sound source signals of the short-length signal code book in each of the signal selectors ST₁ to ST_n, the synthesized speech signal being generated from the particular short-length sound source signals selected in the signal selectors ST₁ to ST_n.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Matsushita Electric Industrial Company Limited (Panasonic Holdings Corporation)
Original Assignee
Matsushita Electric Industrial Company Limited (Panasonic Holdings Corporation)
Inventors
Tanaka, Naoya
Primary Examiner(s)
MacDonald, Allen R.
Assistant Examiner(s)
SMITS, TALIVALDIS IVARS

Application Number

US08/490,253
Time in Patent Office

916 Days
Field of Search

395/2.27, 395/2.28, 395/2.29, 395/2.31, 395/2.32
US Class Current

704/219
CPC Class Codes

G10L 19/00   Speech or audio signals ana...

G10L 19/12   the excitation function bei...

G10L 2019/0005   Multi-stage vector quantisa...

Code excited linear prediction coder with a short-length codebook for modeling speech having local peak

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

14 Citations

10 Claims

Specification

Solutions

Use Cases

Quick Links

Code excited linear prediction coder with a short-length codebook for modeling speech having local peak

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

14 Citations

10 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links