
Speech analysis-synthesis method and apparatus therefor

  • US 5,293,448 A
  • Filed: 09/03/1992
  • Issued: 03/08/1994
  • Est. Priority Date: 10/02/1989
  • Status: Expired due to Fees
First Claim

1. A speech analyzing apparatus comprising:

  • linear predictive analysis means for performing a linear predictive analysis of an input speech signal for each analysis window of a fixed length to obtain prediction coefficients, said linear predictive analysis means including means for determining whether said input speech signal in an analysis window of fixed length is voiced or unvoiced and for providing a voiced/unvoiced decision signal;

    inverse filter means controlled by said prediction coefficients, for deriving a prediction residual from said input speech signal;

    speech phase equalizing filter means for rendering the phase of said input speech signal into a zero phase to obtain a phase-equalized speech signal;

    prediction residual phase equalizing filter means for rendering the phase of said prediction residual into a zero phase to obtain a phase-equalized prediction residual signal;

    reference time point gathering means for detecting impulses of magnitudes larger than a predetermined threshold value in said phase-equalized prediction residual signal and for outputting the positions of said impulses as reference time points;

    impulse position generating means responsive to said reference time points and said voiced/unvoiced decision signal for producing, based on said reference time points when said decision signal indicates that said speech signal is a voiced sound, differences between successive intervals of said reference time points for comparing the differences with a predetermined limit range, and for determining positions of impulses such that when the differences are within said predetermined limit range, said reference time points are determined as impulse positions, and when said differences are in excess of said predetermined limit range, impulse positions are determined by adding a time point to said reference time points or by omission of one of said reference time points or by shift of one of said reference time points so that the differences between the successive intervals of the processed reference time points are held within said limit range, said impulse positions thus determined being one of the parameters representing the excitation signal as a result of the speech analysis;

    impulse sequence generating means for receiving said impulse positions from said impulse position generating means and generating impulses at said impulse positions;

    all-pole filter means controlled by said prediction coefficients and excited by said generated impulse sequence to generate a synthesized speech; and

    impulse magnitude calculating means for determining magnitude values of said impulses generated by said impulse sequence generating means which minimize an error between a waveform of a synthesized speech obtainable by exciting said all-pole filter means with said impulse sequence and a waveform of said phase-equalized speech supplied from said speech phase equalizing filter means, and means for outputting said impulse magnitudes for use as another one of the parameters representing the excitation signal as a result of the speech analysis by said speech analyzing apparatus.
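
The reference time point gathering and impulse position generating elements of the claim can be illustrated with a short Python sketch. It is only one possible reading, not the patented implementation. The function names, the local-peak test, the rule for choosing between addition, omission and shift, and the sample values are all assumptions made for illustration; the claim itself states only that one of those three operations keeps the differences between successive intervals within the limit range. The sketch also assumes `limit` is much smaller than the pitch period.

```python
def reference_time_points(residual, threshold):
    """Indices of local magnitude peaks above `threshold` (hypothetical helper).

    The local-peak condition is an assumption; the claim only requires
    impulses whose magnitude exceeds a predetermined threshold.
    """
    points = []
    for n in range(1, len(residual) - 1):
        mag = abs(residual[n])
        if (mag > threshold
                and mag >= abs(residual[n - 1])
                and mag > abs(residual[n + 1])):
            points.append(n)
    return points


def regularize_positions(points, limit):
    """Keep differences between successive intervals within +/- `limit`.

    Assumed heuristic: fill gaps of about two periods (addition), drop a
    point closer than half a period (omission), otherwise clamp (shift).
    """
    if len(points) < 3:
        return list(points)
    out = [points[0], points[1]]
    for p in points[2:]:
        prev = out[-1] - out[-2]
        # Addition: insert points while the gap spans about two periods.
        while prev > 2 * limit and p - out[-1] >= 2 * prev - limit:
            out.append(out[-1] + prev)
        interval = p - out[-1]
        if abs(interval - prev) <= limit:
            out.append(p)          # within the limit range: keep as is
        elif interval < prev / 2:
            continue               # omission: spurious extra point
        else:
            # Shift: clamp the interval to [prev - limit, prev + limit].
            clamped = min(max(interval, prev - limit), prev + limit)
            out.append(out[-1] + clamped)
    return out


if __name__ == "__main__":
    peaks = reference_time_points([0, 0.1, 0.9, 0.1, 0, -0.8, 0, 0.2, 0], 0.5)
    print(peaks)
    print(regularize_positions([0, 80, 160, 320, 330, 400, 495], limit=5))
```

In the second call, the missing point at 240 is added, the spurious point at 330 is omitted, and the late point at 495 is shifted to 485, so every interval differs from its predecessor by at most 5 samples.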

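The impulse magnitude calculating element can likewise be sketched as a least-squares problem, assuming the error is the plain sum of squared differences between the synthesized waveform and the phase-equalized speech over one frame. The claim does not specify the error measure or the solver, so the normal-equation approach, the sign convention A(z) = 1 + a1 z^-1 + ... for the prediction coefficients, and all function names below are assumptions. Impulse positions are assumed distinct and inside the frame.

```python
def synthesize(positions, magnitudes, a, length):
    """Excite the all-pole filter 1/A(z) with impulses (assumed sign convention)."""
    excitation = [0.0] * length
    for p, g in zip(positions, magnitudes):
        excitation[p] += g
    y = []
    for n in range(length):
        v = excitation[n]
        for i, coeff in enumerate(a, start=1):
            if n - i >= 0:
                v -= coeff * y[n - i]
        y.append(v)
    return y


def solve_linear(matrix, rhs):
    """Gaussian elimination with partial pivoting for a small square system."""
    n = len(rhs)
    aug = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = aug[r][n] - sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = s / aug[r][r]
    return x


def impulse_magnitudes(target, positions, a):
    """Magnitudes minimizing the squared error between synthesis and `target`."""
    n = len(target)
    # Each column is the filter's unit-impulse response delayed to one position.
    cols = [synthesize([p], [1.0], a, n) for p in positions]
    gram = [[sum(u * v for u, v in zip(ci, cj)) for cj in cols] for ci in cols]
    rhs = [sum(u * v for u, v in zip(ci, target)) for ci in cols]
    return solve_linear(gram, rhs)


if __name__ == "__main__":
    a = [-0.9]                      # illustrative one-pole filter
    positions = [2, 10, 18]
    target = synthesize(positions, [1.0, -0.5, 0.75], a, 30)
    print(impulse_magnitudes(target, positions, a))
```

Because the synthesized waveform is linear in the impulse magnitudes, the minimum is found by solving one small linear system per frame, with one unknown per impulse position.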