×

Speech processing system

  • US 9,466,285 B2
  • Filed: 11/26/2013
  • Issued: 10/11/2016
  • Est. Priority Date: 11/30/2012
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method of deriving speech synthesis parameters from an audio signal, the method performed in a device comprising a processor, the method comprising:

  • receiving an input speech audio signal;

    estimating a position of glottal closure incidents from said input speech audio signal;

    deriving a pulsed excitation signal from the position of the glottal closure incidents;

    segmenting said audio signal on the basis of said glottal closure incidents, to obtain segments of said input speech audio signal;

    processing the segments of the input speech audio to obtain a complex cepstrum and deriving a synthesis filter from said complex cepstrum;

    producing a reconstructed speech signal based on the input speech audio signal by passing the pulsed excitation signal derived from the position of the glottal closure incidents through said synthesis filter derived from said complex cepstrum;

    comparing said reconstructed speech signal with said input speech audio signal;

    calculating a difference between the reconstructed speech signal and the input speech audio signal and modifying the pulsed excitation signal and the complex cepstrum to reduce the difference between the reconstructed speech signal and the input speech audio signal,wherein modifying the pulsed excitation signal and the complex cepstrum comprises the process of;

    optimizing the position of the pulses in said excitation signal to reduce a mean between the reconstructed speech signal and the input speech audio signals;

    recalculating the complex cepstrum by optimizing the complex cepstrum by minimizing the difference between the reconstructed speech signal and the input speech audio signal using the optimized pulse positions, andrepeating the process to derive as said speech synthesis parameters the position of the pulses and the complex cepstrum resulting in a minimum difference between the reconstructed speech signal and the input speech audio signal.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×