×

Audio analysis/synthesis system

  • US 5,327,518 A
  • Filed: 08/22/1991
  • Issued: 07/05/1994
  • Est. Priority Date: 08/22/1991
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method of extracting a set of parameters representative of input speech signals representing speech from a human vocal tract, the vocal tract having a frequency response capable of representation as a set of coefficients, such that artifact-free, modified synthetic speech signals can be generated from said parameters, comprising the steps of:

  • (a) digitizing the input speech signals into a speech data stream;

    (b) isolating a sequence of overlapping speech data frames from the speech data stream, each of said speech data frames having a fundamental frequency;

    (c) analyzing the sequence of overlapping speech data frames to produce a corresponding sequence of coefficient sets representative of an estimate of the frequency response of the human vocal tract;

    (d) multiplying each of the overlapping speech data frames by an analysis window function to create a corresponding sequence of windowed data frames;

    (e) calculating the discrete Fourier transform of each of the windowed data frames to produce a corresponding sequence of transformed data frames;

    (f) approximating the corresponding sequence of overlapping speech data frames with a sequence of sinusoidal parameter sets using a first iterative analysis-by-synthesis means responsive to the sequence of transformed data frames and a discrete Fourier transform of the analysis window function;

    (g) analyzing the sequence of sinusoidal parameter sets and the corresponding sequence of coefficient sets with a fundamental frequency estimator means to produce a sequence of estimates of the fundamental frequency of the corresponding overlapping speech data frames; and

    (h) analyzing the sequence of fundamental frequency estimates and the corresponding sequence of sinusoidal parameter sets with a harmonic assignment means to produce a sequence of quasi-harmonic sinusoidal model parameter sets;

    the set of parameters representative of the input speech signals comprising the sequence of coefficient sets representative of the estimate of the frequency response of the human vocal tract, the sequence of estimates of the fundamental frequency, and the sequence of quasi-harmonic sinusoidal model parameter sets.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×