×

Method of and apparatus for speech recognition wherein decisions are made based on phonemes

  • US 5,131,043 A
  • Filed: 11/20/1989
  • Issued: 07/14/1992
  • Est. Priority Date: 09/05/1983
  • Status: Expired due to Term
First Claim
Patent Images

1. A method for recognizing speech comprising:

  • (a) performing a linear prediction analysis of plural phonemes including the vowels and a nasal sound to calculate pth order LPC cepstrum coefficients in response to periodic frame derived for plural word utterances by plural speakers;

    (b) in response to the calculated LPC cepstrum coefficients calculating a covariance matrix W that is a function of all the phonemes and a mean value mi for each of the particular phonemes, wherei represents the particular phoneme;

    (c) deriving a weighting coefficient ##EQU25## where j=1,2 . . . pδ

    jj'"'"' =value of element jj'"'"' of inverse matrix W-1 of covariance matrix W;

    (d) deriving the values aij, δ

    jj'"'"', mij'"'"', and mit W-1 mi for each of said phonemes as coefficient values for the phonemes;

    (e) in response to known phoneme sounds being uttered by a speaker deriving the value of an LPC cepstrum coefficient for each phoneme;

    (f) storing these LPC cepstrum coefficients with the previously stored coefficient values of the corresponding phonemes to derive standard patterns for the phonemes;

    (g) during a recognition mode while replicas of unknown words including the phonemes are derived;

    (i) performing phoneme segmentation of each unknown word and(ii) for each segmented phoneme determining the similarity of LPC cepstrum coefficients of each segmented phoneme of the unknown words with the stored coefficient values of the standard patterns for the phonemes in accordance with ##EQU26## where t is a matrix transportation factor;

    (h) selecting the standard phoneme most similar to the uttered phoneme in response to the value of Li ;

    (i) combining the selected standard phonemes to form a phoneme string for an uttered word; and

    (j) comparing the formed phoneme string for an uttered word with stored phoneme strings for known words to determined which of the known words is the uttered word.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×