Speaker independent speech recognition system and method using neural network and DTW matching technique

  • US 5,528,728 A
  • Filed: 07/12/1993
  • Issued: 06/18/1996
  • Est. Priority Date: 07/12/1993
  • Status: Expired due to Term
First Claim

1. A speaker independent apparatus for word recognition comprising:

    a) input means for inputting an utterance by an unspecified person into an electrical signal;

    b) characteristic extracting means for receiving the electrical signal from the input means and converting the electrical signal into a time series of discrete characteristic multidimensional vectors;

    c) phoneme recognition means for selectively receiving the time series of discrete characteristic multidimensional vectors and converting each of said selectively received vectors into a plurality of candidates of phonemes from a first order to an n-th order (n denotes an arbitrary number);

    d) word recognition means for receiving a time series string of phonemes from the phoneme recognition means and comparing the plurality of candidates of phonemes, one at a time, with each phoneme of a reference string of phonemes for words previously stored in a dictionary until a final phoneme of the reference string of phonemes for a last word of the words stored in the dictionary and determining a time series of phonemes derived from said phoneme recognition means having a highest similarity to one of the reference strings of the phonemes for the words stored in the dictionary using a predetermined word matching technique;

    e) output means for outputting at least one of said candidates of phonemes as a result of word recognition carried out by the word recognition means on the basis of a similarity determination on the plurality of candidates of phonemes with respect to the reference strings of the words stored in said dictionary; and

    f) selecting means, interposed between said characteristic extracting means and phoneme recognition means, for selecting a center of a given number of frames of a continued time-series of discrete characteristic multidimensional vectors derived from said characteristic extracting means so that said phoneme recognition means receives the center thereof.
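As a rough illustration of elements c), d) and f), the sketch below shows one possible reading of the claim, not the patent's actual implementation. The function names (`select_centers`, `local_cost`, `dtw_distance`, `recognize`), the non-overlapping grouping of frames, and the rank-based local cost are all assumptions made for this example. Each observed time step carries a ranked list of phoneme candidates (first order to n-th order), and dynamic time warping (DTW) aligns that series against each reference phoneme string in a dictionary.

```python
from typing import Dict, List, Sequence


def select_centers(frames: Sequence, window: int) -> List:
    """Keep the center frame of each consecutive group of `window` frames.

    Assumption: groups do not overlap; the claim does not specify this.
    """
    half = window // 2
    return [frames[i + half] for i in range(0, len(frames) - window + 1, window)]


def local_cost(candidates: Sequence[str], phoneme: str) -> int:
    """Cost 0 for a first-order match, 1 for second order, and so on.

    Assumption: a phoneme absent from the candidate list costs n,
    the number of candidates.
    """
    if phoneme in candidates:
        return list(candidates).index(phoneme)
    return len(candidates)


def dtw_distance(observed: Sequence[Sequence[str]], reference: Sequence[str]) -> float:
    """Classic DTW between ranked candidate lists and a reference phoneme string."""
    inf = float("inf")
    rows, cols = len(observed), len(reference)
    # d[i][j] = best cumulative cost aligning observed[:i] with reference[:j]
    d = [[inf] * (cols + 1) for _ in range(rows + 1)]
    d[0][0] = 0.0
    for i in range(1, rows + 1):
        for j in range(1, cols + 1):
            best_prev = min(d[i - 1][j], d[i][j - 1], d[i - 1][j - 1])
            d[i][j] = local_cost(observed[i - 1], reference[j - 1]) + best_prev
    return d[rows][cols]


def recognize(observed: Sequence[Sequence[str]], dictionary: Dict[str, List[str]]) -> str:
    """Return the dictionary word whose reference string has the lowest DTW cost."""
    return min(dictionary, key=lambda word: dtw_distance(observed, dictionary[word]))
```

With a hypothetical two-word dictionary `{"cat": ["k", "a", "t"], "dog": ["d", "o", "g"]}` and an observed series whose top candidates read k, k, a, t, `recognize` picks "cat", because DTW absorbs the repeated "k" at no extra cost.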
