×

Dynamic time warping using a digital signal processor

  • US 4,918,733 A
  • Filed: 07/30/1986
  • Issued: 04/17/1990
  • Est. Priority Date: 07/30/1986
  • Status: Expired due to Term
First Claim
Patent Images

1. A method for performing automatic voice recognition comprising the steps of:

  • storing in a reference memory a plurality of reference speech pattern templates each comprising a plurality of reference time frames having sets of acoustic features signals and each representative of a prescribed spoken reference speech pattern;

    analyzing by a feature extraction unit a speech utterance to determine a plurality of unknown time frames to obtain sets of acoustic feature signals;

    initially storing in an unknown word memory and said sets of acoustic feature signals of said plurality of unknown time frames;

    obtaining a final accumulated correspondence signal for one of said templates by a pattern matcher performing the substeps of(1) forming an initial sequence of accumulated correspondence signals in response to the sets of acoustic feature signals of one of said templates and one set of acoustic feature signals of a first one of said plurality of unknown time frames;

    (2) storing in said unknown word memory said initial sequence in a sequential group of memory locations in descending chronological order;

    (3) determining a local correspondence signal corresponding to an individual memory location in said sequential group of memory locations from said sets of acoustic features for said plurality of unknown time frames and said one of said templates;

    (4) calculating a accumulated correspondence signal of the sequence of accumulated correspondence signals corresponding to said individual memory location in said group of memory locations for a second one of said plurality of unknown time frames by adding said local correspondence signal to the minimum of the contents of said individual memory location and absolute values of first and second memory locations following said individual memory location;

    (5) storing the negative of the calculated accumulated correspondence signal into said first individual memory location upon said contents of said individual memory location being the minimum and otherwise storing said calculated accumulated correspondence signal into said individual memory location;

    (6) repeating substeps 3 through 5 to obtain the accumulated correspondence signals in descending order for each memory location of said group of memory locations;

    (7) repeating substeps 3 through 6 to obtain the final accumulated correspondence signal of the sequence of said accumulated correspondence signals for the remaining ones of said plurality of said unknown time frames;

    obtaining a final accumulated correspondence signal for each of the remaining ones of said templates by said pattern matcher performing substeps 1 through 7 for each of the remaining ones of said templates; and

    indicating by a decision unit in response to the final accumulated correspondence signals the speech pattern represented by said speech utterance.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×