×

Low complexity, high accuracy clustering method for speech recognizer

  • US 5,806,030 A
  • Filed: 05/06/1996
  • Issued: 09/08/1998
  • Est. Priority Date: 05/06/1996
  • Status: Expired due to Fees
First Claim
Patent Images

1. A clustering method for processing speech training data to generate a set of low complexity statistical models for use in automated speech recognition, comprising:

  • segmenting the training data into labeled subword units;

    generating Hidden Markov Models to represent said subword units,selecting a desired number of models to be between a predetermined minimum and a predetermined maximum by adjusting a threshold on the number of examples per model;

    training said models with said segmented training data to generate;

    (a) a first plurality of populated models based on instances of training data above a said threshold, and(b) a second plurality of populated models based on instances of training data below a said threshold;

    merging each model of said second plurality with the closest neighbor of the models of said first plurality to form a set of new models and retraining the new models.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×