Orthogonalized dictionary speech recognition apparatus and method thereof

  • US 4,979,213 A
  • Filed: 07/12/1989
  • Issued: 12/18/1990
  • Est. Priority Date: 07/15/1988
  • Status: Expired due to Term
First Claim

1. A speech recognition system, comprising:

    acoustic analyzing means for converting input speech into an electrical signal and obtaining speech pattern data upon acoustic analysis of said electrical signal;

    means for detecting a speech interval of the electrical signal;

    means for generating sampling pattern data by extracting a predetermined number of samples from speech pattern data included in the detected speech interval;

    means for prestoring sampling pattern data of a plurality of speakers for categories of speech to be recognized, said sampling pattern data including learning pattern data;

    means for forming orthogonalized dictionary data for each speaker on the basis of the sampling pattern data, said forming means forming averaged pattern data of a plurality of sampling pattern data obtained from each speaker;

    means for forming dictionary data of a first axis by smoothing the averaged pattern data in a time base direction;

    means for forming dictionary data of a second axis orthogonal to the first axis by differentiating the averaged pattern data in the time base direction;

    an orthogonalized dictionary for storing the dictionary data of the first and second axes as orthogonal dictionary data;

    means for forming additional orthogonal dictionary data representing feature variations in speech of each speaker and orthogonal to the orthogonal dictionary data stored in said orthogonalized dictionary in accordance with sampling pattern data of each of a second and subsequent of said plurality of speakers on the basis of the orthogonal dictionary data obtained with respect to a first of said plurality of speakers;

    means for selectively storing the additional orthogonal dictionary data in said orthogonalized dictionary;

    means for computing a similarity value between the orthogonal dictionary data stored in said orthogonalized dictionary and the sampling pattern data formed by said sampling pattern data generating means; and

    means for recognizing input speech on the basis of the similarity value.
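
For illustration only, the dictionary-forming and similarity elements of claim 1 can be sketched in Python. This is a minimal sketch under stated assumptions, not the patented implementation. The claim does not give filter coefficients, a storage criterion, or a similarity formula, so the 3-tap kernels (1, 2, 1) and (-1, 0, 1), the Gram-Schmidt step, the `min_norm` threshold, and the normalized sum of squared projections are all assumptions. All function names are hypothetical. A sampling pattern is represented as a list of time frames, each a list of channel values.

```python
import math

def flatten(pattern):
    """Turn a [time][channel] pattern into one flat vector."""
    return [v for frame in pattern for v in frame]

def average(patterns):
    """Element-wise mean of several same-shaped sampling patterns."""
    n = len(patterns)
    frames, chans = len(patterns[0]), len(patterns[0][0])
    return [[sum(p[t][c] for p in patterns) / n for c in range(chans)]
            for t in range(frames)]

def filter_time(pattern, kernel):
    """Apply a 3-tap kernel along the time axis, repeating the edge frames."""
    last = len(pattern) - 1
    out = []
    for t, cur in enumerate(pattern):
        prev, nxt = pattern[max(t - 1, 0)], pattern[min(t + 1, last)]
        out.append([kernel[0] * a + kernel[1] * b + kernel[2] * c
                    for a, b, c in zip(prev, cur, nxt)])
    return out

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def orthonormalize(vec, basis, min_norm):
    """Gram-Schmidt step: strip components along `basis` and normalize.
    Returns None when the residual is too small to be worth storing."""
    r = list(vec)
    for b in basis:
        k = dot(r, b)
        r = [x - k * y for x, y in zip(r, b)]
    norm = math.sqrt(dot(r, r))
    return None if norm < min_norm else [x / norm for x in r]

SMOOTH = (1, 2, 1)   # assumed smoothing kernel (first axis)
DIFF = (-1, 0, 1)    # assumed differentiating kernel (second axis)

def build_dictionary(speakers, min_norm=1e-6):
    """speakers: one list of sampling patterns per speaker, for one category.
    The first speaker yields the first and second axes; later speakers add
    axes only when they are sufficiently independent of the stored ones."""
    basis = []
    for patterns in speakers:
        avg = average(patterns)
        for kernel in (SMOOTH, DIFF):
            axis = orthonormalize(flatten(filter_time(avg, kernel)),
                                  basis, min_norm)
            if axis is not None:          # selective storing (assumed rule)
                basis.append(axis)
    return basis

def similarity(pattern, basis):
    """Fraction of the pattern's energy lying in the dictionary subspace."""
    x = flatten(pattern)
    energy = dot(x, x)
    if energy == 0:
        return 0.0
    return sum(dot(x, b) ** 2 for b in basis) / energy
```

Under these assumptions, recognition would amount to building one such dictionary per speech category and choosing the category whose dictionary gives the highest `similarity` for the input sampling pattern.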
