Context-dependent speech recognizer using estimated next word context

  • US 5,233,681 A
  • Filed: 04/24/1992
  • Issued: 08/03/1993
  • Est. Priority Date: 04/24/1992
  • Status: Expired due to Fees
First Claim

1. A speech recognition apparatus comprising:

    means for generating a set of two or more speech hypotheses, each speech hypothesis comprising a partial hypothesis of zero or more words followed by a candidate word selected from a vocabulary of candidate words;

    means for storing a set of word models, each word model representing one or more possible coded representations of an utterance of a word;

    means for generating an initial model of each speech hypothesis, each initial model comprising a model of the partial hypothesis followed by a model of the candidate word;

    an acoustic processor for generating a sequence of coded representations of an utterance to be recognized;

    means for generating an initial hypothesis score for each speech hypothesis, each initial hypothesis score comprising an estimate of the closeness of a match between the initial model of the speech hypothesis and the sequence of coded representations of the utterance;

    means for storing an initial subset of one or more speech hypotheses, from the set of speech hypotheses, having the best initial hypothesis scores;

    next context estimating means for estimating, for each speech hypothesis in the initial subset, a likely word, from the vocabulary of words, which is likely to follow the speech hypothesis;

    means for generating a revised model of each speech hypothesis in the initial subset, each revised model comprising a model of the partial hypothesis followed by a revised model of the candidate word, the revised candidate word model being dependent at least on the word which is estimated to be likely to follow the speech hypothesis;

    means for generating a revised hypothesis score for each speech hypothesis in the initial subset, each revised hypothesis score comprising an estimate of the closeness of a match between the revised model of the speech hypothesis and the sequence of coded representations of the utterance;

    means for storing a reduced subset of one or more speech hypotheses, from the initial subset of speech hypotheses, having the best revised match scores; and

    means for outputting at least one word of one or more of the speech hypotheses in the reduced subset.
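
The flow recited in claim 1 (score every hypothesis with an initial model, keep the best few, estimate a likely next word, rebuild the candidate word model for that context, rescore, and prune again) can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the patent: the label inventory, the `WORD_MODELS`, `CONTEXT_MODELS`, and `LIKELY_NEXT` tables, the position-by-position `match_score`, and the function names. A real recognizer would presumably use probabilistic acoustic models and a far richer next-word estimator.

```python
from typing import Dict, List, Optional, Sequence, Tuple

Labels = Tuple[str, ...]

# Hypothetical context-independent word models: word -> coded label sequence.
WORD_MODELS: Dict[str, Labels] = {
    "the": ("DH", "AH"),
    "cat": ("K", "AE", "T"),
    "cap": ("K", "AE", "P"),
    "sat": ("S", "AE", "T"),
}

# Hypothetical context-dependent variants: (word, following word) -> labels.
CONTEXT_MODELS: Dict[Tuple[str, str], Labels] = {
    ("cat", "sat"): ("K", "AE", "TS"),  # final T merged with the following S
}

# Hypothetical next-word estimator: last word of a hypothesis -> likely next word.
LIKELY_NEXT: Dict[str, str] = {"the": "cat", "cat": "sat", "cap": "sat"}


def build_model(words: Sequence[str], next_word: Optional[str] = None) -> Labels:
    """Concatenate word models; only the final (candidate) word may use a
    context-dependent variant, chosen by the estimated next word."""
    labels: List[str] = []
    for i, word in enumerate(words):
        is_candidate = i == len(words) - 1
        if is_candidate and next_word is not None:
            labels.extend(CONTEXT_MODELS.get((word, next_word), WORD_MODELS[word]))
        else:
            labels.extend(WORD_MODELS[word])
    return tuple(labels)


def match_score(model: Labels, observed: Labels) -> int:
    """Toy closeness measure: matching positions, minus labels past the end."""
    hits = sum(1 for m, o in zip(model, observed) if m == o)
    overrun = max(0, len(model) - len(observed))
    return hits - overrun


def recognize(partial: Sequence[str], candidates: Sequence[str], observed: Labels,
              initial_size: int = 2, reduced_size: int = 1) -> List[Tuple[str, ...]]:
    """Two-stage pruning: initial scores first, then context-revised scores."""
    hypotheses = [tuple(partial) + (c,) for c in candidates]

    # Stage 1: score with initial models and keep the best initial subset.
    initial = sorted(hypotheses,
                     key=lambda h: match_score(build_model(h), observed),
                     reverse=True)[:initial_size]

    # Stage 2: estimate the next word, rebuild the candidate model, rescore.
    def revised_score(h: Tuple[str, ...]) -> int:
        return match_score(build_model(h, LIKELY_NEXT.get(h[-1])), observed)

    return sorted(initial, key=revised_score, reverse=True)[:reduced_size]


observed = ("DH", "AH", "K", "AE", "TS", "S", "AE", "T")
best = recognize(("the",), ["cat", "cap", "sat"], observed)
print(best[0][-1])  # the candidate word of the surviving hypothesis
```

In this toy run "the cat" and "the cap" tie on the initial score, and only the revised, context-dependent model of "cat" (built on the assumption that "sat" follows) separates them. That is the kind of ambiguity the second scoring pass in the claim appears intended to resolve.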
