×

Method for recognizing speech

  • US 7,225,127 B2
  • Filed: 12/11/2000
  • Issued: 05/29/2007
  • Est. Priority Date: 12/13/1999
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for recognizing speech, comprising:

  • (a) receiving a speech phrase;

    (b) generating a signal being representative to said speech phrase;

    (c) pre-processing and storing said signal with respect to a determined set of rules;

    (d) generating from said pre-processed signal at least one series of hypothesis speech elements;

    (e) determining at least one series of words being most probable to correspond to said speech phrase by applying a predefined language model to said at least one series of hypothesis speech elements,wherein determining said at least one series of words further comprises;

    (1) determining at least one sub-word, word, or a combination of words most probably being contained as a seed sub-phrase in said received speech phrase,wherein said seed sub-phrase is recognized with an appropriate high degree of reliability, such that segments of speech that are recognized with high reliability are used to constrain the search in other areas of the speech signal where the language model employed cannot adequately restrict the search; and

    (2) continuing determining words or combinations of words, which are consistent with said seed sub-phrase as at least a first successive sub-phrase which is contained in said received speech phrase, by inserting additional, paired and/or higher order information, including semantic and/or pragmatic information, between the sub-phrases, thereby decreasing the burden of searching,wherein said semantic information includes description of said sub-phrases and said pragmatic information includes connecting information connecting said sub-phrases to actual situation, application, and/or action,wherein the predefined language model contains a low-perplexity recognition grammar obtained from a conventional recognition grammar by;

    (3) identifying and extracting word classes of high-perplexity from the conventional grammar;

    (4) generating a phonetic, phonemic and/or syllabic description of the high-perplexity word classes, in particular by applying a sub-word-unit grammar compiler to them, to produce a sub-word-unit grammar for each high-perplexity word class; and

    (5) merging the sub-word-unit grammars with the remaining low-perplexity part of the conventional grammar to yield said low-perplexity recognition grammar; and

    wherein a language model is used containing at least a recognition grammar built up by at least a low-perplexity part and a high-perplexity part, each of which being representative for distinct low- and high-perplexity classes of speech elements; and

    wherein word classes are used as classes for speech elements or fragments.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×