Automated tuning of speech recognition parameters

  • US 9,129,599 B2
  • Filed: 10/18/2007
  • Issued: 09/08/2015
  • Est. Priority Date: 10/18/2007
  • Status: Active Grant
First Claim

1. A method for providing dynamically loaded speech recognition parameters from a server to a speech recognition engine, comprising:

  • (A) with the server, on a first occasion after a first speech recognition session has been initiated between a first user and the speech recognition engine;

    executing at least one rule for selecting speech recognition parameters for use by a speech recognition engine, wherein the at least one rule comprises an if-portion including criteria and a then-portion specifying values of speech recognition parameters that must be used by the speech recognition engine for evaluating natural language options of a grammar when the criteria is met;

    selecting first values of a set of speech recognition parameters responsive to executing the at least one rule on the first occasion; and

    communicating to the speech recognition engine the selected first values of the set of speech recognition parameters for performing speech recognition of the first user on the first occasion;

    (B) with the speech recognition engine, for a first utterance by the first user on the first occasion;

    receiving the selected first values of the set of speech recognition parameters from the server;

    using the received selected first values of the set of speech recognition parameters to evaluate the acoustic properties of the first utterance to determine first acoustic scores for the natural language options of the grammar;

    combining the determined first acoustics scores for the natural language options of the grammar with grammar weights for the natural language options of the grammar to compute first computed scores for the natural language options of the grammar by using grammar weights for the natural language options of the grammar to bias the determined first acoustic scores; and

    choosing the natural language option of the grammar having the highest first computed score as the natural language interpretation of the first utterance;

    (C) with the server, on a second occasion after a second speech recognition session has been initiated between a second user and the speech recognition engine;

    executing the at least one rule;

    selecting second values of the set of speech recognition parameters responsive to executing the at least one rule on the second occasion; and

    communicating to the speech recognition engine the selected second values of the set of speech recognition parameters for performing speech recognition of the second user on the second occasion; and

    (D) with the speech recognition engine, for a second utterance by the second user on the second occasion;

    receiving the selected second values of the set of speech recognition parameters from the server;

    using the received selected second values of the set of speech recognition parameters to evaluate the acoustic properties of the second utterance to determine second acoustic scores for the natural language options of the grammar;

    combining the determined second acoustics scores for the natural language options of the grammar with grammar weights for the natural language options of the grammar to compute second computed scores for the natural language options of the grammar by using grammar weights for the natural language options of the grammar to bias the determined second acoustic scores; and

    choosing the natural language option of the grammar having the highest second computed score as the natural language interpretation of the second utterance;

    wherein the set of speech recognition parameters comprises one or both of an accuracy setting and a sensitivity value.
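As an illustration only, the flow recited in claim 1 can be sketched in Python. This is not the patented implementation. The names Rule, select_parameters, recognize and toy_acoustic_score are hypothetical; the toy acoustic scorer merely scales a precomputed per-option score by the sensitivity value; and multiplying the acoustic score by the grammar weight is an assumed way of biasing, since the claim does not fix a formula.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Params = Dict[str, float]    # e.g. {"accuracy": ..., "sensitivity": ...}
Context = Dict[str, str]     # hypothetical session facts the criteria inspect


@dataclass
class Rule:
    """One rule: an if-portion (criteria) and a then-portion (parameter values)."""
    criteria: Callable[[Context], bool]
    values: Params


def select_parameters(rules: List[Rule], context: Context, defaults: Params) -> Params:
    """Server side, steps (A)/(C): execute the rules and select parameter values."""
    selected = dict(defaults)
    for rule in rules:
        if rule.criteria(context):
            # The then-portion overrides the defaults when the criteria are met.
            selected.update(rule.values)
    return selected


def toy_acoustic_score(utterance: Dict[str, float], option: str, params: Params) -> float:
    """Toy stand-in for acoustic evaluation (assumption, not a real engine).

    `utterance` maps each grammar option to a precomputed raw score, which is
    scaled by the sensitivity value received from the server.
    """
    return utterance[option] * params["sensitivity"]


def recognize(
    utterance: Dict[str, float],
    grammar_weights: Dict[str, float],
    params: Params,
    acoustic_score: Callable[[Dict[str, float], str, Params], float] = toy_acoustic_score,
) -> str:
    """Engine side, steps (B)/(D): score each option and choose the highest."""
    computed = {
        # Assumed biasing: acoustic score multiplied by the grammar weight.
        option: acoustic_score(utterance, option, params) * weight
        for option, weight in grammar_weights.items()
    }
    return max(computed, key=computed.get)


if __name__ == "__main__":
    rules = [Rule(lambda ctx: ctx.get("channel") == "mobile", {"sensitivity": 0.8})]
    defaults = {"accuracy": 0.5, "sensitivity": 0.5}

    # First occasion: a session for which the rule's criteria are met.
    first_params = select_parameters(rules, {"channel": "mobile"}, defaults)
    # Second occasion: a session for which they are not, so defaults apply.
    second_params = select_parameters(rules, {"channel": "landline"}, defaults)

    weights = {"yes": 0.6, "no": 0.4}
    print(first_params, recognize({"yes": 0.5, "no": 0.6}, weights, first_params))
    print(second_params, recognize({"yes": 0.2, "no": 0.9}, weights, second_params))
```

In the first call the raw acoustic scores alone would favor "no", but the grammar weights tip the computed score toward "yes", which is the kind of biasing the combining step describes.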
