Dynamic semantic control of a speech recognition system
DCFirst Claim
1. A method for processing an utterance by a speech recognizer comprising:
- processing a plurality of rules, each rule being associated with a condition external to the speech recognizer in which that rule is applicable and being based at least in part on an underlying meaning of one or more utterances;
receiving an utterance and producing a plurality of recognition hypotheses for the received utterance;
determining rules that are applicable to the utterance;
applying the determined rules to the plurality of recognition hypotheses for the received utterance; and
determining a recognition result for the received utterance based on results of applying the determined rules.
5 Assignments
Litigations
0 Petitions
Accused Products
Abstract
A method and apparatus are provided for automatically recognizing words of spoken speech using a computer-based speech recognition system according to a dynamic semantic model. In an embodiment, the speech recognition system recognizes speech and generates one or more word strings, each of which is a hypothesis of the speech, and creates and stores a probability value or score for each of the word strings. The word strings are ordered by probability value. The speech recognition system also creates and stores, for each of the word strings, one or more keyword-value pairs that represent semantic elements and semantic values of the semantic elements for the speech that was spoken. One or more dynamic semantic rules are defined that specify how a probability value of a word string should be modified based on information about external conditions, facts, or the environment of the application in relation to the semantic values of that word string. A speech recognition application, upon receiving the keyword-value pairs, instructs the speech recognizer to modify one or more of the probability values, according to the dynamic semantic rules. The dynamic semantic rules are applied to the word strings and the keyword-value pairs. The speech recognizer modifies one or more of the probability values, re-orders the word strings, and returns control to the application. As a result, the speech recognizer may adjust dynamically to a changing likelihood that a speaker uttered a particular utterance, as the application executes, depending on the context of the application and the external factors.
30 Citations
36 Claims
-
1. A method for processing an utterance by a speech recognizer comprising:
-
processing a plurality of rules, each rule being associated with a condition external to the speech recognizer in which that rule is applicable and being based at least in part on an underlying meaning of one or more utterances; receiving an utterance and producing a plurality of recognition hypotheses for the received utterance; determining rules that are applicable to the utterance; applying the determined rules to the plurality of recognition hypotheses for the received utterance; and determining a recognition result for the received utterance based on results of applying the determined rules. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. Software stored on a computer-readable medium for causing a computer to perform steps comprising:
-
processing a plurality of rules, each rule being associated with a condition external to a speech recognizer in which that rule is applicable and being based at least in part on an underlying meaning of one or more utterances; receiving an utterance and producing a plurality of recognition hypotheses for the received utterance; determining rules that are applicable to the received utterance;
applying the determined rules to the plurality of recognition hypotheses for the received utterance processed by the speech recognizer; anddetermining a recognition result for the received utterance based on results of applying the determined rules. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36)
-
Specification