System and method for recognizing speech
First Claim
Patent Images
1. A method for recognizing speech including a sequence of words, comprising:
- generating a set of interpretations of the speech using an acoustic model and a language model;
determining, for each interpretation, a score representing correctness of an interpretation in representing the sequence of words to produce a set of scores;
determining a constraint for recognizing the speech subject to a word sequence constraint;
determining a constraint factor indicating a degree of the consistency with the word sequence constraint;
determining a constrained scoring function based on the constraint factor for updating the set of scores; and
updating the set of scores using the scoring function; and
selecting, according to the updated set of scores, a best interpretation from the set of interpretations as the recognized speech, wherein steps of the method are performed by a processor.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and a method recognize speech including a sequence of words. A set of interpretations of the speech is generated using an acoustic model and a language model, and, for each interpretation, a score representing correctness of an interpretation in representing the sequence of words is determined to produce a set of scores. Next, the set of scores is updated based on a consistency of each interpretation with a constraint determined in response to receiving a word sequence constraint.
15 Citations
18 Claims
-
1. A method for recognizing speech including a sequence of words, comprising:
-
generating a set of interpretations of the speech using an acoustic model and a language model; determining, for each interpretation, a score representing correctness of an interpretation in representing the sequence of words to produce a set of scores; determining a constraint for recognizing the speech subject to a word sequence constraint; determining a constraint factor indicating a degree of the consistency with the word sequence constraint; determining a constrained scoring function based on the constraint factor for updating the set of scores; and updating the set of scores using the scoring function; and selecting, according to the updated set of scores, a best interpretation from the set of interpretations as the recognized speech, wherein steps of the method are performed by a processor. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A method for recognizing speech of a user, comprising:
-
recognizing the speech to generate a set of interpretations associated with a corresponding set of scores representing correctness of each interpretation in representing the speech; and updating iteratively the set of scores subject to at least one constraint, such that, for each iteration, a score of each interpretation is increased if the interpretation is consistent with the constraint, and is decreased if the interpretation is inconsistent with the constraint; and selecting, according to the updated set of scores, an interpretation from the set of interpretations as the recognized speech, wherein steps of the method are performed by a processor, wherein the updating comprises; communicating a subset of the set of interpretations to a user; receiving a word sequence constraint in response to the communicating; determining a type of the constraint based on the word sequence constraint, wherein the type is a context type; determining a scoring function testing presence or absence of a specific word in each interpretation; determining a direction of the speech based on a language model; updating the scoring function with a test for presence of words preceding and following the specific word according to the direction of the speech; and determining the constraint based on the type.
-
-
16. A system for recognizing speech, comprising:
-
a processor implementing a speech recognition module and an error correction module, wherein the speech recognition module generates a set of interpretations of the speech input using an acoustic model and a language model, determines, for each interpretation, a score representing correctness of an interpretation in representing the speech and selects, according to the score of each interpretation, a best interpretation from the set of interpretation as the recognized speech; and
whereinthe error correction module determines a constraint for recognizing the speech, and updates the score of each interpretation based on a consistency of the interpretation with the constraint, wherein the constraint is determined by; communicating a subset of the set of interpretations to a user; receiving a word sequence constraint in response to the communicating; determining a type of the constraint based on the word sequence constraint, wherein the type is a context type; determining a scoring function testing presence or absence of a specific word in each interpretation; determining a direction of the speech based on a language model; updating the scoring function with a test for presence of words preceding and following the specific word according to the direction of the speech; and determining the constraint based on the type. - View Dependent Claims (17, 18)
-
Specification