Speech recognition system for providing voice recognition services using a conversational language model
First Claim
1. A computer-implemented method for performing voice recognition, at least in part, using a language model incorporating a plurality of words and a plurality of n-grams, each of the plurality of n-grams formed by two or more of the plurality of words and defining a probability of occurrence of each word in the respective n-gram given the occurrence of one of the two or more of the plurality of words forming the respective n-gram, the method comprising:
- receiving a first text comprising a sequence of one or more words from a first participant in a text-based conversation with a second participant via at least one first user device;
receiving a voice response provided by the second participant in the text-based conversation in response to the first text;
selecting at least one of the plurality of n-grams from the language model that includes at least one word in the first text; and
automatically recognizing at least a portion of the voice response to provide a second text, at least in part, by using the at least one selected n-gram;
maintaining a history of recognized words and received texts in the text-based conversation; and
maintaining an identifier representing a change in speaker from the first or the second participant to the other.
3 Assignments
0 Petitions
Accused Products
Abstract
Embodiments of the present invention provide a method, system and article of manufacture for adjusting a language model within a voice recognition system, based on text received from an external application. The external application may supply text representing the words of one participant to a text-based conversation. In such a case, changes may be made to a language model by analyzing the external text received from the external application.
-
Citations
27 Claims
-
1. A computer-implemented method for performing voice recognition, at least in part, using a language model incorporating a plurality of words and a plurality of n-grams, each of the plurality of n-grams formed by two or more of the plurality of words and defining a probability of occurrence of each word in the respective n-gram given the occurrence of one of the two or more of the plurality of words forming the respective n-gram, the method comprising:
-
receiving a first text comprising a sequence of one or more words from a first participant in a text-based conversation with a second participant via at least one first user device; receiving a voice response provided by the second participant in the text-based conversation in response to the first text; selecting at least one of the plurality of n-grams from the language model that includes at least one word in the first text; and automatically recognizing at least a portion of the voice response to provide a second text, at least in part, by using the at least one selected n-gram; maintaining a history of recognized words and received texts in the text-based conversation; and maintaining an identifier representing a change in speaker from the first or the second participant to the other. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. At least one non-transitory computer-readable storage medium containing a program which, when executed on at least one computer, performs a method of performing voice recognition, at least in part, using a language model incorporating a plurality of words and a plurality of n-grams, each of the plurality of n-grams formed by two or more of the plurality of words and defining a probability of occurrence of each word in the respective n-gram given the occurrence of one of the two or more of the plurality of words forming the respective n-gram, the method comprising:
-
receiving a first text comprising a sequence of one or more words from a first participant in a text-based conversation with a second participant via at least one first user device; receiving a voice response provided by the second participant in the text-based conversation in response to the first text; selecting at least one of the plurality of n-grams from the language model that includes at least one word in the first text; and automatically recognizing at least a portion of the voice response to provide a second text, at least in part, by using the at least one selected n-gram; maintaining a history of recognized words and received texts in the text-based conversation; and maintaining an identifier representing a change in speaker from the first or the second participant to the other. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A computing device that performs voice recognition, at least in part, using a language model incorporating a plurality of words and a plurality of n-grams, each of the plurality of n-grams formed by two or more of the plurality of words and defining a probability of occurrence of each word in the respective n-gram given the occurrence of one of the two or more of the plurality of words forming the respective n-gram, the computing device comprising:
-
at least one input to receive a first text comprising a sequence of one or more words from a first participant in a text-based conversation with a second participant via at least one user device and receive a voice response provided by the second participant in the text-based conversation in response to the first text; and at least one processor coupled to the at least one input, the at least one processor programmed to; select at least one of the plurality of n-grams from the language model that includes at least one word in the first text; and automatically recognize at least a portion of the voice response to provide a second text, at least in part, by using the at least one selected n-gram; maintain a history of recognized words and received texts in the text-based conversation; and maintain an identifier representing a change in speaker from the first or the second participant to the other. - View Dependent Claims (14, 15, 16, 17, 18)
-
-
19. A computer-implemented method for performing voice recognition, at least in part, using a language model incorporating a plurality of words and a plurality of n-grams, each of the plurality of n-grams formed by two or more of the plurality of words and defining a probability of occurrence of each word in the respective n-gram given the occurrence of one of the two or more of the plurality of words forming the respective n-gram, the method comprising:
-
receiving a first text comprising a sequence of one or more words from a first participant in a text-based conversation with a second participant via at least one first user device; receiving a voice response provided by the second participant in the text-based conversation in response to the first text; selecting at least one of the plurality of n-grams from the language model that includes at least one word in the first text; and automatically recognizing at least a portion of the voice response to provide a second text, at least in part, by using the at least one selected n-gram, wherein automatically recognizing comprises determining whether a number of messages from the first participant to the second participant exceeds a predetermined threshold. - View Dependent Claims (20, 21)
-
-
22. At least one non-transitory computer-readable storage medium containing a program which, when executed on at least one computer, performs a method of performing voice recognition, at least in part, using a language model incorporating a plurality of words and a plurality of n-grams, each of the plurality of n-grams formed by two or more of the plurality of words and defining a probability of occurrence of each word in the respective n-gram given the occurrence of one of the two or more of the plurality of words forming the respective n-gram, the method comprising:
-
receiving a first text comprising a sequence of one or more words from a first participant in a text-based conversation with a second participant via at least one first user device; receiving a voice response provided by the second participant in the text-based conversation in response to the first text; selecting at least one of the plurality of n-grams from the language model that includes at least one word in the first text; and automatically recognizing at least a portion of the voice response to provide a second text, at least in part, by using the at least one selected n-gram, wherein automatically recognizing comprises determining whether a number of messages from the first participant to the second participant exceeds a predetermined threshold. - View Dependent Claims (23, 24)
-
-
25. A computing device that performs voice recognition, at least in part, using a language model incorporating a plurality of words and a plurality of n-grams, each of the plurality of n-grams formed by two or more of the plurality of words and defining a probability of occurrence of each word in the respective n-gram given the occurrence of one of the two or more of the plurality of words forming the respective n-gram, the computing device comprising:
-
at least one input to receive a first text comprising a sequence of one or more words from a first participant in a text-based conversation with a second participant via at least one user device and receive a voice response provided by the second participant in the text-based conversation in response to the first text; and at least one processor coupled to the at least one input, the at least one processor programmed to; select at least one of the plurality of n-grams from the language model that includes at least one word in the first text; and automatically recognize at least a portion of the voice response to provide a second text, at least in part, by using the at least one selected n-gram, wherein the at least one processor is programmed to automatically recognize at least a portion of the voice response at least in part by determining whether a number of messages from the first participant to the second participant exceeds a predetermined threshold. - View Dependent Claims (26, 27)
-
Specification