System and method of performing speech recognition based on a user identifier
Abstract
Speech recognition models are dynamically re-configurable based on user information, application information, background information such as background noise, and transducer information such as transducer response characteristics, to provide users with alternate input modes to keyboard text entry. Word recognition lattices are generated for each data field of an application and dynamically concatenated into a single word recognition lattice. A language model is applied to the concatenated word recognition lattice to determine the relationships between the word recognition lattices, and the process is repeated until the generated word recognition lattices are acceptable or differ from a predetermined value only by a threshold amount. These techniques of dynamically re-configurable speech recognition provide for deployment of speech recognition on small devices such as mobile phones and personal digital assistants, as well as in environments such as office, home or vehicle, while maintaining the accuracy of the speech recognition.
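The lattice concatenation described in the abstract can be illustrated with a minimal sketch. It assumes a much simplified lattice (a sequence of slots, each holding alternative words with acoustic log-scores) and a hypothetical bigram table; the names `concatenate`, `best_path`, and the example data are illustrative and do not come from the patent. The repeat-until-acceptable loop mentioned in the abstract is omitted.

```python
import math
from typing import Dict, List, Tuple

# Simplifying assumption: a "lattice" is a sequence of slots, each slot a list
# of (word, acoustic_log_score) alternatives. Real word lattices are graphs.
Lattice = List[List[Tuple[str, float]]]


def concatenate(lattices: List[Lattice]) -> Lattice:
    """Join the per-field lattices end to end into a single lattice."""
    combined: Lattice = []
    for lattice in lattices:
        combined.extend(lattice)
    return combined


def best_path(lattice: Lattice,
              bigram: Dict[Tuple[str, str], float],
              unseen: float = math.log(1e-3)) -> List[str]:
    """Viterbi search combining acoustic scores with bigram log-probabilities."""
    # best maps the last word of a partial path to (total score, word sequence).
    best = {"<s>": (0.0, [])}
    for slot in lattice:
        nxt = {}
        for word, acoustic in slot:
            candidates = [
                (score + acoustic + bigram.get((prev, word), unseen), path + [word])
                for prev, (score, path) in best.items()
            ]
            nxt[word] = max(candidates, key=lambda c: c[0])
        best = nxt
    return max(best.values(), key=lambda c: c[0])[1]


# Hypothetical lattices for two data fields of a form.
name_lattice: Lattice = [[("john", -1.0), ("joan", -1.1)]]
city_lattice: Lattice = [[("austin", -1.0), ("boston", -0.9)]]
# Hypothetical language model linking the two fields.
bigram = {("joan", "austin"): math.log(0.8)}

combined = concatenate([name_lattice, city_lattice])
print(best_path(combined, bigram))
```

In this toy setup the acoustic scores alone favor "john" and "boston", while the language model applied across the concatenated lattice shifts the result to the pair it considers related.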
20 Claims
1. A method of performing speech recognition comprising:
receiving a voice request from a speaker;
receiving a data field selection by the speaker;
applying one of a plurality of language models to the received voice request for speech recognition based on the selected data field, wherein the speech recognition uses a different language model for each data field selected by the speaker;
determining an identity of the speaker based, at least in part, on a user identifier;
repeatedly determining parameters of a background model based on sampled information collected at a periodic time interval during the received voice request;
determining parameters of a transducer model; and
adapting a speech recognition model based on user-specific transformations corresponding to the determined identity of the speaker and on at least one of the background model or the transducer model.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
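As a rough illustration of the selection, identification, and adaptation steps of claim 1, the sketch below uses hypothetical in-memory tables and a toy adaptation rule. The claim does not specify the form of the language models, the user-specific transformations, or the background and transducer parameters, so the names `LANGUAGE_MODELS`, `USER_TRANSFORMS`, and `adapt_means`, and the scale-then-shift rule, are all assumptions made for the example.

```python
from typing import Dict, List, Optional, Sequence

# Hypothetical stores; the claim does not say how these are kept.
LANGUAGE_MODELS: Dict[str, Dict[str, float]] = {
    "name": {"john": 0.6, "joan": 0.4},
    "city": {"austin": 0.5, "boston": 0.5},
}
USER_TRANSFORMS: Dict[str, List[float]] = {"user-42": [2.0, 0.5]}


def select_language_model(data_field: str) -> Dict[str, float]:
    """Pick the language model tied to the selected data field."""
    return LANGUAGE_MODELS[data_field]


def identify_speaker(user_identifier: str) -> str:
    """Resolve the speaker from a user identifier (here, a direct lookup)."""
    if user_identifier not in USER_TRANSFORMS:
        raise KeyError(f"unknown user identifier: {user_identifier}")
    return user_identifier


def adapt_means(means: Sequence[float],
                speaker: str,
                background: Optional[Sequence[float]] = None,
                transducer: Optional[Sequence[float]] = None) -> List[float]:
    """Toy adaptation: scale model means by the user transform, then shift by
    whichever of the background or transducer offsets is supplied."""
    if background is None and transducer is None:
        raise ValueError("need a background model or a transducer model")
    scale = USER_TRANSFORMS[speaker]
    adapted = [s * m for s, m in zip(scale, means)]
    for offset in (background, transducer):
        if offset is not None:
            adapted = [a + o for a, o in zip(adapted, offset)]
    return adapted


speaker = identify_speaker("user-42")
city_model = select_language_model("city")
adapted = adapt_means([1.0, 2.0], speaker, background=[0.5, 0.5])
```

The check that at least one of the two offsets is present mirrors the "at least one of the background model or the transducer model" wording; everything else about the arithmetic is a placeholder.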
9. A system for performing speech recognition, the system comprising:
a module configured to receive a voice request from a speaker;
a module configured to receive a data field selection by the speaker;
a module configured to apply one of a plurality of language models to the received voice request for speech recognition based on the selected data field, wherein the speech recognition uses a different language model for each data field selected by the speaker;
a module configured to determine an identity of the speaker based, at least in part, on a user identifier;
a module configured to determine parameters of a background model repeatedly based on sampled information collected at a periodic time interval during the received voice request;
a module configured to determine parameters of a transducer model; and
a module configured to adapt a speech recognition model based on user-specific transformations corresponding to the determined identity of the speaker and on at least one of the background model or the transducer model.
Dependent claims: 10, 11, 12, 13, 14, 15, 16
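The background-model module of claim 9 re-estimates its parameters from information sampled at a periodic interval during the voice request. One way this could look is sketched below, assuming the background model is summarized by a mean and variance per sampled window; the function name `periodic_background_params` and the choice of statistics are assumptions for illustration, not details taken from the claims.

```python
from statistics import fmean, pvariance
from typing import List, Sequence, Tuple


def periodic_background_params(samples: Sequence[float],
                               sample_rate: int,
                               interval_s: float,
                               window_s: float) -> List[Tuple[float, float]]:
    """Every `interval_s` seconds, estimate (mean, variance) from a window of
    `window_s` seconds of audio, giving one parameter set per interval."""
    step = int(sample_rate * interval_s)
    width = int(sample_rate * window_s)
    if step <= 0 or width <= 0:
        raise ValueError("interval and window must cover at least one sample")
    estimates = []
    for start in range(0, len(samples) - width + 1, step):
        window = samples[start:start + width]
        estimates.append((fmean(window), pvariance(window)))
    return estimates


# Hypothetical request: one second of silence, then one second of noise,
# sampled at 4 samples per second to keep the example small.
audio = [0.0, 0.0, 0.0, 0.0, 1.0, -1.0, 1.0, -1.0]
params = periodic_background_params(audio, sample_rate=4, interval_s=1.0, window_s=1.0)
```

Each entry in `params` corresponds to one sampling instant, so a downstream adaptation step could pick up the latest estimate as the background changes during the request.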
17. A tangible computer readable medium storing a computer program having instructions for controlling a processor of a computer device to perform speech recognition, the steps comprising:
receiving a voice request from a speaker;
receiving a data field selection by the speaker;
applying one of a plurality of language models to the received voice request for speech recognition based on the selected data field, wherein the speech recognition uses a different language model for each data field selected by the speaker;
determining an identity of the speaker based, at least in part, on a user identifier;
determining parameters of a background model based on sampled information collected at a periodic time interval during the received voice request;
instructions for controlling the processor to determine parameters of a transducer model; and
instructions for controlling the processor to adapt a speech recognition model based on user-specific transformations corresponding to the determined identity of the speaker and on at least one of the background model or the transducer model.
Dependent claims: 18, 19, 20
Specification