Methods and apparatus for performing speech recognition over a network and using speech recognition results
4 Assignments
0 Petitions
Accused Products
Abstract
Techniques for generating, distributing, and using speech recognition models are described. A shared speech processing facility is used to support speech recognition for a wide variety of devices with limited capabilities including business computer systems, personal data assistants, etc., which are coupled to the speech processing facility via a communications channel, e.g., the Internet. Devices with audio capture capability record and transmit to the speech processing facility, via the Internet, digitized speech and receive speech processing services, e.g., speech recognition model generation and/or speech recognition services, in response. The Internet is used to return speech recognition models and/or information identifying recognized words or phrases. Thus, the speech processing facility can be used to provide speech recognition capabilities to devices without such capabilities and/or to augment a device'"'"'s speech processing capability. Voice dialing, telephone control and/or other services are provided by the speech processing facility in response to speech recognition results.
110 Citations
48 Claims
-
1-30. -30. (canceled)
-
31. A method comprising:
-
receiving speech data transmitted over a data network at a speech processing facility, the speech data associated with a user;
receiving a user identifier associated with the user transmitted over the data network at the speech processing facility;
performing a speech recognition operation at the speech processing facility, the speech recognition operation including-retrieving a speaker dependent speech recognition model associated with a user based on the user identifier from a plurality of speaker dependent speech recognition models stored at the speech processing facility, performing speech recognition using the retrieved speaker dependent speech recognition model and the speech data, determining an outcome of the speech recognition;
transmitting the outcome over the data network. - View Dependent Claims (32, 33, 34, 35, 36)
-
-
37. A method, comprising:
-
receiving over a data network, at a speech processing facility connected to the data network, speech data, a text version of the speech data, and a user identifier associated with a user;
generating a set of feature vectors corresponding to the speech data, the set of feature vectors including speech characteristic information;
training at the speech processing facility a speaker dependent speech recognition model associated with the user using the set of feature vectors and the text version of the speech data, producing a trained speaker dependent speech recognition model;
transmitting to the user over the data network the trained speaker dependent speech recognition model. - View Dependent Claims (38, 39, 40, 41, 42)
-
-
43. A method, comprising:
-
receiving an updated speech recognition model over a data network from a remote speech processing facility;
replacing an existing speech recognition model with the updated speech recognition model in a local memory store;
receiving speech data associated with a user;
retrieving the updated speech recognition model from the memory store;
performing speech recognition using the speech data and the updated speech recognition model;
performing an operation based on an outcome of the speech recognition;
- View Dependent Claims (44, 45, 46, 47, 48)
-
Specification