Methods and apparatus for performing speech recognition over a network and using speech recognition results
First Claim
1. A method comprising:
- receiving speech data transmitted over a data network at a speech processing facility, the speech data associated with a user;
receiving a user identifier associated with the user transmitted over the data network at the speech processing facility;
performing a speech recognition operation at the speech processing facility, the speech recognition operation including;
retrieving a speaker dependent speech recognition model associated with the user based on the user identifier from a plurality of speaker dependent speech recognition models stored at the speech processing facility,performing speech recognition using the retrieved speaker dependent speech recognition model and the speech data, anddetermining an outcome of the speech recognition; and
transmitting the outcome over the data network.
4 Assignments
0 Petitions
Accused Products
Abstract
Techniques for generating, distributing, and using speech recognition models are described. A shared speech processing facility is used to support speech recognition for a wide variety of devices with limited capabilities including business computer systems, personal data assistants, etc., which are coupled to the speech processing facility via a communications channel, e.g., the Internet. Devices with audio capture capability record and transmit to the speech processing facility, via the Internet, digitized speech and receive speech processing services, e.g., speech recognition model generation and/or speech recognition services, in response. The Internet is used to return speech recognition models and/or information identifying recognized words or phrases. Thus, the speech processing facility can be used to provide speech recognition capabilities to devices without such capabilities and/or to augment a device'"'"'s speech processing capability. Voice dialing, telephone control and/or other services are provided by the speech processing facility in response to speech recognition results.
62 Citations
12 Claims
-
1. A method comprising:
-
receiving speech data transmitted over a data network at a speech processing facility, the speech data associated with a user; receiving a user identifier associated with the user transmitted over the data network at the speech processing facility; performing a speech recognition operation at the speech processing facility, the speech recognition operation including; retrieving a speaker dependent speech recognition model associated with the user based on the user identifier from a plurality of speaker dependent speech recognition models stored at the speech processing facility, performing speech recognition using the retrieved speaker dependent speech recognition model and the speech data, and determining an outcome of the speech recognition; and transmitting the outcome over the data network. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method, comprising:
-
receiving over a data network, at a speech processing facility connected to the data network, speech data, a text version of the speech data, and a user identifier associated with a user; generating a set of feature vectors corresponding to the speech data, the set of feature vectors including speech characteristic information; training at the speech processing facility a speaker dependent speech recognition model associated with the user using the set of feature vectors and the text version of the speech data, producing a trained speaker dependent speech recognition model; and transmitting to the user over the data network the trained speaker dependent speech recognition model. - View Dependent Claims (8, 9, 10, 11, 12)
-
Specification