Methods and apparatus for generating, updating and distributing speech recognition models
First Claim
1. A computer-implemented method comprising:
- receiving, from a mobile device, a request to update at least a portion of an existing speech recognition model corresponding to a particular term, the request including (i) a recording of a user of the mobile device speaking the particular term, or (ii) a set of features extracted from the recording of the user of the mobile device speaking the particular term;
determining a speech recognition model type of the existing speech recognition model, wherein the speech recognition model type comprises a speaker-dependent speech recognition model type or a speaker-independent speech recognition model type;
in response to determining the speech recognition model type of the existing speech recognition model, generating an updated speech recognition model of the determined speech recognition model type of the existing speech recognition model; and
transmitting the updated speech recognition model corresponding to the particular term to the mobile device, as a replacement for the speech recognition model corresponding to the particular term.
4 Assignments
0 Petitions
Accused Products
Abstract
Techniques for generating, distributing, and using speech recognition models are described. A shared speech processing facility is used to support speech recognition for a wide variety of devices with limited capabilities including business computer systems, personal data assistants, etc., which are coupled to the speech processing facility via a communications channel, e.g., the Internet. Devices with audio capture capability record and transmit to the speech processing facility, via the Internet, digitized speech and receive speech processing services, e.g., speech recognition model generation and/or speech recognition services, in response. The Internet is used to return speech recognition models and/or information identifying recognized words or phrases. Thus, the speech processing facility can be used to provide speech recognition capabilities to devices without such capabilities and/or to augment a device'"'"'s speech processing capability. Voice dialing, telephone control and/or other services are provided by the speech processing facility in response to speech recognition results.
39 Citations
14 Claims
-
1. A computer-implemented method comprising:
-
receiving, from a mobile device, a request to update at least a portion of an existing speech recognition model corresponding to a particular term, the request including (i) a recording of a user of the mobile device speaking the particular term, or (ii) a set of features extracted from the recording of the user of the mobile device speaking the particular term; determining a speech recognition model type of the existing speech recognition model, wherein the speech recognition model type comprises a speaker-dependent speech recognition model type or a speaker-independent speech recognition model type; in response to determining the speech recognition model type of the existing speech recognition model, generating an updated speech recognition model of the determined speech recognition model type of the existing speech recognition model; and transmitting the updated speech recognition model corresponding to the particular term to the mobile device, as a replacement for the speech recognition model corresponding to the particular term. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A system comprising:
-
one or more computers; and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; receiving, from a mobile device, a request to update at least a portion of an existing speech recognition model corresponding to a particular term, the request including (i) a recording of a user of the mobile device speaking the particular term, or (ii) a set of features extracted from the recording of the user of the mobile device speaking the particular term; determining a speech recognition model type of the existing speech recognition model, wherein the speech recognition model type comprises a speaker-dependent speech recognition model type or a speaker-independent speech recognition model type; in response to determining the speech recognition model type of the existing speech recognition model, generating an updated speech recognition model of the determined speech recognition model type of the existing speech recognition model; and transmitting the updated speech recognition model corresponding to the particular term to the mobile device, as a replacement for the speech recognition model corresponding to the particular term. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A non-transitory computer-readable medium storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
-
receiving, from a mobile device, a request to update at least a portion of an existing speech recognition model corresponding to a particular term, the request including (i) a recording of a user of the mobile device speaking the particular term, or (ii) a set of features extracted from the recording of the user of the mobile device speaking the particular term; determining a speech recognition model type of the existing speech recognition model, wherein the speech recognition model type comprises a speaker-dependent speech recognition model type or a speaker-independent speech recognition model type; in response to determining the speech recognition model type of the existing speech recognition model, generating an updated speech recognition model of the determined speech recognition model type of the existing speech recognition model; and transmitting the updated speech recognition model corresponding to the particular term to the mobile device, as a replacement for the speech recognition model corresponding to the particular term. - View Dependent Claims (12, 13, 14)
-
Specification