Computer implemented methods and apparatus for selectively interacting with a server to build a local database for speech recognition at a device
First Claim
1. A method for selectively interacting with a server to build a local database for speech recognition at a computing device, the method comprising:
- maintaining, at a computing device associated with a user, a local database comprising a plurality of audio samples, each audio sample being identified in association with;
any one of a plurality of voice command text files, each voice command text file being configured to store a text string of the audio sample, the text string being a transcription of a vocalization in the audio sample,any one of a plurality of different applications, the plurality of different applications being executable at the computing device, andany one of a plurality of different application command files, each application command file corresponding to a single respective application and associating at least one executable action to be performed within the application with at least one voice command text file;
receiving, at the computing device, a first audio command;
determining, using a local speech recognition algorithm at the computing device, that the first audio command does not match any of the plurality of audio samples of the local database within a margin of error;
transmitting, responsive to the determining step, the first audio command from the computing device to a remote server for detection of one or more voice command text files associated with the first audio command;
receiving, at the computing device from the remote server, the one or more detected voice command text files associated with the first audio command;
identifying an application at the computing device in relation to the one or more detected voice command text files, the identifying comprising analyzing a plurality of application command files at the computing device to locate an application command file matching the detected voice command text to the application; and
updating, at the computing device, the local database to;
include the first audio command in the plurality of audio samples of the local database,associate the first audio command with the identified application, andassociate the first audio command with an application command file corresponding to the identified application.
1 Assignment
0 Petitions
Accused Products
Abstract
Disclosed are methods, apparatus, systems, and computer-readable storage media for selectively interacting with a server to build a local dictation database for speech recognition at a device. In some implementations, a computing device receives an audio sample. The computing device may determine that the received audio sample does not match any of one or more existing audio samples stored in the local dictation database of the computing device. The received audio sample may be transmitted to a remote server for detection of one or more words indicated by the received audio sample. The computing device may receive data identifying the one or more words, and update the local dictation database to store the received audio sample in association with the one or more words.
-
Citations
16 Claims
-
1. A method for selectively interacting with a server to build a local database for speech recognition at a computing device, the method comprising:
-
maintaining, at a computing device associated with a user, a local database comprising a plurality of audio samples, each audio sample being identified in association with; any one of a plurality of voice command text files, each voice command text file being configured to store a text string of the audio sample, the text string being a transcription of a vocalization in the audio sample, any one of a plurality of different applications, the plurality of different applications being executable at the computing device, and any one of a plurality of different application command files, each application command file corresponding to a single respective application and associating at least one executable action to be performed within the application with at least one voice command text file; receiving, at the computing device, a first audio command; determining, using a local speech recognition algorithm at the computing device, that the first audio command does not match any of the plurality of audio samples of the local database within a margin of error; transmitting, responsive to the determining step, the first audio command from the computing device to a remote server for detection of one or more voice command text files associated with the first audio command; receiving, at the computing device from the remote server, the one or more detected voice command text files associated with the first audio command; identifying an application at the computing device in relation to the one or more detected voice command text files, the identifying comprising analyzing a plurality of application command files at the computing device to locate an application command file matching the detected voice command text to the application; and updating, at the computing device, the local database to; include the first audio command in the plurality of audio samples of the local database, associate the first audio command with the identified application, and associate the first audio command with an application command file corresponding to the identified application. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A non-transitory computer-readable storage medium storing program code executable by one or more processors for selectively interacting with a server to build a local database for speech recognition at a device, the program code comprising instructions configured to cause:
-
maintaining, at a computing device associated with a user, a local database comprising a plurality of audio samples, each audio sample being identified in association with; any one of a plurality of voice command text files, each voice command text file being configured to store a text string of the audio sample, the text string being a transcription of a vocalization in the audio sample, any one of a plurality of different applications, the plurality of different applications being executable at the computing device, and any one of a plurality of different application command files, each application command file corresponding to a single respective application and associating at least one executable action to be performed within the application with at least one voice command text file; receiving, at the computing device, a first audio command; determining, using a local speech recognition algorithm at the computing device, that the first audio command does not match any of the plurality of audio samples of the local database within a margin of error; transmitting, responsive to the determining step, the first audio command from the computing device to a remote server for detection of one or more voice command text files associated with the first audio command; receiving, at the computing device from the remote server, the one or more detected voice command text files associated with the first audio command; identifying an application at the computing device in relation to the one or more detected voice command text files, the identifying comprising analyzing a plurality of application command files at the computing device to locate an application command file matching the detected voice command text to the application; and updating, at the computing device, the local database to; include the first audio command in the plurality of audio samples of the local database, associate the first audio command with the identified application, and associate the first audio command with an application command file corresponding to the identified application. - View Dependent Claims (13, 14)
-
-
15. One or more computing devices for selectively interacting with a server to build a local database for speech recognition at a device, the one or more computing devices comprising:
one or more processors configured to cause; maintaining, at a computing device associated with a user, a local database comprising a plurality of audio samples, each audio sample being identified in association with; any one of a plurality of voice command text files, each voice command text file being configured to store a text string of the audio sample, the text string being a transcription of a vocalization in the audio sample, any one of a plurality of different applications, the plurality of different applications being executable at the computing device, and any one of a plurality of different application command files, each application command file corresponding to a single respective application and associating at least one executable action to be performed within the application with at least one voice command text file; receiving, at the computing device, a first audio command; determining, using a local speech recognition algorithm at the computing device, that the first audio command does not match any of the plurality of audio samples of the local database within a margin of error; transmitting, responsive to the determining step, the first audio command from the computing device to a remote server for detection of one or more voice command text files associated with the first audio command; receiving, at the computing device from the remote server, the one or more detected voice command text files associated with the first audio command; identifying an application at the computing device in relation to the one or more detected voice command text files, the identifying comprising analyzing a plurality of application command files at the computing device to locate an application command file matching the detected voice command text to the application; and updating, at the computing device, the local database to; include the first audio command in the plurality of audio samples of the local database, associate the first audio command with the identified application, and associate the first audio command with an application command file corresponding to the identified application. - View Dependent Claims (16)
Specification