Hosted voice recognition system for wireless devices
First Claim
1. A computer-implemented method comprising:
- under control of a first computing device executing specific computer-executable instructions,receiving a first portion of audio input captured via a microphone;
in response to receiving the first portion of the audio input, transmitting to a second computing device, first data representing the first portion of the audio input;
receiving a next portion of the audio input, the next portion captured via the microphone directly following the first portion of the audio input;
in response to receiving the next portion of the audio input, transmitting to the second computing device, next data representing the next portion of the audio input;
receiving from the second computing device, first partial speech recognition results determined from the first data representing the first portion of the audio input,wherein the first partial speech recognition results are received prior to the transmitting of the next data;
receiving from the second computing device, next partial speech recognition results determined from the next data representing the next portion of the audio input; and
initiating presentation, on a display of the first computing device, of the first partial speech recognition results,wherein the presentation of the first partial speech recognition results on the display of the first computing device is initiated by the first computing device prior to the receiving of the next partial speech recognition results.
1 Assignment
0 Petitions
Accused Products
Abstract
Methods, systems, and software for converting the audio input of a user of a handheld client device or mobile phone into a textual representation by means of a backend server accessed by the device through a communications network. The text is then inserted into or used by an application of the client device to send a text message, instant message, email, or to insert a request into a web-based application or service. In one embodiment, the method includes the steps of initializing or launching the application on the device; recording and transmitting the recorded audio message from the client device to the backend server through a client-server communication protocol; converting the transmitted audio message into the textual representation in the backend server; and sending the converted text message back to the client device or forwarding it on to an alternate destination directly from the server.
255 Citations
20 Claims
-
1. A computer-implemented method comprising:
under control of a first computing device executing specific computer-executable instructions, receiving a first portion of audio input captured via a microphone; in response to receiving the first portion of the audio input, transmitting to a second computing device, first data representing the first portion of the audio input; receiving a next portion of the audio input, the next portion captured via the microphone directly following the first portion of the audio input; in response to receiving the next portion of the audio input, transmitting to the second computing device, next data representing the next portion of the audio input; receiving from the second computing device, first partial speech recognition results determined from the first data representing the first portion of the audio input, wherein the first partial speech recognition results are received prior to the transmitting of the next data; receiving from the second computing device, next partial speech recognition results determined from the next data representing the next portion of the audio input; and initiating presentation, on a display of the first computing device, of the first partial speech recognition results, wherein the presentation of the first partial speech recognition results on the display of the first computing device is initiated by the first computing device prior to the receiving of the next partial speech recognition results. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
8. A computer-readable, non-transitory storage medium storing computer executable instructions that, when executed by a first computing device, configure the first computing device to perform operations comprising:
-
receiving a first portion of audio input captured via a microphone of the first computing device; in direct response to receiving the first portion of the audio input, transmitting to a second computing device, first data representing the first portion of the audio input; receiving a next portion of the audio input that directly follows the first portion of the audio input, the next portion captured via the microphone; in direct response to receiving the next portion of the audio input, transmitting to the second computing device, next data representing the next portion of the audio input; receiving from the second computing device, first partial speech recognition results determined from the first data representing the first portion of the audio input, wherein the first partial speech recognition results are received prior to the transmitting of the next data; receiving from the second computing device, next partial speech recognition results determined from the next data representing the next portion of the audio input; and initiating display of the first partial speech recognition results on a display of the first computing device, wherein the display of the first partial speech recognition results on the display of the first computing device is initiated by the first computing device prior to the receiving of the next partial speech recognition results. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A system comprising:
-
an electronic data store configured to at least store computer-executable instructions; and a second computing device including at least one processor, the second computing device in communication with the electronic data store and configured to execute the computer-executable instructions to at least; receive, from a first computing device, first data representing a first portion of an audio input; in response to receiving the first data, determine first partial speech recognition results from the first data; receive, from the first computing device, next data representing a next portion of the audio input that directly follows the first portion of the audio input; in response to receiving the next data, determine next partial speech recognition results from the next data; prior to determining the next partial speech recognition results from the next data, transmit the first partial speech recognition results to the first computing device, for display by the first computing device; and transmit the next partial speech recognition results to the first computing device for display by the first computing device, wherein, at the first computing device, the display of the first partial speech recognition results by the first computing device is initiated prior to the display of the next partial speech recognition results. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification