Client-server speech recognition with processing level based on value received from client
Abstract
Methods and systems for handling speech recognition processing in effectively real time, via the Internet, so that users do not experience noticeable delays from the start until they receive responsive feedback. A user uses a client to access the Internet and a server supporting speech recognition processing. The user inputs speech to the client, which transmits the user speech to the server in approximately real time. The server evaluates the user speech using a processing level based on the value received from each client.
20 Claims
1. A method comprising:
at a first client comprising a capability to receive audio speech from a user, after receiving audio speech, encoding the received audio speech;
from the first client, after encoding the received audio speech, transmitting packets of encoded audio speech along with a first value over the Internet;
at a second client comprising a capability to receive audio speech from a user, after receiving audio speech, encoding the received audio speech;
from the second client, after encoding the received audio speech, transmitting packets of encoded audio speech along with a second value over the Internet;
at a server, receiving packets of encoded audio speech from two or more clients; and
at the server, servicing the encoded audio speech in an amount of processing time based on the first value received from the first client and the second value received from the second client, wherein the server comprises the capability to transmit responses to the first and second clients, and the response is a result of the server's servicing of the encoded audio speech based on the first and second values received from the clients.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
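The final step of claim 1 ties the server's processing time to the values sent by the clients, but the claim does not say how a value maps to time. The following is a minimal sketch under one assumed rule: a fixed time budget is split in proportion to each client's value. The names `Request`, `allocate_processing_time`, and `total_budget_ms` are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass
class Request:
    client_id: str
    value: int            # value the client sent along with its audio packets
    encoded_audio: bytes  # encoded speech payload (unused by this sketch)


def allocate_processing_time(requests, total_budget_ms):
    """Split a time budget across clients in proportion to their values.

    Proportional sharing is an assumption made for illustration only.
    """
    total = sum(r.value for r in requests)
    if total <= 0:
        raise ValueError("at least one client value must be positive")
    return {r.client_id: total_budget_ms * r.value / total for r in requests}


requests = [
    Request("client-1", 3, b"..."),
    Request("client-2", 1, b"..."),
]
budgets = allocate_processing_time(requests, total_budget_ms=400)
```

Under this assumed rule, a client that sends a larger value receives a larger share of the server's time; other mappings (thresholds, tiers, caps) would fit the claim language equally well.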
9. A method of on-line speech recognition comprising:
at a first client, after receiving audio speech from a user, encoding the audio speech before all of the audio speech is received;
at the first client comprising the capability to receive audio speech from a user in an uncompressed audio format, storing the encoded speech in one or more buffers;
at the first client, packaging the encoded audio speech into one or more packets to be transmitted over the Internet before all of the audio speech is received;
from the first client, transmitting a packet of encoded audio speech along with a first value over the Internet before all of the audio speech is received;
at a second client comprising the capability to receive audio speech from a user in an uncompressed audio format, after receiving audio speech from a user, encoding the audio speech before all of the audio speech is received;
at the second client, storing the encoded speech in one or more buffers;
at the second client, packaging the encoded audio speech into one or more packets to be transmitted over the Internet before all of the audio speech is received;
from the second client, transmitting a packet of encoded audio speech along with a second value over the Internet before all of the audio speech is received;
providing a server, the server comprising the capability to receive packets of encoded audio speech from two or more clients;
at the server, decoding each of the packets of audio speech and storing the resultant speech into one or more buffers for the respective client; and
at the server, evaluating the resultant speech received from each of the two or more clients, wherein the server comprises the capability to transmit a response to the first and second clients, and the response is a result of the server's evaluating of the resultant speech in an amount of processing time based on the first and second values received from the clients.
Dependent claims: 10, 11, 12, 13, 14, 15, 16, 17, 18
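The client-side steps of claim 9 (encode, package, and transmit before all of the speech has been received) can be sketched as a generator that emits one packet per audio chunk as soon as that chunk is available. This is an illustrative sketch only: `zlib` stands in for a speech codec, and the header layout (`HEADER`), `stream_packets`, and `parse_packet` are hypothetical names and formats that the patent does not specify.

```python
import struct
import zlib
from typing import Iterable, Iterator, Tuple

# Hypothetical packet header: client value, sequence number, payload length,
# all in network byte order.
HEADER = struct.Struct("!HIH")


def stream_packets(pcm_chunks: Iterable[bytes], value: int) -> Iterator[bytes]:
    """Yield one packet per chunk without waiting for the full utterance."""
    for seq, chunk in enumerate(pcm_chunks):
        encoded = zlib.compress(chunk)  # stand-in for a real speech encoder
        yield HEADER.pack(value, seq, len(encoded)) + encoded


def parse_packet(packet: bytes) -> Tuple[int, int, bytes]:
    """Recover the client value, sequence number, and decoded audio."""
    value, seq, length = HEADER.unpack_from(packet)
    payload = packet[HEADER.size:HEADER.size + length]
    return value, seq, zlib.decompress(payload)


# Two short chunks of uncompressed audio bytes (placeholder data).
chunks = [b"\x00\x01" * 160, b"\x02\x03" * 160]
packets = list(stream_packets(chunks, value=2))
```

Because `stream_packets` is lazy, each packet carries the client's value and can be sent while later chunks are still being captured, which is the property the claim's "before all of the audio speech is received" language describes.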
19. A computer program product stored in a non-transitory storage medium providing program instructions comprising:
first executable code executing at two or more clients, the first executable code enabling each client the capability to receive audio speech from a user, store the audio speech in one or more buffers in an uncompressed audio format, each buffer comprising a portion of the received audio speech, encode the stored audio speech in the one or more buffers, package the encoded audio speech into one or more packets to be transmitted over the Internet, and transmit a packet of encoded audio speech along with a respective value for each client over the Internet; and
second executable code executing at a server, the executable code enabling the server the capability to receive packets of encoded audio speech from the two or more clients, decode each of the packets of audio speech and store the resultant speech into one or more buffers for the respective client, and evaluate the resultant speech received from each of the two or more clients in an amount of processing time based on the respective value received from a respective client, and the server has the capability to transmit a response to a client based on the evaluated speech.
Dependent claims: 20
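The server-side code in claim 19 decodes each packet, keeps the resulting speech in buffers kept separately for each client, and responds based on what it evaluated. A minimal sketch of that bookkeeping follows. The class `SpeechServer` and its methods are hypothetical, `zlib` again stands in for a speech decoder, and the "evaluation" is a placeholder that only counts bytes rather than recognizing speech.

```python
import zlib
from collections import defaultdict


class SpeechServer:
    """Toy server that keeps one decoded-speech buffer per client."""

    def __init__(self):
        self.buffers = defaultdict(bytearray)  # decoded speech per client
        self.values = {}                       # latest value seen per client

    def receive(self, client_id: str, value: int, encoded: bytes) -> None:
        # Decode the packet payload and append it to that client's buffer.
        self.buffers[client_id] += zlib.decompress(encoded)
        self.values[client_id] = value

    def respond(self, client_id: str) -> dict:
        # Placeholder evaluation: report how much speech was buffered.
        speech = bytes(self.buffers.pop(client_id, b""))
        return {
            "client": client_id,
            "value": self.values.get(client_id),
            "bytes_evaluated": len(speech),
        }


server = SpeechServer()
server.receive("client-1", 3, zlib.compress(b"ab"))
server.receive("client-1", 3, zlib.compress(b"cd"))
server.receive("client-2", 1, zlib.compress(b"xyz"))
reply = server.respond("client-1")
```

Keeping the buffers keyed by client means packets from different clients can arrive interleaved without mixing their audio; a fuller version would use the stored value to decide how much processing time each client's buffer receives.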
Specification