Client-server speech recognition for altering processing time based on a value communicated between client and server
First Claim
1. A system supporting speech recognition comprising:
- two or more clients, each client comprising the capability to receive audio speech from a user, store the audio speech in a first set of one or more buffers in a raw uncompressed audio format, each buffer comprising a portion of the received audio speech, write the stored audio speech from a first buffer in the first set of buffers to a second buffer in a second set of one or more buffers, encode the stored audio speech in the second buffer before all of the audio speech is received, package the encoded audio speech from the second buffer into one or more packets to be transmitted over the Internet before all of the audio speech is received, and transmit a packet of encoded audio speech over the Internet before all of the audio speech is received; and
a server, the server comprising the capability to receive packets of encoded audio speech from the two or more clients, decode each of the packets of audio speech and store the resultant raw speech into one or more buffers for the two or more clients, and evaluate the resultant raw speech received from each of the two or more clients,wherein the server further comprises the capability to transmit a response to a client of the two or more clients, the response a result of the server'"'"'s evaluation of the resultant raw speech received from the client, and the server alters a processing time used to evaluate the resultant raw speech based on a value communicated between the client and the server, and the client further comprises the capability to receive the response from the server.
3 Assignments
0 Petitions
Accused Products
Abstract
Methods and systems for handling speech recognition processing in effectively real-time, via the Internet, in order that users do not experience noticeable delays from the start of an exercise until they receive responsive feedback. A user uses a client to access the Internet and a server supporting speech recognition processing, e.g., for language learning activities. The user inputs speech to the client, which transmits the user speech to the server in approximate real-time. The server evaluates the user speech in context of the current speech recognition exercise being executed, and provides responsive feedback to the client, again, in approximate real-time, with minimum latency delays. The client upon receiving responsive feedback from the server, displays, or otherwise provides, the feedback to the user.
-
Citations
20 Claims
-
1. A system supporting speech recognition comprising:
-
two or more clients, each client comprising the capability to receive audio speech from a user, store the audio speech in a first set of one or more buffers in a raw uncompressed audio format, each buffer comprising a portion of the received audio speech, write the stored audio speech from a first buffer in the first set of buffers to a second buffer in a second set of one or more buffers, encode the stored audio speech in the second buffer before all of the audio speech is received, package the encoded audio speech from the second buffer into one or more packets to be transmitted over the Internet before all of the audio speech is received, and transmit a packet of encoded audio speech over the Internet before all of the audio speech is received; and a server, the server comprising the capability to receive packets of encoded audio speech from the two or more clients, decode each of the packets of audio speech and store the resultant raw speech into one or more buffers for the two or more clients, and evaluate the resultant raw speech received from each of the two or more clients, wherein the server further comprises the capability to transmit a response to a client of the two or more clients, the response a result of the server'"'"'s evaluation of the resultant raw speech received from the client, and the server alters a processing time used to evaluate the resultant raw speech based on a value communicated between the client and the server, and the client further comprises the capability to receive the response from the server. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A system supporting speech recognition comprising:
-
at least one client, the client comprising the capability to receive audio speech from a user, store the audio speech in a first set of one or more buffers in a raw uncompressed audio format, each buffer comprising a portion of the received audio speech, write the stored audio speech from a first buffer in the first set of buffers to a second buffer in a second set of one or more buffers, encode the stored audio speech in the second buffer before all of the audio speech is received, package the encoded audio speech from the second buffer into one or more packets to be transmitted over the Internet before all of the audio speech is received, and transmit a packet of encoded audio speech over the Internet before all of the audio speech is received; and a server, the server comprising the capability to receive packets of encoded audio speech from the client, decode each of the packets of audio speech and store the resultant raw speech into one or more buffers for the client, and evaluate the resultant raw speech received from the client, wherein the server further comprises the capability to transmit a response to the client, the response a result of the server'"'"'s evaluation of the resultant raw speech received from the client, and the server alters a processing time used to evaluate the resultant raw speech based on a value communicated between the client and the server over the Internet, and the client further comprises the capability to receive the response from the server transmitted over the Internet. - View Dependent Claims (19, 20)
-
Specification