Interactive voice recognition and response over the internet
First Claim
1. A system supporting speech recognition comprising:
- two or more clients, each client comprising the capability to receive audio speech from a user, store the audio speech in one or more buffers in an uncompressed audio format, each buffer comprising a portion of the received audio speech, encode the stored audio speech in the one or more buffers before all of the audio speech is received, package the encoded audio speech into one or more packets to be transmitted over the Internet before all of the audio speech is received, and transmit a packet of encoded audio speech over the Internet before all of the audio speech is received;
a server, the server comprising the capability to receive packets of encoded audio speech from at least two clients, decode each of the packets of audio speech and store the resultant speech into one or more buffers for the respective client, and evaluate the resultant speech received from each of the at least two clients,wherein the server further comprises the capability to transmit a response to a client, the response a result of the server'"'"'s evaluation of the resultant speech received from the client, and the server alters a processing time used to evaluate the resultant speech based on a value communicated between the client and the server, anda client of the two or more clients further comprises the capability to receive the response from the server.
3 Assignments
0 Petitions
Accused Products
Abstract
Methods and systems for handling speech recognition processing in effectively real-time, via the Internet, in order that users do not experience noticeable delays from the start until they receive responsive feedback. A user uses a client to access the Internet and a server supporting speech recognition processing. The user inputs speech to the client, which transmits the user speech to the server in approximate real-time. The server evaluates the user speech, and provides responsive feedback to the client, again, in approximate real-time, with minimum latency delays. The client upon receiving responsive feedback from the server, displays, or otherwise provides, the feedback to the user.
65 Citations
26 Claims
-
1. A system supporting speech recognition comprising:
-
two or more clients, each client comprising the capability to receive audio speech from a user, store the audio speech in one or more buffers in an uncompressed audio format, each buffer comprising a portion of the received audio speech, encode the stored audio speech in the one or more buffers before all of the audio speech is received, package the encoded audio speech into one or more packets to be transmitted over the Internet before all of the audio speech is received, and transmit a packet of encoded audio speech over the Internet before all of the audio speech is received; a server, the server comprising the capability to receive packets of encoded audio speech from at least two clients, decode each of the packets of audio speech and store the resultant speech into one or more buffers for the respective client, and evaluate the resultant speech received from each of the at least two clients, wherein the server further comprises the capability to transmit a response to a client, the response a result of the server'"'"'s evaluation of the resultant speech received from the client, and the server alters a processing time used to evaluate the resultant speech based on a value communicated between the client and the server, and a client of the two or more clients further comprises the capability to receive the response from the server. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 26)
-
-
14. A system supporting speech recognition comprising:
-
two or more clients, each client comprising the capability to receive audio speech from a user in an uncompressed audio format, encode the audio speech before all of the audio speech is received and storing the encoded speech in one or more buffers, package the encoded audio speech into one or more packets to be transmitted over the Internet before all of the audio speech is received, and transmit a packet of encoded audio speech over the Internet before all of the audio speech is received; and a server, the server comprising the capability to receive packets of encoded audio speech from at least two clients, decode each of the packets of audio speech and store the resultant speech into one or more buffers for the respective client, and evaluate the resultant speech received from each of the at least two clients, wherein the server further comprises the capability to transmit a response to a client, the response a result of the server'"'"'s evaluation of the resultant speech received from the client, and the server alters a processing time used to evaluate the resultant speech based on a value communicated between the client and the server, and a client of the two or more clients further comprises the capability to receive the response from the server. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25)
-
Specification