Conversational networking via transport, coding and control conversational protocols
Abstract
A system and method for implementing conversational protocols for distributed conversational networking architectures and/or distributed conversational applications, as well as real-time conversational computing between network-connected pervasive computing devices and/or servers over a computer network. The implementation of distributed conversational systems/applications according to the present invention is based, in part, on suitably defined conversational coding, transport and control protocols. The control protocols include session control protocols, protocols for exchanging speech meta-information, and speech engine remote control protocols.
152 Citations
18 Claims
1. A computer readable medium embodying instructions executable by a processor to perform a method for distributed speech processing, the method comprising:
- executing an application by a client; and
- executing audio processing engines by a server, wherein the audio processing engines are asynchronously programmed by the application to process an audio stream to perform an audio processing function, and wherein the audio processing engines process the audio stream by exchanging audio and control messages between the audio processing engines in an audio processing control flow that is decoupled from, and independent of, application control and application level exchanges, wherein the server is a speech rendering browser for receiving the audio stream from the client, performing the audio processing function for determining a resulting audio stream, and returning the resulting audio stream for playback by the client.
Dependent claims: 2, 3, 4, 5, 6
7. A distributed speech processing system, comprising:
- a client computing device executing a conversational application; and
- a server device in communication with the client computing device over a network, the server device providing an audio I/O processing Web service, which is programmable by control messages generated by the conversational application to provide audio I/O services for the conversational application and a speech engine Web service, which is programmable by control messages generated by the conversational application to provide speech processing services for the conversational application, wherein the audio I/O and speech processing Web services are programmed to perform audio processing tasks specified by the control messages but wherein the audio I/O and speech processing Web services perform said audio processing tasks by executing a flow control that is decoupled and independent from control of the conversational application.
Dependent claims: 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18
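The decoupling recited in claims 1 and 7 can be illustrated with a minimal Python sketch. It is not taken from the specification: the names Engine, ControlMessage, END, serve and run_demo, the gain and clip engines, and the use of asyncio queues are all assumptions made for illustration. The application only enqueues control messages that program the engines; the engines then pass audio frames and an end-of-stream control message among themselves without further involvement from the application, and the server returns the resulting audio stream.

```python
import asyncio
from dataclasses import dataclass

END = object()  # hypothetical end-of-stream control message passed between engines


@dataclass
class ControlMessage:
    """Hypothetical programming message sent by the client application."""
    target: str   # name of the engine to program
    params: dict  # settings the engine applies when it next handles input


class Engine:
    """One server-side audio processing engine (illustrative only)."""

    def __init__(self, name, transform):
        self.name = name
        self.transform = transform
        self.params = {}
        self.control = asyncio.Queue()  # programming messages from the application
        self.inbox = asyncio.Queue()    # audio frames and END from upstream
        self.downstream = None          # queue of the next engine, or the output

    async def run(self):
        while True:
            item = await self.inbox.get()
            # Apply whatever programming has arrived; the application never
            # waits for this and takes no part in the frame-by-frame flow.
            while not self.control.empty():
                self.params.update(self.control.get_nowait().params)
            if item is END:
                await self.downstream.put(END)  # engine-to-engine control message
                return
            await self.downstream.put(self.transform(item, self.params))


def apply_gain(frame, params):
    return [s * params.get("factor", 1) for s in frame]


def apply_clip(frame, params):
    limit = params.get("limit", 100)
    return [max(-limit, min(limit, s)) for s in frame]


async def serve(audio_in, programs):
    """Receive an audio stream, run it through the engines, return the result."""
    gain, clip = Engine("gain", apply_gain), Engine("clip", apply_clip)
    out = asyncio.Queue()
    gain.downstream, clip.downstream = clip.inbox, out
    engines = {e.name: e for e in (gain, clip)}
    for msg in programs:  # asynchronous programming: enqueue and move on
        engines[msg.target].control.put_nowait(msg)
    tasks = [asyncio.create_task(e.run()) for e in engines.values()]
    for frame in audio_in:
        await gain.inbox.put(frame)
    await gain.inbox.put(END)
    await asyncio.gather(*tasks)
    result = []
    while (item := out.get_nowait()) is not END:
        result.append(item)
    return result


def run_demo():
    programs = [ControlMessage("gain", {"factor": 2}),
                ControlMessage("clip", {"limit": 5})]
    return asyncio.run(serve([[1, 2], [3, 4]], programs))
```

Calling run_demo() programs the two engines once and then streams two frames through them. Queues stand in here for the network transport and control protocols that the claims leave to the specification; a real deployment would replace them with whatever protocols the system actually uses.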
Specification