Universal IP-based and scalable architectures across conversational applications using web services for speech and audio processing resources
Abstract
Systems and methods are provided for conversational computing and, in particular, for building distributed conversational applications using a Web services-based model in which speech engines (e.g., speech recognition) and audio I/O systems are programmable services that an application can program asynchronously using a standard, extensible SERCP (speech engine remote control protocol). This model provides scalable and flexible IP-based architectures that enable deployment of the same application or application development environment across a wide range of voice processing platforms and networks/gateways (e.g., PSTN (public switched telephone network), wireless, Internet, and VoIP (voice over IP)). Systems and methods are further provided for dynamically allocating, assigning, configuring and controlling speech resources such as speech engines, speech pre/post processing systems, audio subsystems, and exchanges between speech engines using SERCP in a web service-based framework.
22 Claims
1. A distributed speech processing system, comprising:
a conversational application and a task manager that abstracts from the conversational application, the discovery and remote control of audio I/O and speech engine services;
an audio I/O processing service, which is programmable by control messages generated by the task manager on behalf of the conversational application to provide audio I/O services for the conversational application; and
a speech engine service, which is programmable by control messages generated by the task manager on behalf of the conversational application to provide speech processing services for the conversational application.
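The division of roles in claim 1 can be illustrated with a minimal sketch. This is an assumption-laden illustration, not the patent's implementation: the class names, the `ControlMessage` fields, the dictionary-based registry, and the example settings (`codec`, `grammar`, and so on) are all hypothetical, and the "remote" control is a local method call.

```python
from dataclasses import dataclass, field


@dataclass
class ControlMessage:
    """Hypothetical control message; the claim does not define its fields."""
    target: str      # kind of service the message programs
    settings: dict   # configuration requested for that service


@dataclass
class ProgrammableService:
    """Stand-in for an audio I/O service or a speech engine service."""
    name: str
    config: dict = field(default_factory=dict)

    def handle(self, message: ControlMessage) -> None:
        # "Programming" the service here just means merging in new settings.
        self.config.update(message.settings)


class TaskManager:
    """Hides service discovery and control from the application."""

    def __init__(self, registry: dict):
        self._registry = registry  # service kind -> ProgrammableService

    def configure(self, kind: str, **settings) -> None:
        service = self._registry[kind]                   # discovery (a plain lookup here)
        service.handle(ControlMessage(kind, settings))   # control message on the app's behalf


class ConversationalApplication:
    """Knows only the task manager, never the services themselves."""

    def __init__(self, task_manager: TaskManager):
        self._tm = task_manager

    def start_dialog(self) -> None:
        self._tm.configure("audio_io", codec="pcm16", sample_rate_hz=8000)
        self._tm.configure("speech_engine", task="recognition", grammar="yes_no")


audio = ProgrammableService("audio_io")
engine = ProgrammableService("speech_engine")
app = ConversationalApplication(TaskManager({"audio_io": audio, "speech_engine": engine}))
app.start_dialog()
```

The application never holds a reference to either service, which is the sense in which discovery and control are abstracted from it in this sketch.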
19. A speech processing web service, comprising:
a listener for receiving and parsing control messages that are used for programming the speech processing web service, wherein the control messages are encoded using XML (eXtensible Markup Language) and exchanged using SOAP (Simple Object Access Protocol);
a business interface layer for exposing speech processing services offered by the web service, wherein the services are described and accessed using WSDL (web services description language); and
a business logic layer for providing speech processing services, the speech processing services comprising one of automatic speech recognition, speech synthesis, natural language understanding, acoustic feature extraction, audio encoding/decoding, audio recording, audio playback, and any combination thereof.
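As a rough illustration of the listener and business logic layers in claim 19, the sketch below parses an XML control message carried in a SOAP 1.1 envelope and dispatches it to a handler. The SOAP envelope namespace is the standard one; everything else is an assumption made for illustration, including the `urn:example:speech-control` namespace, the `Configure`, `Service` and `Param` elements, and the handler names. The WSDL-described interface layer is not modeled here.

```python
import xml.etree.ElementTree as ET

SOAP_NS = "http://schemas.xmlsoap.org/soap/envelope/"  # SOAP 1.1 envelope namespace
CTRL_NS = "urn:example:speech-control"                  # hypothetical control vocabulary

SAMPLE_MESSAGE = f"""<soap:Envelope xmlns:soap="{SOAP_NS}">
  <soap:Body>
    <ctl:Configure xmlns:ctl="{CTRL_NS}">
      <ctl:Service>speech_recognition</ctl:Service>
      <ctl:Param name="language">en-US</ctl:Param>
      <ctl:Param name="grammar">digits</ctl:Param>
    </ctl:Configure>
  </soap:Body>
</soap:Envelope>"""


def parse_control_message(xml_text: str):
    """Listener step: extract the target service and its parameters."""
    ns = {"soap": SOAP_NS, "ctl": CTRL_NS}
    root = ET.fromstring(xml_text)
    configure = root.find("soap:Body/ctl:Configure", ns)
    if configure is None:
        raise ValueError("no Configure element in SOAP body")
    service = configure.findtext("ctl:Service", default="", namespaces=ns)
    params = {p.get("name"): (p.text or "") for p in configure.findall("ctl:Param", ns)}
    return service, params


def configure_recognition(params: dict) -> str:
    """Business logic placeholder; a real engine would load models and grammars."""
    return f"recognition configured for {params['language']}"


# Business logic layer: maps a requested service name to its handler.
BUSINESS_LOGIC = {"speech_recognition": configure_recognition}


def listener(xml_text: str) -> str:
    service, params = parse_control_message(xml_text)
    return BUSINESS_LOGIC[service](params)


result = listener(SAMPLE_MESSAGE)
```

Keeping parsing and dispatch in separate functions mirrors the claim's split between the listener and the business logic layer, though a real service would also need to report errors back to the caller as SOAP faults.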
22. A method for providing distributed speech processing, comprising the steps of:
receiving an incoming call by a client application;
assigning the call to an application having a task manager that is abstracted from the application for discovering and controlling speech processing services including audio I/O and speech engine services;
the task manager generating a control message to a router/load manager for requesting a speech processing service on behalf of the application to service the incoming call;
the router/load manager dynamically allocating a speech processing service to the application and providing an address of the allocated speech processing service to the task manager;
the task manager generating a control message for dynamically programming the allocated speech service based on requirements of the application; and
the application processing the incoming call using the programmed speech service.
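The steps of claim 22 can be sketched as a short sequence: request a service from the router/load manager, receive an address, program the allocated service, then process the call. This is a simplified illustration under stated assumptions. The class and method names, the "first idle service" allocation policy, the example addresses, and the in-process `resolve` lookup (standing in for a network connection) are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class SpeechService:
    address: str
    busy: bool = False
    config: dict = field(default_factory=dict)

    def program(self, settings: dict) -> None:
        self.config = dict(settings)

    def process(self, call_id: str) -> str:
        return f"{call_id} handled at {self.address} with {self.config['task']}"


class RouterLoadManager:
    def __init__(self, services):
        self._services = {s.address: s for s in services}

    def allocate(self) -> str:
        """Return the address of the first idle service and mark it busy."""
        for service in self._services.values():
            if not service.busy:
                service.busy = True
                return service.address
        raise RuntimeError("no speech service available")

    def resolve(self, address: str) -> SpeechService:
        # Stand-in for opening a connection to the returned address.
        return self._services[address]


class CallTaskManager:
    def __init__(self, router: RouterLoadManager):
        self._router = router

    def acquire(self, requirements: dict) -> SpeechService:
        address = self._router.allocate()        # request sent to the router/load manager
        service = self._router.resolve(address)  # reach the service at the returned address
        service.program(requirements)            # program it per the application's needs
        return service


def handle_incoming_call(call_id: str, task_manager: CallTaskManager) -> str:
    service = task_manager.acquire({"task": "recognition", "language": "en-US"})
    return service.process(call_id)


router = RouterLoadManager([
    SpeechService("engine-a.example:5000", busy=True),
    SpeechService("engine-b.example:5000"),
])
outcome = handle_incoming_call("call-1", CallTaskManager(router))
```

Because the first service is already busy, the router hands back the second address, and the task manager programs that service before the call is processed.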
Specification