Scalable low resource dialog manager
Abstract
A spoken language interface between a user and at least one application or system includes a dialog manager operatively coupled to the application or system, an audio input system, an audio output system, a speech decoding engine and a speech synthesizing engine; and at least one user interface data set operatively coupled to the dialog manager, the user interface data set representing spoken language interface elements and data recognizable by the application. The dialog manager enables connection between the input audio system and the speech decoding engine such that a spoken utterance provided by the user is provided from the input audio system to the speech decoding engine. The speech decoding engine decodes the spoken utterance to generate a decoded output which is returned to the dialog manager. The dialog manager uses the decoded output to search the user interface data set for a corresponding spoken language interface element and data, which is returned to the dialog manager when found, and provides the spoken language interface element associated data to the application for processing in accordance therewith. The application, on processing that element, provides a reference to an interface element to be spoken. The dialog manager enables connection between the audio output system and the speech synthesizing engine such that the speech synthesizing engine, accepting data from that element, generates a synthesized output that expresses that element, and the audio output system audibly presents the synthesized output to the user.
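The flow described in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patented implementation: the class and field names (`InterfaceElement`, `DialogManager`, `spoken_form`, `app_data`, `prompt_text`), the dictionary layout of the user interface data set, and the stand-in decoder, application and synthesizer functions are all hypothetical and do not come from the patent text.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass(frozen=True)
class InterfaceElement:
    """One entry of the user interface data set (hypothetical layout)."""
    spoken_form: str   # text the decoder is expected to return, "" if output-only
    app_data: str      # data recognizable by the application
    prompt_text: str   # text handed to the synthesizer when the element is spoken


class DialogManager:
    """Routes one utterance through decode -> search -> application -> synthesis."""

    def __init__(
        self,
        ui_data: Dict[str, InterfaceElement],
        decode: Callable[[bytes], str],
        application: Callable[[str], str],
        synthesize: Callable[[str], str],
        play: Callable[[str], None],
    ) -> None:
        self.ui_data = ui_data
        self.decode = decode
        self.application = application
        self.synthesize = synthesize
        self.play = play

    def _search(self, decoded: str) -> Optional[InterfaceElement]:
        # Search the data set for the element matching the decoded output.
        wanted = decoded.strip().lower()
        for element in self.ui_data.values():
            if wanted and element.spoken_form.lower() == wanted:
                return element
        return None

    def handle_utterance(self, audio: bytes) -> Optional[str]:
        decoded = self.decode(audio)                      # audio in -> decoded output
        element = self._search(decoded)                   # look up interface element
        if element is None:
            return None                                   # nothing found, nothing spoken
        reply_ref = self.application(element.app_data)    # app returns a reference
        reply = self.ui_data[reply_ref]                   # element to be spoken
        synthesized = self.synthesize(reply.prompt_text)  # synthesize from element data
        self.play(synthesized)                            # audio out to the user
        return synthesized


# Stand-ins for the real engines and application (assumptions for the demo only).
def fake_decode(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_application(app_data: str) -> str:
    return {"CMD_PLAY": "confirm_play"}.get(app_data, "error")


def fake_synthesize(text: str) -> str:
    return f"<speech:{text}>"


UI_DATA: Dict[str, InterfaceElement] = {
    "play": InterfaceElement("play music", "CMD_PLAY", ""),
    "confirm_play": InterfaceElement("", "", "Playing music"),
    "error": InterfaceElement("", "", "Sorry, that did not work"),
}

played: List[str] = []
manager = DialogManager(UI_DATA, fake_decode, fake_application, fake_synthesize, played.append)
manager.handle_utterance(b"Play music")
print(played)
```

Keeping the spoken forms and prompts in a data set separate from the dialog manager means that, in this sketch, the vocabulary can be changed by replacing `UI_DATA` without touching the routing logic.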
44 Claims
1. Apparatus for providing a spoken language interface between a user and at least one application or system, the apparatus operating in accordance with a computer processing system including a processor, an audio input system for receiving speech data provided by the user, an audio output system for outputting speech data to the user, a speech decoding engine and a speech synthesizing engine, the apparatus comprising:
a dialog manager operatively coupled to the application or system, the audio input system, the audio output system, the speech decoding engine and the speech synthesizing engine; and
at least one user interface data set operatively coupled to the dialog manager, the user interface data set representing spoken language interface elements and data recognizable by the application;
wherein:
(i) the dialog manager enables connection between the input audio system and the speech decoding engine such that the spoken utterance provided by the user is provided from the input audio system to the speech decoding engine;
(ii) the speech decoding engine decodes the spoken utterance to generate a decoded output which is returned to the dialog manager;
(iii) the dialog manager uses the decoded output to search the user interface data set for a corresponding spoken language interface element and data which is returned to the dialog manager when found;
(iv) the dialog manager provides the spoken language interface element associated data to the application or system for processing in accordance therewith;
(v) the application, on processing that element, provides a reference to an interface element to be spoken;
(vi) the dialog manager enables connection between the audio output system and the speech synthesizing engine such that the speech synthesizing engine which, accepting data from that element, generates a synthesized output that expresses that element; and
(vii) the audio output system audibly presenting the synthesized output to the user.
43. A method for providing a spoken language interface between a user and at least one application or system, the method comprising the steps of:
enabling connection between an input audio system and a speech decoding engine such that a spoken utterance provided by the user is provided from the input audio system to the speech decoding engine;
decoding the spoken utterance to generate a decoded output which is returned to a dialog manager;
using the decoded output to search a user interface data set for a corresponding spoken language interface element and data which is returned to the dialog manager when found;
providing the spoken language interface element associated data to the application or system for processing in accordance therewith;
the application, on processing the spoken language interface element, providing a reference to an interface element to be spoken;
enabling connection between an audio output system and a speech synthesizing engine such that the speech synthesizing engine which, accepting data from that element, generates a synthesized output that expresses that element; and
audibly presenting the synthesized output to the user.
44. An article of manufacture for providing a spoken language interface between a user and at least one application or system, comprising a machine readable medium containing one or more programs which when executed implement the steps of:
enabling connection between an input audio system and a speech decoding engine such that a spoken utterance provided by the user is provided from the input audio system to the speech decoding engine;
decoding the spoken utterance to generate a decoded output which is returned to a dialog manager;
using the decoded output to search a user interface data set for a corresponding spoken language interface element and data which is returned to the dialog manager when found;
providing the spoken language interface element associated data to the application or system for processing in accordance therewith;
the application, on processing the spoken language interface element, providing a reference to an interface element to be spoken;
enabling connection between an audio output system and a speech synthesizing engine such that the speech synthesizing engine which, accepting data from that element, generates a synthesized output that expresses that element; and
audibly presenting the synthesized output to the user.
Specification