Voice browser dialog enabler for a communication system
Abstract
A voice browser dialog enabler for multimodal dialog uses a multimodal markup document with fields that have markup-based forms associated with each field and defining fragments. A voice browser driver resides on a communication device and provides the fragments and identifiers that identify the fragments. A voice browser implementation resides on a remote voice server, receives the fragments from the driver, and downloads a plurality of speech grammars. Input speech is matched against those speech grammars associated with the corresponding identifiers received in a recognition request from the voice browser driver.
20 Claims
1. A voice browser dialog enabler for a communication system, the browser enabler comprising:
a speech recognition application comprising a plurality of units of application interaction, wherein each unit has associated voice dialog forms defining fragments;
a voice browser driver, the voice browser driver resident on a communication device;
the voice browser driver providing the fragments from the application and generating identifiers that identify the fragments; and
a voice browser implementation resident on a remote voice server, the voice browser implementation receiving the fragments from the voice browser driver and downloading a plurality of speech grammars, wherein subsequent input speech is matched against those speech grammars associated with the corresponding identifiers received in a speech recognition request from the voice browser driver.
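The split between the device-side driver and the server-side implementation recited in claim 1 can be illustrated with a minimal sketch. Every name here (`Fragment`, `VoiceBrowserDriver`, `VoiceBrowserImplementation`, the `frag-N` identifier scheme) is hypothetical and not taken from the patent, and plain phrase sets stand in for downloaded speech grammars.

```python
from dataclasses import dataclass
from itertools import count


@dataclass(frozen=True)
class Fragment:
    """A voice dialog form for one unit of application interaction (assumed shape)."""
    field_name: str
    grammar_phrases: frozenset  # stand-in for a downloadable speech grammar


class VoiceBrowserImplementation:
    """Server side: stores grammars by identifier and matches speech against them."""

    def __init__(self):
        self._grammars = {}

    def receive_fragment(self, identifier, fragment):
        # Stand-in for downloading the grammar that the fragment refers to.
        self._grammars[identifier] = fragment.grammar_phrases

    def recognize(self, identifier, utterance):
        # Only the grammar tied to the requested identifier is consulted.
        phrases = self._grammars[identifier]
        text = utterance.strip().lower()
        return text if text in phrases else None


class VoiceBrowserDriver:
    """Device side: assigns identifiers to fragments and forwards them to the server."""

    def __init__(self, server):
        self._server = server
        self._ids = count(1)
        self.identifier_for = {}

    def provide_fragments(self, fragments):
        for fragment in fragments:
            identifier = f"frag-{next(self._ids)}"  # assumed identifier scheme
            self.identifier_for[fragment.field_name] = identifier
            self._server.receive_fragment(identifier, fragment)

    def request_recognition(self, field_name, utterance):
        # The request carries only the identifier, not the fragment itself.
        return self._server.recognize(self.identifier_for[field_name], utterance)
```

In this sketch the driver stays thin: it holds only a field-to-identifier map, while grammar storage and matching live on the server object.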
8. A voice browser for multimodal dialog in a communication system, the browser comprising:
a multimodal markup document split into a displayable markup portion and a voice markup portion comprising fields, wherein the fields have associated forms defining fragments of the document page;
a voice browser stub including a voice browser driver portion of a voice browser, the voice browser driver resident on a communication device;
the voice browser stub generating the fragments and the voice browser driver generating identifiers that identify the fragments; and
a voice browser implementation portion of the voice browser resident on a remote voice server, the voice browser implementation downloading the fragments from the voice browser stub and downloading a plurality of speech grammars, wherein subsequent input speech is matched against those speech grammars associated with the corresponding identifiers received in a speech recognition request from the voice browser driver.
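As a rough illustration of the document split recited in claim 8, the sketch below separates a page into a displayable portion and per-field voice fragments, then assigns an identifier to each fragment. The `<page>`, `<display>`, `<voice>`, and `<form field="...">` markup, the `.grxml` file names, and the `form-N` identifiers are all invented for this example; the claim does not prescribe a concrete syntax.

```python
import xml.etree.ElementTree as ET

# Hypothetical multimodal page: one displayable portion, one voice portion.
PAGE = """<page>
  <display>
    <input name="city"/>
    <input name="state"/>
  </display>
  <voice>
    <form field="city"><prompt>Which city?</prompt><grammar src="city.grxml"/></form>
    <form field="state"><prompt>Which state?</prompt><grammar src="state.grxml"/></form>
  </voice>
</page>"""


def split_document(source):
    """Return the displayable element and a dict of voice fragments keyed by field."""
    root = ET.fromstring(source)
    display = root.find("display")
    fragments = {}
    for form in root.find("voice").findall("form"):
        # Each form is serialized on its own so it can be shipped as a fragment.
        fragments[form.get("field")] = ET.tostring(form, encoding="unicode").strip()
    return display, fragments


def assign_identifiers(fragments):
    """Map each field to a fragment identifier (assumed sequential scheme)."""
    return {name: f"form-{n}" for n, name in enumerate(fragments, start=1)}


display, fragments = split_document(PAGE)
identifiers = assign_identifiers(fragments)
```

Here `split_document` plays the role of the stub that generates fragments, and `assign_identifiers` plays the role of the driver; the displayable element would go to the visual browser, which is outside this sketch.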
13. A method for enabling dialog with a voice browser for a communication system, the method comprising the steps of:
providing a voice browser driver resident on a communication device and a voice browser implementation containing a plurality of speech grammars resident on a remote voice server;
running a speech recognition application comprising a plurality of units of application interaction, wherein each unit has associated voice dialog forms defining fragments;
defining identifiers associated with each fragment;
supplying the fragments to the voice browser implementation;
focusing on a field in one of the units of application interaction;
sending a speech recognition request including the identifier of the form associated with the focused field from the voice browser driver to the voice browser implementation;
inputting and recognizing speech;
matching the speech to the acceptable speech grammar associated with the identifier; and
obtaining speech recognition results.
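The focus, request, and match steps of claim 13 can be sketched as a small message exchange. The JSON wire format, the function names, and the use of phrase sets in place of real speech grammars are assumptions made for illustration; the claim does not specify how the request is encoded.

```python
import json


def build_recognition_request(form_ids, focused_field):
    """Driver side: name the form tied to the focused field (assumed JSON format)."""
    return json.dumps({"type": "recognize", "form_id": form_ids[focused_field]})


def handle_recognition_request(grammars, raw_request, utterance):
    """Server side: match the utterance against the requested form's grammar only."""
    request = json.loads(raw_request)
    accepted = grammars[request["form_id"]]
    heard = utterance.strip().lower()
    return {"form_id": request["form_id"], "match": heard if heard in accepted else None}


# Identifiers already defined per fragment, and grammars already held by the server.
form_ids = {"city": "form-1", "state": "form-2"}
grammars = {"form-1": {"chicago", "boston"}, "form-2": {"illinois", "texas"}}

# Focus moves to the "state" field, so the request names only the state form.
request = build_recognition_request(form_ids, "state")
result = handle_recognition_request(grammars, request, "Texas")
```

Because the request carries just the form identifier, an utterance that is valid for another field (for example "Boston" while the state field has focus) yields no match in this sketch.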
Specification