Network provided information using text-to-speech and speech recognition and text or speech activated network control sequences for complimentary feature access
First Claim
1. A method for permitting a subscriber to perform an action available on a communications network using a spoken utterance, comprising:
maintaining a system state database comprising a tree structure having a plurality of nodes, each respective node of said plurality of nodes representing a particular system state of a plurality of possible system states, each state comprising a plurality of possible steps in a call flow, including an always connected state in which a feature may be accessed even when a call is not in progress and being associated with a predetermined node-specific, speech-based grammar for the respective node;
awaiting from the subscriber a spoken utterance at the particular system state;
recognizing the spoken utterance by comparing the spoken utterance to the predetermined grammar for the respective node for correspondence to the particular system state; and
performing an action at the network represented by the spoken utterance when the spoken utterance has been recognized as the predetermined grammar for the respective node, wherein the action activates a control sequence at the network for accessing a feature available on the network, including always-on connected features.
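The steps recited in claim 1 can be illustrated with a minimal sketch. It assumes the spoken utterance has already been transcribed to text, and it models each node's grammar as a plain phrase-to-action table. The names `StateNode`, `handle_utterance`, and the `control:` strings are hypothetical and chosen only for illustration; they do not come from the patent.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Optional


@dataclass
class StateNode:
    """One node of the system state tree (hypothetical structure)."""
    name: str
    # Node-specific grammar: accepted phrase -> action to perform.
    grammar: Dict[str, Callable[[], str]]
    children: Dict[str, "StateNode"] = field(default_factory=dict)


def normalize(utterance: str) -> str:
    """Lower-case the text and collapse runs of whitespace."""
    return " ".join(utterance.lower().split())


def handle_utterance(node: StateNode, utterance: str) -> Optional[str]:
    """Compare the utterance to the grammar of the current node only.

    Returns the result of the matched action, or None when the
    utterance is not part of this node's grammar.
    """
    action = node.grammar.get(normalize(utterance))
    if action is None:
        return None
    return action()


# Assumed example tree: an always connected root state with one child
# state for a call in progress. Each action returns the name of a
# made-up control sequence.
in_call = StateNode(
    name="call_in_progress",
    grammar={
        "hold": lambda: "control:hold",
        "conference": lambda: "control:conference",
    },
)
root = StateNode(
    name="always_connected",
    grammar={"read my messages": lambda: "control:read_messages"},
    children={"call_in_progress": in_call},
)

print(handle_utterance(root, "Read my  messages"))
print(handle_utterance(root, "hold"))
print(handle_utterance(in_call, "hold"))
```

Because each node carries its own grammar, "hold" is accepted in the call-in-progress state but rejected in the always connected state, which is the context restriction the claim describes.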
5 Assignments
0 Petitions
Abstract
A real-time networked telephony or computer system has a feature complex and/or applications that offer a class of features to a subscriber, including call information, and permits the subscriber to manage incoming and existing calls through available features accessed using spoken utterances. A speech processing unit coupled to the system interprets a subscriber's spoken utterances without requiring the subscriber to train the system to recognize his or her voice. The interpretation of spoken utterances is enabled by a system state database that is maintained at the speech processing unit and comprises a database of the possible system states, including possible call flows for a call, and a database associated with the system state database comprising context-specific grammar that a subscriber may recite at respective points in the call flow. The speech processing unit may also convert message signals from the network to speech, which is read to the subscriber using a text-to-speech translator. The network can identify the subscriber's voice or the language used and will thereafter recognize all further commands using the specific grammar for that language, as well as perform text-to-speech conversion using the identified language. Use of the features can also update grammars, profiles, templates, and the like by transmitting the results of transactions.
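The language handling described in the abstract can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: language identification is reduced to matching an already transcribed phrase against each language's grammar for the current state, whereas a real system would identify the language from audio. The `GRAMMARS` table, the `Session` class, and the `control:` strings are hypothetical.

```python
from typing import Dict, Optional, Tuple

# Assumed grammar table keyed by (system state, language code).
GRAMMARS: Dict[Tuple[str, str], Dict[str, str]] = {
    ("call_in_progress", "en"): {"hold": "control:hold"},
    ("call_in_progress", "es"): {"espera": "control:hold"},
}


class Session:
    """Tracks the language identified for one subscriber session."""

    def __init__(self) -> None:
        self.language: Optional[str] = None

    def recognize(self, state: str, utterance: str) -> Optional[str]:
        phrase = utterance.strip().lower()
        if self.language is None:
            # Stand-in for language identification: pick the first
            # language whose grammar for this state contains the phrase.
            for (grammar_state, lang), grammar in GRAMMARS.items():
                if grammar_state == state and phrase in grammar:
                    self.language = lang
                    break
        if self.language is None:
            return None
        # From here on, only the identified language's grammar is used.
        return GRAMMARS.get((state, self.language), {}).get(phrase)


session = Session()
print(session.recognize("call_in_progress", "Espera"))
print(session.language)
print(session.recognize("call_in_progress", "hold"))
```

Once the session language is fixed, later commands are checked only against that language's grammar, so the English phrase "hold" is no longer recognized in the Spanish session above.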
67 Citations
32 Claims
1. A method for permitting a subscriber to perform an action available on a communications network using a spoken utterance, comprising:
maintaining a system state database comprising a tree structure having a plurality of nodes, each respective node of said plurality of nodes representing a particular system state of a plurality of possible system states, each state comprising a plurality of possible steps in a call flow, including an always connected state in which a feature may be accessed even when a call is not in progress and being associated with a predetermined node-specific, speech-based grammar for the respective node;
awaiting from the subscriber a spoken utterance at the particular system state;
recognizing the spoken utterance by comparing the spoken utterance to the predetermined grammar for the respective node for correspondence to the particular system state; and
performing an action at the network represented by the spoken utterance when the spoken utterance has been recognized as the predetermined grammar for the respective node, wherein the action activates a control sequence at the network for accessing a feature available on the network, including always-on connected features.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16
17. A communications system for permitting a subscriber to perform an action available on a communications network using a spoken utterance comprising:
a device coupled to the network and into which an utterance may be spoken by a user,
a system state database accessible to the network and defining a tree structure having a plurality of nodes, each respective node of said plural nodes representing a particular system state of a plurality of possible system states, each state comprising a plurality of possible steps in a call flow, including an always connected state in which a feature may be accessed even when a call is not in progress and being associated with a predetermined node-specific, speech-based grammar for the respective node;
means for interpreting the user-spoken utterance;
means for comparing the interpreted spoken utterance to the predetermined grammar for the respective node corresponding to the particular system state to recognize the spoken utterance as corresponding to the predetermined grammar associated with the respective node; and
means for performing an action represented by the spoken utterance at the network when the spoken utterance has been recognized as corresponding to the predetermined grammar associated with the respective node, wherein the action activates a control sequence at the network for accessing a feature available on the network, including always-on connected features.
Dependent claims: 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32
Specification