Speech-responsive voice messaging system and method
Abstract
A system and method for speech-responsive voice messaging, in which a Speech-Responsive Voice Messaging System (SRVMS) preferably provides a hierarchically-simple speech user interface (UI) that enables subscribers to use speech to specify commands such as mailboxes, passwords, and digits. The SRVMS generates and evaluates candidate results. The SRVMS invokes a speech UI navigation operation or a voice messaging operation according to the outcome of the evaluation of the candidate results. In the preferred embodiment, the SRVMS determines whether the candidate results are good, questionable, or bad; and whether two or more candidate results are ambiguous due to a likelihood that each such result could be a valid command. If the candidate results are questionable or ambiguous, an ambiguity resolution UI prompts the subscriber to confirm whether the best candidate result is what the subscriber intended. In response to repeated speech recognition failures, the SRVMS transfers the subscriber to a Dual Tone Multi Frequency (DTMF) UI. Transfer to the DTMF UI is also performed in response to detection of predetermined DTMF signals issued by the subscriber while the speech UI is in context. The SRVMS provides a logging unit and a reporting unit which operate in parallel with the speech UI, in a manner that is transparent to subscribers. The logging unit directs the selective logging of subscriber utterances, and the reporting unit selectively generates and maintains system performance statistics on multiple detail levels.
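The evaluation described in the abstract can be sketched as follows. This is only an illustrative sketch: the numeric thresholds, the ambiguity margin, and the names `Candidate`, `quality`, `is_ambiguous`, and `next_action` are assumptions made for the example and do not come from the patent, which gives no numeric values here.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Candidate:
    command: str
    score: float  # hypothetical confidence in [0.0, 1.0]; higher is better


# Hypothetical values chosen only for illustration.
GOOD_THRESHOLD = 0.80
BAD_THRESHOLD = 0.40
AMBIGUITY_MARGIN = 0.10


def quality(score: float) -> str:
    """Label a score as good, questionable, or bad."""
    if score >= GOOD_THRESHOLD:
        return "good"
    if score < BAD_THRESHOLD:
        return "bad"
    return "questionable"


def is_ambiguous(candidates: List[Candidate]) -> bool:
    """True if the runner-up is plausible and scores close to the best result."""
    ranked = sorted(candidates, key=lambda c: c.score, reverse=True)
    if len(ranked) < 2:
        return False
    best, runner_up = ranked[0], ranked[1]
    return (runner_up.score >= BAD_THRESHOLD
            and best.score - runner_up.score < AMBIGUITY_MARGIN)


def next_action(candidates: List[Candidate]) -> str:
    """Choose between executing, confirming, and reprompting."""
    if not candidates:
        return "reprompt"
    best = max(candidates, key=lambda c: c.score)
    label = quality(best.score)
    if label == "bad":
        return "reprompt"
    if label == "questionable" or is_ambiguous(candidates):
        return "confirm"  # hand off to the ambiguity resolution UI
    return "execute"
```

In this sketch a single clear winner is executed directly, while a questionable score or a close runner-up leads to a confirmation prompt.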
34 Claims
1. A method of recognizing one or more audible instructions from a user, the method comprising the steps of:
a. selecting a speech parameter block, wherein the speech parameter block comprises a data structure having a plurality of data fields, wherein the plurality of data fields includes a first data field for storing a plurality of speech recognition parameters and a second data field for storing a plurality of commands;
b. audibly prompting the user for a response;
c. detecting the response, wherein the response is one selected from a group consisting of an audible utterance by the user and a dual tone multi frequency signal input actuated by the user;
d. performing a speech recognition operation on the audible utterance guided by the plurality of speech recognition parameters to determine a potential match between the audible utterance and one or more commands stored in the second data field; and
e. scoring the potential match relative to the plurality of speech recognition parameters stored in the speech parameter block to produce a score.
a. setting a confirmation indicator if the score is designated with a first confidence label; and
b. setting a repeat speech recognition indicator if the score is designated with a second confidence label.
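One possible reading of the speech parameter block and the scoring step of claim 1 is sketched below. The field names, the threshold keys `good_threshold` and `bad_threshold`, and the function `score_match` are hypothetical. The mapping of the "first confidence label" to a questionable score and the "second confidence label" to a bad score is an assumption drawn from the abstract, not something the claims state. The recognizer itself is not modelled: `recognizer_scores` stands in for whatever raw scores a real recognizer would return.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class SpeechParameterBlock:
    # First data field: speech recognition parameters (hypothetical keys).
    recognition_params: Dict[str, float]
    # Second data field: commands valid in this context.
    commands: List[str]


@dataclass
class RecognitionOutcome:
    command: Optional[str]
    score: float
    label: str
    confirm: bool = False  # confirmation indicator
    repeat: bool = False   # repeat speech recognition indicator


def score_match(block: SpeechParameterBlock,
                recognizer_scores: Dict[str, float]) -> RecognitionOutcome:
    """Pick the best in-vocabulary command and label its score."""
    # Only commands stored in the block are eligible matches.
    in_vocab = {c: s for c, s in recognizer_scores.items() if c in block.commands}
    if not in_vocab:
        return RecognitionOutcome(None, 0.0, "bad", repeat=True)
    command, score = max(in_vocab.items(), key=lambda kv: kv[1])
    params = block.recognition_params
    if score >= params["good_threshold"]:
        label = "good"
    elif score >= params["bad_threshold"]:
        label = "questionable"
    else:
        label = "bad"
    return RecognitionOutcome(command, score, label,
                              confirm=(label == "questionable"),
                              repeat=(label == "bad"))
```

Keeping the thresholds inside the block rather than in global constants means each prompt context can apply its own strictness, which appears to be the purpose of storing recognition parameters alongside the commands.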
11. The method according to claim 10 further comprising the step of performing a confirmation operation if the confirmation indicator is set.
12. The method according to claim 11 wherein the step of performing a confirmation operation includes the steps of:
a. selecting a confirmation speech parameter block, wherein the confirmation speech parameter block stores a plurality of parameters for guiding the confirmation operation;
b. generating an appropriate confirmation voice prompt to the user;
c. detecting a confirmation utterance by the user; and
d. evaluating the confirmation utterance.
13. The method according to claim 12 wherein the step of evaluating the confirmation utterance includes the steps of:
a. performing a confirmation speech recognition operation on the confirmation utterance guided by the plurality of parameters; and
b. determining whether an unsuccessful confirmation condition exists.
14. The method according to claim 13 further comprising the step of repeating the step of performing a confirmation operation upon the occurrence of the unsuccessful confirmation condition.
15. The method according to claim 10 further comprising the step of initiating a second audible prompt to the user if the repeat speech recognition indicator is set.
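The confirmation operation of claims 11 through 14 can be illustrated with a small loop. This is a sketch under stated assumptions: `listen` is a hypothetical callable that plays a prompt and returns the recognized reply as text, the yes/no vocabulary and the `max_attempts` limit are invented for the example, and the fallback result reflects the abstract's statement that repeated recognition failures transfer the subscriber to the DTMF UI.

```python
from typing import Callable

# Hypothetical confirmation vocabulary.
YES_WORDS = {"yes"}
NO_WORDS = {"no"}


def confirm_command(command: str,
                    listen: Callable[[str], str],
                    max_attempts: int = 3) -> str:
    """Ask the user to confirm a command, repeating on unclear replies."""
    prompt = f"Did you say {command}? Please say yes or no."
    for _ in range(max_attempts):
        reply = listen(prompt).strip().lower()
        if reply in YES_WORDS:
            return "confirmed"
        if reply in NO_WORDS:
            return "rejected"
        # Unsuccessful confirmation condition: loop and prompt again.
    # Repeated failures: hand the caller over to touch-tone input.
    return "fallback_to_dtmf"
```

A caller would typically execute the command on `"confirmed"`, reprompt for a new command on `"rejected"`, and switch user interfaces on `"fallback_to_dtmf"`.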
16. The method according to claim 1 further comprising the step of performing a dual tone multi frequency recognition operation on the dual tone multi frequency signal input.
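The claim does not say how the dual tone multi frequency recognition operation is carried out. One common technique, shown here purely as an illustration and not as the patent's method, is to measure signal power at the eight standard DTMF frequencies with the Goertzel algorithm and pick the strongest row and column tone. The function names, the 8000 Hz sample rate, and the `min_power` floor are assumptions made for the example.

```python
import math
from typing import List, Optional

# Standard DTMF frequency grid (Hz) and keypad layout.
ROW_FREQS = (697, 770, 852, 941)
COL_FREQS = (1209, 1336, 1477, 1633)
KEYPAD = ("123A", "456B", "789C", "*0#D")


def goertzel_power(samples: List[float], freq: float, sample_rate: int) -> float:
    """Signal power at one frequency, computed with the Goertzel recurrence."""
    coeff = 2.0 * math.cos(2.0 * math.pi * freq / sample_rate)
    s_prev = s_prev2 = 0.0
    for x in samples:
        s = x + coeff * s_prev - s_prev2
        s_prev2, s_prev = s_prev, s
    return s_prev ** 2 + s_prev2 ** 2 - coeff * s_prev * s_prev2


def detect_dtmf_key(samples: List[float], sample_rate: int = 8000,
                    min_power: float = 1.0) -> Optional[str]:
    """Return the keypad symbol for one tone burst, or None if no tone is found."""
    rows = [goertzel_power(samples, f, sample_rate) for f in ROW_FREQS]
    cols = [goertzel_power(samples, f, sample_rate) for f in COL_FREQS]
    if max(rows) <= min_power or max(cols) <= min_power:
        return None  # hypothetical energy floor; treats the burst as silence
    return KEYPAD[rows.index(max(rows))][cols.index(max(cols))]


def make_tone(key: str, sample_rate: int = 8000, duration: float = 0.05) -> List[float]:
    """Synthesize a clean DTMF burst for a key (used only to exercise the detector)."""
    for r, row in enumerate(KEYPAD):
        c = row.find(key)
        if c != -1:
            break
    else:
        raise ValueError(f"not a DTMF key: {key!r}")
    n = int(sample_rate * duration)
    return [0.5 * math.sin(2 * math.pi * ROW_FREQS[r] * t / sample_rate)
            + 0.5 * math.sin(2 * math.pi * COL_FREQS[c] * t / sample_rate)
            for t in range(n)]
```

For example, `detect_dtmf_key(make_tone("5"))` recovers the key from a synthesized burst. A production detector would also need to handle noise, twist between the two tones, and tone duration, none of which this sketch attempts.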
17. A voice messaging system capable of recognizing one or more voice instructions from a user, the voice messaging system comprising:
a. a speech parameter block library having a plurality of speech parameter blocks for regulating voice communication with the user, wherein each speech parameter block includes a data structure having a plurality of data fields;
b. means coupled to the speech parameter block library for selecting an appropriate one of the plurality of speech parameter blocks to control voice communication with the user;
c. means coupled to the speech parameter block library for audibly prompting the user for a response;
d. means coupled to the speech parameter block library for detecting the response, wherein the response is one selected from a group consisting of an utterance by the user and a dual tone multi frequency signal input actuated by the user; and
e. means coupled to the speech parameter block library for performing a speech recognition operation on the utterance.
a. a confirmation indicator which is set if the score is designated with a first confidence label; and
b. a repeat speech recognition indicator which is set if the score is designated with a second confidence label.
25. The voice messaging system according to claim 24 further including means for performing a confirmation operation if the confirmation indicator is set.
26. The voice messaging system according to claim 25 wherein the means for performing a confirmation operation comprises:
a. means for transferring to a confirmation speech parameter block, wherein the confirmation speech parameter block stores a plurality of confirmation parameters;
b. means for generating an appropriate confirmation voice prompt to the user;
c. means for detecting a confirmation utterance by the user; and
d. means for evaluating the confirmation utterance.
27. The voice messaging system according to claim 26 wherein the means for evaluating the confirmation utterance includes:
a. means for performing a confirmation speech recognition operation on the confirmation utterance by applying the plurality of confirmation parameters; and
b. means for determining whether an unsuccessful confirmation condition exists.
28. The voice messaging system according to claim 27 wherein the means for performing a confirmation operation performs the confirmation operation upon the occurrence of the unsuccessful confirmation condition.
29. The voice messaging system according to claim 24 wherein the means for audibly prompting the user prompts the user for an additional response if the repeat speech recognition indicator is set.
30. The voice messaging system according to claim 17 further comprising a template library having a plurality of speech templates, each speech template corresponding to one of the plurality of commands, wherein the template library facilitates performing the speech recognition operation on the utterance.
31. The voice messaging system according to claim 30 wherein the plurality of speech templates includes a plurality of speaker independent speech templates.
32. The voice messaging system according to claim 31 wherein the speech recognition operation includes a speaker independent speech recognition operation.
33. The voice messaging system according to claim 32 wherein the plurality of speech templates includes a plurality of speaker dependent speech templates.
34. The voice messaging system according to claim 33 wherein the speech recognition operation includes a speaker dependent speech recognition operation.
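The template library of claims 30 through 34 can be pictured with a toy example. The claims do not specify how an utterance is compared against a template, so this sketch swaps in dynamic time warping (DTW) over one-dimensional feature sequences as a stand-in. The class and field names, the use of `speaker_id=None` to mark a speaker independent template, and the feature values are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence, Tuple


@dataclass(frozen=True)
class SpeechTemplate:
    command: str
    features: Tuple[float, ...]
    speaker_id: Optional[str] = None  # None marks a speaker independent template


def dtw_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Dynamic time warping distance between two feature sequences."""
    inf = float("inf")
    n, m = len(a), len(b)
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = abs(a[i - 1] - b[j - 1])
            cost[i][j] = step + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


class TemplateLibrary:
    def __init__(self) -> None:
        self._templates: List[SpeechTemplate] = []

    def add(self, template: SpeechTemplate) -> None:
        self._templates.append(template)

    def candidates(self, speaker_id: Optional[str] = None) -> List[SpeechTemplate]:
        """Speaker independent templates plus those owned by the given speaker."""
        return [t for t in self._templates
                if t.speaker_id is None or t.speaker_id == speaker_id]

    def best_match(self, features: Sequence[float],
                   speaker_id: Optional[str] = None) -> Optional[str]:
        """Command whose template is closest to the utterance, or None if empty."""
        pool = self.candidates(speaker_id)
        if not pool:
            return None
        return min(pool, key=lambda t: dtw_distance(features, t.features)).command
```

Holding both kinds of template in one library lets a single lookup serve callers who have trained personal commands as well as those who have not.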
Specification