Method for efficient, safe and reliable data entry by voice under adverse conditions
First Claim
1. A method of data entry by voice under adverse conditions for efficient and robust form filling, the method comprising:
- communicating an input utterance from a speaker to a speech recognition means;
spotting a plurality of spotted words of at least one recognized spoken word within the input utterance, wherein the spotted words form a phrase containing at least one of field-specific values and commands;
echoing recognized values back to the speaker via a text-to-speech system;
rejecting unreliable or unsafe inputs for which a confidence measure is found to be low; and
maintaining a dialogue history enabling editing operations and correction operations on all active fields.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and apparatus for data entry by voice under adverse conditions is disclosed. More specifically it provides a way for efficient and robust form filling by voice. A form can typically contain one or several fields that must be filled in. The user communicates to a speech recognition system and word spotting is performed upon the utterance. The spotted words of an utterance form a phrase that can contain field-specific values and/or commands. Recognized values are echoed back to the speaker via a text-to-speech system. Unreliable or unsafe inputs for which the confidence measure is found to be low (e.g. ill-pronounced speech or noises) are rejected by the spotter. Speaker adaptation is furthermore performed transparently to improve speech recognition accuracy. Other input modalities can be additionally supported (e.g. keyboard and touch-screen). The system maintains a dialogue history to enable editing and correction operations on all active fields.
70 Citations
20 Claims
-
1. A method of data entry by voice under adverse conditions for efficient and robust form filling, the method comprising:
-
communicating an input utterance from a speaker to a speech recognition means;
spotting a plurality of spotted words of at least one recognized spoken word within the input utterance, wherein the spotted words form a phrase containing at least one of field-specific values and commands;
echoing recognized values back to the speaker via a text-to-speech system;
rejecting unreliable or unsafe inputs for which a confidence measure is found to be low; and
maintaining a dialogue history enabling editing operations and correction operations on all active fields. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. An article of manufacture for data entry by voice under adverse conditions enabling efficient and robust form filling, the article of manufacture comprising:
-
an operating system;
a memory in communication with said operating system;
a speech recognition means in communication with said operating system;
a speech generation means in communication with said operating system; and
a dialogue history maintenance means in communication with said operating system, wherein said operating system manages said memory, said speech recognition means, said speech generation means, and said dialogue history maintenance means in a manner permitting the user to monitor speech recognition of an input utterance by means of a generated speech corresponding to at least one of field-specific values and commands contained within the phrase formed by spotted words within the input utterance, and to perform editing operations and correction operations on all active fields. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification