Method and apparatus for automatic recognition of long sequences of spoken digits
Abstract
A method and system of recognizing speech based in part on an observation that a speaker naturally pauses and speaks smaller subgroups of speech units or digits that form part of a complete longer speech sequence. In the method, subgroups of speech units are processed by the system between a human's natural pauses. This pause is detected by the system and the subgroup is processed in order to provide a recognition result, which is a best representation of the input subgroup. The recognition result is immediately repeated back to the user for verification. The user is prompted to repeat a subgroup for re-recognition and re-verification if a rejection criterion is met; otherwise the processing steps are repeated for remaining subgroups until it has been determined that the complete speech sequence has been accurately recognized.
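The control flow described in the abstract can be sketched as a short loop. This is only an illustrative sketch, not the patented implementation: the function name `recognize_sequence`, the callables `recognize`, `confirm` and `reprompt`, and the `max_attempts` limit are all hypothetical names and assumptions introduced here, and the rejection criterion is modeled simply as `confirm` returning `False`.

```python
from typing import Callable, Iterable, List


def recognize_sequence(
    subgroups: Iterable[str],
    recognize: Callable[[str], str],
    confirm: Callable[[str], bool],
    reprompt: Callable[[], str],
    max_attempts: int = 3,  # assumption: the abstract states no retry limit
) -> List[str]:
    """Recognize a long sequence one pause-delimited subgroup at a time."""
    accepted: List[str] = []
    for audio in subgroups:
        attempts = 0
        while True:
            attempts += 1
            # Recognize the current subgroup.
            result = recognize(audio)
            # Feed the result back right away; confirm() stands in for the
            # rejection criterion (False means the subgroup is rejected).
            if confirm(result):
                accepted.append(result)
                break
            if attempts >= max_attempts:
                raise RuntimeError("subgroup rejected too many times")
            # Prompt the user to repeat only this subgroup, then retry.
            audio = reprompt()
    return accepted
```

Because verification happens per subgroup, a rejection costs the user one short repetition rather than a restart of the whole sequence, which is the benefit the abstract points to.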
42 Citations
27 Claims
1. A method of recognizing speech in systems that accept speech input, comprising:
(a) receiving at least a current subgroup of speech units that form part of a complete speech sequence that is to be input from a user;
(b) detecting a natural pause between input subgroups;
(c) recognizing the speech units of the subgroup to provide a recognition result; and
(d) immediately feeding back the recognition result for verification by the user,
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
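Step (b) does not say how a natural pause is detected. One common approach, shown here purely as an assumption, is to threshold per-frame signal energy and treat a sufficiently long run of quiet frames as a pause. The function name `split_on_pauses` and the parameters `silence_threshold` and `min_pause_frames` are hypothetical and their default values are arbitrary.

```python
from typing import List, Optional, Sequence, Tuple


def split_on_pauses(
    energies: Sequence[float],
    silence_threshold: float = 0.1,  # assumption: frames below this are silent
    min_pause_frames: int = 3,       # assumption: silent run that counts as a pause
) -> List[Tuple[int, int]]:
    """Return (first_frame, last_frame) spans of speech separated by pauses."""
    spans: List[Tuple[int, int]] = []
    start: Optional[int] = None
    last_speech = 0
    silent_run = 0
    for i, energy in enumerate(energies):
        if energy >= silence_threshold:
            if start is None:
                start = i          # a new subgroup begins here
            last_speech = i
            silent_run = 0
        else:
            silent_run += 1
            # A long enough silence closes the current subgroup; shorter
            # gaps (for example between adjacent digits) are kept inside it.
            if start is not None and silent_run >= min_pause_frames:
                spans.append((start, last_speech))
                start = None
    if start is not None:
        spans.append((start, last_speech))  # sequence ended mid-subgroup
    return spans
```

Each returned span would then be handed to the recognizer as one subgroup, so that recognition and feedback can start as soon as the pause is seen.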
13. An automatic speech recognition system, comprising:
a receiver for receiving at least a current subgroup of speech units that form part of a complete speech sequence that is to be input by a user;
a detector for detecting a natural pause after receiving the subgroup;
a decoder for detecting a natural pause between input subgroups to output a recognition result representative of the current subgroup; and
a controller for evaluating the output recognition result and feeding back the recognition result to the user.
Dependent claims: 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27
Specification