METHOD, SYSTEM AND COMPUTER PROGRAM FOR ENHANCED SPEECH RECOGNITION OF DIGITS INPUT STRINGS
First Claim
1. A method for speech recognition comprising:
- for an expected input string divided into a plurality of expected string segments, receiving a speech segment for each expected string segment;
performing speech recognition separately on each said speech segment, wherein said performing speech recognition comprisesgenerating, for each said speech segment, a segment n-best list comprising n highest confidence score results of said speech recognition, where n is an integer;
generating a global n-best list corresponding to said expected input string utilizing said segment n-best lists; and
determining a final global speech recognition result corresponding to said expected input string, wherein said determining said final global speech recognition result comprisespruning results of said global n-best list utilizing a pruning criterion.
3 Assignments
0 Petitions
Accused Products
Abstract
The present invention proposes a method, system and computer program for speech recognition. According to one embodiment, a method is provided wherein, for an expected input string divided into a plurality of expected string segments, a speech segment is received for each expected string segment. Speech recognition is then performed separately on each said speech segment via the generation, for each said speech segment, of a segment n-best list comprising n highest confidence score results. A global n-best list is then generated corresponding to the expected input string utilizing the segment n-best lists and a final global speech recognition result corresponding to said expected input string is determined via the pruning of the results of the global n-best list utilizing a pruning criterion.
-
Citations
14 Claims
-
1. A method for speech recognition comprising:
-
for an expected input string divided into a plurality of expected string segments, receiving a speech segment for each expected string segment; performing speech recognition separately on each said speech segment, wherein said performing speech recognition comprises generating, for each said speech segment, a segment n-best list comprising n highest confidence score results of said speech recognition, where n is an integer; generating a global n-best list corresponding to said expected input string utilizing said segment n-best lists; and determining a final global speech recognition result corresponding to said expected input string, wherein said determining said final global speech recognition result comprises pruning results of said global n-best list utilizing a pruning criterion. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. One or more machine-readable media having stored therein a program product, which when executed by a set of one or more processors causes the set of one or more processors to perform a method comprising:
-
for an expected input string divided into a plurality of expected string segments, receiving a speech segment for each expected string segment; performing speech recognition separately on each said speech segment, wherein said performing speech recognition comprises generating, for each said speech segment, a segment n-best list comprising n highest confidence score results of said speech recognition, where n is an integer; generating a global n-best list corresponding to said expected input string utilizing said segment n-best lists; and determining a final global speech recognition result corresponding to said expected input string, wherein said determining said final global speech recognition result comprises pruning results of said global n-best list utilizing a pruning criterion.
-
-
14. A system for speech recognition comprising:
-
a set of one or more processors; a memory unit coupled with the set of one or more processors; and a speech recognition unit operable to, for an expected input string divided into a plurality of expected string segments, receive a speech segment for each expected string segment; perform speech recognition separately on each said speech segment, wherein said speech recognition comprises generating, for each said speech segment, a segment n-best list comprising n highest confidence score results of said speech recognition, where n is an integer; generate a global n-best list corresponding to said expected input string utilizing said segment n-best lists; and determine a final global speech recognition result corresponding to said expected input string, wherein determining said final global speech recognition result comprises pruning results of said global n-best list utilizing a pruning criterion.
-
Specification