Using a spoken utterance for disambiguation of spelling inputs into a speech recognition system
First Claim
Patent Images
1. A method of verifying a speech input comprising:
- determining pronunciation data for a received user spoken utterance specifying a word;
speech recognizing further user spoken utterances specifying individual characters of the word, wherein an N-best list is generated for each character;
automatically generating word candidates using the N-best list for each character; and
comparing the pronunciation data with the word candidates to determine at least one match.
3 Assignments
0 Petitions
Accused Products
Abstract
A method of verifying a speech input can include determining pronunciation data for a received user spoken utterance specifying a word and speech recognizing further user spoken utterances specifying individual characters of the word. An N-best list can be generated for each character. Word candidates can be generated using the N-best list for each character. The pronunciation data can be compared with the word candidates to determine at least one match.
177 Citations
20 Claims
-
1. A method of verifying a speech input comprising:
-
determining pronunciation data for a received user spoken utterance specifying a word;
speech recognizing further user spoken utterances specifying individual characters of the word, wherein an N-best list is generated for each character;
automatically generating word candidates using the N-best list for each character; and
comparing the pronunciation data with the word candidates to determine at least one match. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method of processing a speech input comprising:
-
selecting a domain of words;
determining pronunciation data for a word specified by a received user spoken utterance;
comparing the pronunciation data for the word with a list of common words of the domain to find a match;
if a match is found, discontinuing further speech processing; and
if a match is not found, speech recognizing further user spoken utterances specifying a plurality of individual characters of the word for comparison to the pronunciation data. - View Dependent Claims (9, 10, 11, 12, 13)
-
-
14. A machine readable storage, having stored thereon a computer program having a plurality of code sections executable by a machine for causing the machine to perform the steps of:
-
determining pronunciation data for a received user spoken utterance specifying a word;
speech recognizing further user spoken utterances specifying individual characters of the word, wherein an N-best list is generated for each character;
automatically generating word candidates using the N-best list for each character; and
comparing the pronunciation data with the word candidates to determine at least one match. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
Specification