SYSTEM AND METHOD FOR SPELLING RECOGNITION USING SPEECH AND NON-SPEECH INPUT
First Claim
1. A system for recognizing a combination of speech and alternate input, the method comprising:
- a processor;
a module configured to control the processor to generate an unweighted grammar permitting all letter sequences that map to a received non-speech input;
a module configured to control the processor to select a database of words;
a module configured to control the processor to generate a weighted grammar using the unweighted grammar and a statistical letter model trained on the database of words;
a module configured to control the processor to receive speech from a user associated with the non-speech input after receiving the non-speech input and after generating the weighted grammar; and
a module configured to control the processor to process the received speech and non-speech input using the weighted grammar.
4 Assignments
0 Petitions
Accused Products
Abstract
A system and method for non-speech input or keypad-aided word and spelling recognition is disclosed. The method includes generating an unweighted grammar, selecting a database of words, generating a weighted grammar using the unweighted grammar and a statistical letter model trained on the database of words, receiving speech from a user after receiving the non-speech input and after generating the weighted grammar, and performing automatic speech recognition on the speech and non-speech input using the weighted grammar. If a confidence is below a predetermined level, then the method includes receiving non-speech input from the user, disambiguating possible spellings by generating a letter lattice based on a user input modality, and constraining the letter lattice and generating a new letter string of possible word spellings until a letter string is correctly recognized.
-
Citations
20 Claims
-
1. A system for recognizing a combination of speech and alternate input, the method comprising:
-
a processor; a module configured to control the processor to generate an unweighted grammar permitting all letter sequences that map to a received non-speech input; a module configured to control the processor to select a database of words; a module configured to control the processor to generate a weighted grammar using the unweighted grammar and a statistical letter model trained on the database of words; a module configured to control the processor to receive speech from a user associated with the non-speech input after receiving the non-speech input and after generating the weighted grammar; and a module configured to control the processor to process the received speech and non-speech input using the weighted grammar. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method of recognizing input from a user, the method comprising:
-
receiving input from a user; performing spelling recognition via an automatic speech recognition (ASR) system on the input, the speech recognition being performed using a statistical letter model trained on a database of words; disambiguating possible spellings by generating a letter lattice based on a user input modality; and performing, with each letter received, until a letter string is correctly recognized; constraining the letter lattice; and generating a new letter string of possible word spellings. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A computer-readable storage medium storing instructions for controlling a computing device having a processor to recognize input from a user, the instructions comprising controlling the processor to perform the steps of:
-
generating an unweighted grammar permitting all letter sequences that map to a received non-speech input; selecting a database of words; generating a weighted grammar using the unweighted grammar and a statistical letter model trained on the database of words; receiving speech from a user associated with the non-speech input after receiving the non-speech input and after generating the weighted grammar; performing recognition via automatic speech recognition (ASR) on the received speech and non-speech input using the weighted grammar; and if an ASR confidence is below a predetermined level; disambiguating possible spellings by generating a letter lattice based on a user input modality; and constraining the letter lattice and generating a new letter string of possible word spellings, with each letter received, until a letter string is correctly recognized; - View Dependent Claims (18, 19, 20)
-
Specification