Multimodal disambiguation of speech recognition
Abstract
The present invention provides a speech recognition system combined with one or more alternate input modalities to ensure efficient and accurate text input. The speech recognition system achieves less than perfect accuracy due to limited processing power, environmental noise, and/or natural variations in speaking style. The alternate input modalities use disambiguation or recognition engines to compensate for reduced keyboards, sloppy input, and/or natural variations in writing style. The ambiguity remaining in the speech recognition process is mostly orthogonal to the ambiguity inherent in the alternate input modality, such that the combination of the two modalities resolves the recognition errors efficiently and accurately. The invention is especially well suited for mobile devices with limited space for keyboards or touch-screen input.
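The orthogonality argument in the abstract can be illustrated with a small sketch. It assumes the non-voice modality is a standard 12-key phone keypad and that the recognizer returns a scored N-best list; the function names, the word list, and the scores are all hypothetical and are not taken from the patent.

```python
# Assumed keypad layout: the common 12-key letter assignment.
KEYPAD = {
    "2": "abc", "3": "def", "4": "ghi", "5": "jkl",
    "6": "mno", "7": "pqrs", "8": "tuv", "9": "wxyz",
}
LETTER_TO_KEY = {ch: key for key, letters in KEYPAD.items() for ch in letters}


def key_sequence(word):
    """Map a word (letters a-z only) to the key presses that would type it."""
    return "".join(LETTER_TO_KEY[ch] for ch in word.lower())


def disambiguate(speech_candidates, keys):
    """Keep speech candidates consistent with the keys pressed so far.

    speech_candidates: list of (word, score) pairs, an assumed N-best list.
    keys: digit string from the keypad (the non-voice input).
    Returns the surviving candidates, best score first.
    """
    matches = [
        (word, score)
        for word, score in speech_candidates
        if key_sequence(word).startswith(keys)
    ]
    return sorted(matches, key=lambda pair: pair[1], reverse=True)


# Made-up N-best list: "meet" and "meat" sound alike, so speech alone
# cannot separate them.
n_best = [("meet", 0.45), ("meat", 0.40), ("beat", 0.15)]

# The keypad alone is ambiguous in a different way: "meat" and "neat"
# share the key sequence 6328.
print(key_sequence("meat"), key_sequence("neat"))

# Combining both inputs leaves a single candidate.
print(disambiguate(n_best, "6328"))
```

Here the homophones are separated by the key presses, while the keypad collision ("meat" versus "neat") is separated by the speech result, which is the sense in which the two ambiguities are mostly orthogonal.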
20 Claims
1. A method for processing language input in a data processing system, comprising the steps of:
receiving a first input comprising a voice input;
determining one or more word candidates according to the first input;
receiving a second input comprising a non-voice input; and
determining one or more word candidates according to the first input and the second input.
11. A machine readable medium having instructions stored therein which, when executed on a data processing system, cause the data processing system to perform a method for processing language input, the method comprising the steps of:
receiving a first input comprising a voice input;
determining one or more word candidates according to the first input;
receiving a second input comprising a non-voice input; and
determining one or more word candidates according to the first input and the second input.
16. A mobile device for processing language input, comprising:
a speech recognition module for processing a first input comprising a voice input; and
one or more second input modules for processing second input comprising a non-voice input;
a processing module coupled to the one or more second input modules and the speech recognition module, the processing module determining a first plurality of word candidates according to the first input and subsequently determining one or more word candidates according to the first input and the second input.
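The module arrangement recited in claim 16 might be pictured roughly as follows. This is only an illustrative sketch: the class name, the callable-based interfaces, the placeholder recognizer, and the two example second-input modules are assumptions made for the example, not structures described in the claims.

```python
from dataclasses import dataclass, field
from typing import Callable, List

# Assumption: each second input module reduces its input to a predicate
# that accepts or rejects a candidate word.
Constraint = Callable[[str], bool]


@dataclass
class ProcessingModule:
    """Hypothetical stand-in for the claimed processing module."""

    recognize: Callable[[bytes], List[str]]  # speech recognition module
    candidates: List[str] = field(default_factory=list)

    def on_voice(self, audio: bytes) -> List[str]:
        # Stage 1: first plurality of candidates from the voice input alone.
        self.candidates = self.recognize(audio)
        return self.candidates

    def on_second_input(self, accepts: Constraint) -> List[str]:
        # Stage 2: candidates re-determined from the voice result plus the
        # non-voice input.
        self.candidates = [w for w in self.candidates if accepts(w)]
        return self.candidates


def fake_recognizer(audio: bytes) -> List[str]:
    # Placeholder: a real recognizer would decode the audio.
    return ["four", "for", "fore", "door"]


def first_letter(letter: str) -> Constraint:
    # Example second input: a first letter, e.g. from handwriting.
    return lambda word: word.startswith(letter)


def word_length(n: int) -> Constraint:
    # Example second input: a letter count, e.g. from key taps.
    return lambda word: len(word) == n


pm = ProcessingModule(recognize=fake_recognizer)
print(pm.on_voice(b""))
print(pm.on_second_input(first_letter("f")))
print(pm.on_second_input(word_length(3)))
```

Modeling each second input as a predicate keeps the processing module independent of any particular modality, so further input modules could be added without changing it.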
Specification