Multimodal disambiguation of speech recognition

  • US 9,786,273 B2
  • Filed: 11/14/2013
  • Issued: 10/10/2017
  • Est. Priority Date: 06/02/2004
  • Status: Active Grant
First Claim

1. A computer-implemented method comprising:

  • receiving, by a mobile device, a voice input;

    displaying, by the mobile device at a text insertion point of a touch screen display, a most likely interpretation of the voice input, the most likely interpretation resulting from a speech recognition process;

    receiving, by the mobile device on the touch screen display, a first non-voice input that selects said most likely interpretation;

    responsive to the first non-voice input, displaying for selection, by the mobile device on the touch screen display, two or more word candidates that are ordered by phonemic similarity to the most likely interpretation, wherein the most likely interpretation and the two or more word candidates are displayed in a single window, and wherein selection of the two or more word candidates from a list of known words is based at least in part on a confusability matrix that considers error frequency of one or more phonemes included in the most likely interpretation and positional context of the one or more phonemes within the most likely interpretation;

    receiving, by the mobile device, a second non-voice input that represents a selection of an intended word candidate from among said two or more word candidates; and

    automatically replacing, by the mobile device, the most likely interpretation with the intended word candidate at the text insertion point.
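
The candidate-selection and replacement steps recited above can be illustrated with a minimal Python sketch. It is not the patented implementation. Everything in it is an assumption made for illustration: the toy `LEXICON`, the ARPAbet-like phoneme labels, the `CONFUSABILITY` values, the three position classes (initial, medial, final), the restriction to candidates of equal phoneme length, and all function names.

```python
from typing import Dict, List, Tuple

# Hypothetical toy lexicon: known word -> phoneme sequence (ARPAbet-like labels).
LEXICON: Dict[str, List[str]] = {
    "bat": ["b", "ae", "t"],
    "pat": ["p", "ae", "t"],
    "bad": ["b", "ae", "d"],
    "bet": ["b", "eh", "t"],
    "cat": ["k", "ae", "t"],
    "dog": ["d", "ao", "g"],
}

# Hypothetical confusability matrix:
# (recognized phoneme, position in word, intended phoneme) -> error frequency.
# The numbers are made up for illustration only.
CONFUSABILITY: Dict[Tuple[str, str, str], float] = {
    ("b", "initial", "p"): 0.30,
    ("t", "final", "d"): 0.20,
    ("ae", "medial", "eh"): 0.10,
    ("b", "initial", "k"): 0.02,
}


def position(index: int, length: int) -> str:
    """Classify a phoneme's positional context within the word."""
    if index == 0:
        return "initial"
    if index == length - 1:
        return "final"
    return "medial"


def similarity(recognized: List[str], candidate: List[str]) -> float:
    """Score a candidate by multiplying per-phoneme confusion frequencies.

    Simplifying assumption: only candidates with the same number of
    phonemes are compared; anything else scores 0.0.
    """
    if len(recognized) != len(candidate):
        return 0.0
    score = 1.0
    for i, (r, c) in enumerate(zip(recognized, candidate)):
        if r != c:
            key = (r, position(i, len(recognized)), c)
            score *= CONFUSABILITY.get(key, 0.0)
    return score


def rank_candidates(best: str, top_n: int = 3) -> List[str]:
    """Return known words ordered by similarity to the recognizer's best guess."""
    recognized = LEXICON[best]
    scored = [
        (similarity(recognized, phonemes), word)
        for word, phonemes in LEXICON.items()
        if word != best
    ]
    scored = [(s, w) for s, w in scored if s > 0.0]
    # Highest score first; ties broken alphabetically for determinism.
    scored.sort(key=lambda sw: (-sw[0], sw[1]))
    return [word for _, word in scored[:top_n]]


def replace_at_insertion_point(text: str, start: int, old: str, new: str) -> str:
    """Replace the displayed interpretation with the selected candidate."""
    if text[start:start + len(old)] != old:
        raise ValueError("interpretation not found at insertion point")
    return text[:start] + new + text[start + len(old):]
```

In this sketch, `rank_candidates("bat")` stands in for the list shown after the first non-voice input, and `replace_at_insertion_point` stands in for the final replacement step once the user picks a candidate. A fuller treatment would likely align phoneme sequences of different lengths rather than skipping them.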
