Multimodal disambiguation of speech recognition

  • US 7,881,936 B2
  • Filed: 06/01/2005
  • Issued: 02/01/2011
  • Est. Priority Date: 12/04/1998
  • Status: Active Grant
  • ×
    • Pin Icon | RPX Insight
    • Pin
First Claim
Patent Images

1. A computer-implemented method for processing a user'"'"'s speech using a mobile computer that includes a microphone, a display, and a reduced-character keypad, the method comprising operations of:

  • the computer receiving user speech via the microphone, the speech comprising a series of spoken words;

    the computer performing speech recognition upon the speech to compute an original N-best list of words for each discrete utterance of the speech;

    the computer operating the display to present a proposed sequence of multiple words, each word comprising;

    for each given one of the discrete utterances, a best word of the N-best list for said discrete utterance;

    the computer receiving and processing user entered correction to at least a given one of the displayed best words of the proposed sequence of words, comprising operations of;

    in response to the computer receiving user selection of the given word from the proposed sequence of words, the computer presenting a list of alternate hypothesis including others of the N-best list of words for the selected word;

    the computer receiving user input from the keypad spelling a desired word, where said user input is inherently ambiguous because the keypad includes multiple letters on some or all keys;

    responsive to receiving the user input, preparing a revised N-best list by limiting entries of the N-best list of words to words that are spelled by the user input from the keyboard;

    where the revised N-best list is further computed considering context and grammar of the selected word in conjunction with any words of the proposed sequence of words that the user has previously accepted or corrected;

    receiving user choice of a word from the revised N-best list in correction of the selected word;

    the computer updating the proposed sequence of words to incorporate the user entered correction; and

    the computer operating the display to present the updated proposed sequence of words.

View all claims
    ×
    ×

    Thank you for your feedback

    ×
    ×