×

Combined speech recognition and text-to-speech generation

  • US 7,577,569 B2
  • Filed: 09/24/2004
  • Issued: 08/18/2009
  • Est. Priority Date: 09/05/2001
  • Status: Active Grant
First Claim
Patent Images

1. A computing device for performing large vocabulary speech recognition comprising:

  • processor readable memory;

    one or more processors capable of executing program instructions read from said memory;

    a microphone or audio input for providing an electronic signal representing an utterance to be recognized;

    a speaker or audio output for enabling an electronic representation of sound produced in said device to be transduced into a corresponding sound;

    programming recorded in the memory including;

    speech recognition programming for performing large vocabulary speech recognition that responds to the electronic representations of a sequence of one or more utterances received from the microphone or audio input by producing a text output corresponding to the one or more words recognized as corresponding to the utterances; and

    TTS programming for providing TTS output to said speaker or audio output saying one or more words of said text recognized by said speech recognition;

    shared speech modeling data stored in said memory that is used by said speech recognition programming to recognize words corresponding to spoken utterances and by said TTS programming to generate sounds corresponding to the speaking of a sequence of one or more; and

    wherein the computing device is capable of responding to text navigation commands by moving a cursor backward and forward in the one or more words of said text output, and responding to each movement in response to one of said text navigation commands by providing a TTS output to said sneaker or audio output saying one or more words either starting or ending with the location of the cursor after each of said movements.

View all claims
  • 8 Assignments
Timeline View
Assignment View
    ×
    ×