VARIOUS APPARATUS AND METHODS FOR A SPEECH RECOGNITION SYSTEM

  • US 20100223056A1
  • Filed: 02/27/2009
  • Published: 09/02/2010
  • Est. Priority Date: 02/27/2009
  • Status: Active Grant
  • ×
    • Pin Icon | RPX Insight
    • Pin
First Claim
Patent Images

1. A continuous speech recognition engine, comprisingfront-end filters and sound data parsers configured to convert a supplied audio file of a continuous voice communication into a time coded sequence of sound feature frames for speech recognition;

  • a fine speech recognizer model having an input to receive the time coded sequence of sound feature frames from the front-end filters as an input, where the fine speech recognizer model applies a speech recognition process to the sound feature frames and determines at least a best guess at each recognizable word that corresponds to the sound feature frames;

    a coarse sound representation generator having an input to receive both

         1) a start and stop times for a time segment associated with the recognized word from the fine model speech recognizer and

         2) a series of identified individual phonemes from a phoneme decoder as inputs, where the coarse sound representation generator outputs the series of identified individual phonemes occurring within the duration of the start and stop times of the recognized word as a coarse sound representation of the recognized word;

    a coarse match generator having an input to receive the coarse sound representation of the recognized word from the coarse sound representation generator as an input as well as the recognized word from the fine model speech recognizer, wherein the coarse match generator then determines a likelihood of the coarse sound representation actually being the recognized word based on comparing the coarse sound representation of the recognized word to a database containing the known sound of that recognized word, where the coarse match generator assigns the likelihood as a robust confidence level parameter to that recognized word from the fine speech recognition model and includes the start and stop time codes of the recognized word from the common time line with the supplied audio file, wherein each word in the supplied audio file is stored in a memory with a robust confidence level parameter and the start and stop time codes from the common time line; and

    a user interface configured to allow speech data analytics on each word in the supplied audio file of continuous voice communications stored in the memory based on the robust confidence level parameter.

View all claims
    ×
    ×

    Thank you for your feedback

    ×
    ×