×

Dynamic adaptation of language models and semantic tracking for automatic speech recognition

  • US 9,858,923 B2
  • Filed: 09/24/2015
  • Issued: 01/02/2018
  • Est. Priority Date: 09/24/2015
  • Status: Active Grant
First Claim
Patent Images

1. A system for recognizing phrases of speech from a conversation, said system comprising:

  • an information gathering circuit to collect textual information associated with a user using general knowledge sources in combination with user textual information sources including electronic documents, emails, text messages and social media communications;

    a text clustering circuit to analyze said collected textual information and organize that information into clusters;

    a knowledge domain generation circuit to generate domains of knowledge based upon each cluster and to map those domains to a plurality of personalized language models (PLM) in an offline mode prior to automatic speech recognition (ASR) operation to generate said plurality of PLMs;

    a first ASR circuit to initially transcribe speech, of a user of said system, to a first estimated text sequence, based on a generalized language model;

    a language model matching circuit to analyze said first estimated text sequence to determine a context and to select a PLM, from said plurality of PLMs, based on said context;

    a second ASR circuit to re-transcribe said speech based on said selected PLM to generate a lattice of paths of estimated text sequences, wherein each of said paths of estimated text sequences comprise one or more words and an acoustic score associated with each of said words; and

    a semantic analysis circuit to select one of said paths of estimated text sequences, from said lattice, as a currently recognized phrase of speech from said conversation, wherein the semantic analysis circuit includes;

    a semantic distance calculation circuit to estimate a semantic distance between each of said paths of estimated text sequences to one or more previously recognized phrases of speech from said conversation, anda condition random field (CRF) classifier circuit torank each of said paths of estimated text sequences based on contextual relationships between said words in said paths, andwherein the semantic analysis circuit selects one of said paths of estimated text sequences, from said lattice, as a currently recognized phrase of speech from said conversation, based on said semantic distance and subsequent said CRF ranking.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×