×

Systems and methods for automatic repair of speech recognition engine output using a sliding window mechanism

  • US 10,410,622 B2
  • Filed: 07/13/2017
  • Issued: 09/10/2019
  • Est. Priority Date: 07/13/2016
  • Status: Active Grant
First Claim
Patent Images

1. A processor implemented method comprising:

  • receiving a text output from a general purpose automatic speech recognition (GP-ASR) engine;

    identifying in real-time, a current environment of a speaker associated with the text output based on one or more environment variables; and

    automatically identifying one or more erroneously recognized terms in the text output and selectively correcting the output of the GP-ASR by replacing the one or more erroneously recognized terms in the text output with one or more best-fit terms by performing multi-stage correction steps on the received text output, using a pre-determined threshold for a fitness function, wherein the pre-determined threshold is a function of environment and is determined based on the identified current environment, wherein the fitness function is a function to determine candidate terms for replacement of the one or more erroneously recognized terms in the text output, and wherein the multi-stage correction steps comprise;

    a first stage of domain ontology based correction, wherein at least one of a phonetic-level match and an edit-distance-level-match is performed to obtain a matching stage output, wherein the matching stage output comprises matching of a first set of candidate terms from domain ontology with the text output based on a sliding window mechanism which consists of a predetermined number of words of the text output;

    a second stage of contextual correction of the matching stage output, wherein at least one of the contextual phonetic-level match and the edit-distance-level-match is performed to obtain a mapping stage output, mapping the matching stage output with a second set of candidate terms from the domain ontology based on a sliding window mechanism; and

    a third stage of linguistic correction of the mapping stage output, wherein semantic and linguistic repair rules based on the identified current environment are applied on the mapping stage output to obtain a domain specific repaired output corresponding to the output of the GP-ASR.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×