Speech recognition process

  • US 8,775,177 B1
  • Filed: 10/31/2012
  • Issued: 07/08/2014
  • Est. Priority Date: 03/08/2012
  • Status: Active Grant
First Claim

1. A method performed by one or more processing devices, comprising:

  • performing a preliminary recognition process on first audio, the preliminary recognition process comprising:

    identifying one or more candidates for the first audio;

    determining a plurality of path costs for the identified candidates, the plurality of path costs corresponding to sequences of sub-phonemes identified in the first audio;

    determining a best path cost for each of the identified candidates based on the plurality of path costs;

    associating the best path costs with the identified candidates; and

    providing the identified candidates and associated best path costs;

  • generating first templates corresponding to the first audio, each first template comprising a number of elements corresponding to a sequence of sub-phonemes of the first audio;

  • selecting second templates corresponding to the identified candidates, the second templates representing second audio, each second template comprising elements that correspond to the elements in the first templates;

  • comparing the first templates to the second templates, wherein comparing comprises determining similarity metrics between the first templates and corresponding second templates, wherein the similarity metrics are based on exponentiated and scaled dynamic time warping (DTW) distances between the selected ones of the first templates and selected ones of the second templates;

  • applying weights to the similarity metrics to produce weighted similarity metrics, the weights being associated with corresponding second templates;

  • applying the weighted similarity metrics to corresponding best path costs to produce re-scored path costs, the re-scored path costs being associated with corresponding identified candidates; and

  • using the re-scored path costs to determine which of the identified candidates corresponds to the first audio.
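
The re-scoring steps recited in the claim can be illustrated with a minimal sketch. This is not the patented implementation. The claim does not give formulas, so the following are assumptions made only for illustration: the similarity metric is taken as exp(-DTW / scale), a lower path cost is treated as better, the weighted similarity is subtracted from the best path cost, and there is a single first template compared against one second template per candidate. All function and parameter names (dtw_distance, similarity, rescore, pick_candidate, scale) are hypothetical.

```python
import math


def dtw_distance(a, b):
    """Textbook dynamic time warping distance between two sequences of
    feature vectors (each element is a tuple of floats)."""
    n, m = len(a), len(b)
    inf = float("inf")
    # d[i][j] = cost of the best alignment of a[:i] with b[:j]
    d = [[inf] * (m + 1) for _ in range(n + 1)]
    d[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = math.dist(a[i - 1], b[j - 1])  # Euclidean distance
            d[i][j] = local + min(d[i - 1][j], d[i][j - 1], d[i - 1][j - 1])
    return d[n][m]


def similarity(first_template, second_template, scale=1.0):
    """Assumed form of an 'exponentiated and scaled' DTW distance:
    1.0 for a perfect match, approaching 0.0 as the distance grows."""
    return math.exp(-dtw_distance(first_template, second_template) / scale)


def rescore(best_path_costs, first_template, second_templates, weights, scale=1.0):
    """Apply weighted similarities to the best path costs.
    Assumption: costs are 'lower is better', so similarity reduces the cost."""
    rescored = {}
    for candidate, cost in best_path_costs.items():
        sim = similarity(first_template, second_templates[candidate], scale)
        rescored[candidate] = cost - weights[candidate] * sim
    return rescored


def pick_candidate(rescored):
    """Select the candidate with the lowest re-scored path cost."""
    return min(rescored, key=rescored.get)
```

In this sketch, a candidate whose stored template aligns closely with the incoming audio template can overtake a candidate that had a slightly better preliminary path cost, which is the effect the re-scoring steps describe.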
