×

Training speech recognition systems using word sequences

  • US 10,388,272 B1
  • Filed: 12/04/2018
  • Issued: 08/20/2019
  • Est. Priority Date: 12/04/2018
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • obtaining first audio data of a communication session between a first device of a first user and a second device of a second user, the communication session configured for verbal communication;

    obtaining, during the communication session, a text string that is a transcription of the first audio data from an automatic transcription system;

    selecting, during the communication session, a contiguous sequence of words from the text string as a first word sequence;

    comparing, during the communication session, the first word sequence to a plurality of word sequences obtained before the communication session, each of the plurality of word sequences associated with a corresponding one of a plurality of counters;

    in response to the first word sequence corresponding to one of the plurality of word sequences based on the comparison, incrementing, during the communication session, a counter of the plurality of counters associated with the one of the plurality of word sequences;

    after incrementing the counter of the plurality of counters, deleting the text string and the first word sequence, wherein the first word sequence is deleted during the communication session; and

    training, after deleting the text string and the first word sequence, a language model of the automatic transcription system using the plurality of word sequences and the plurality of counters.

View all claims
  • 8 Assignments
Timeline View
Assignment View
    ×
    ×