Training acoustic models using distributed computing techniques

  • US 8,959,014 B2
  • Filed: 06/29/2012
  • Issued: 02/17/2015
  • Est. Priority Date: 06/30/2011
  • Status: Active Grant
First Claim

1. A system comprising:

  • one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising:

    receiving speech data and data identifying a transcription for the speech data;

    accessing a phonetic representation for the transcription;

    extracting training sequences from the phonetic representation for a particular phone in the phonetic representation, the training sequences comprising two or more training sequences that include (i) a particular sequence of multiple phones and (ii) a different number of contextual phones surrounding the particular phone;

    identifying a partitioning key for the training sequences based on the particular sequence of multiple phones that occurs in the two or more training sequences;

    selecting, from among a plurality of processing modules, a processing module to which the identified partitioning key is assigned, the processing module being designated to train a portion of an acoustic model that corresponds to the identified partitioning key; and

    transmitting, to the selected processing module, (i) data identifying the training sequences and (ii) a portion of the speech data that corresponds to the training sequence that includes the most contextual phones.
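
The operations recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. Several things here are assumptions made only for illustration: the function names, the `<pad>` boundary token, the choice of the central triphone as the "particular sequence of multiple phones", the hash-modulo rule for assigning a partitioning key to a processing module, and the per-phone frame lists standing in for speech data. The claim itself requires only that the key be based on the shared phone sequence and that the key be assigned to some processing module.

```python
import hashlib

PAD = "<pad>"  # hypothetical boundary token, not taken from the patent


def extract_training_sequences(phones, index, max_context=2):
    """Return sequences centred on phones[index] with 1..max_context
    contextual phones on each side. Every sequence contains the same
    central triphone but a different amount of surrounding context."""
    padded = [PAD] * max_context + list(phones) + [PAD] * max_context
    centre = index + max_context
    return [tuple(padded[centre - k:centre + k + 1])
            for k in range(1, max_context + 1)]


def partitioning_key(sequences):
    """Assumed key: the shortest sequence (the central triphone), which
    occurs inside every longer training sequence."""
    return "-".join(min(sequences, key=len))


def select_module(key, num_modules):
    """Assumed assignment rule: a stable hash of the key, modulo the
    number of processing modules, so equal keys reach the same module."""
    digest = hashlib.md5(key.encode("utf-8")).hexdigest()
    return int(digest, 16) % num_modules


def build_message(phones, frames_per_phone, index, max_context=2):
    """Bundle (i) the training sequences and (ii) the speech frames that
    span the sequence with the most contextual phones."""
    sequences = extract_training_sequences(phones, index, max_context)
    lo = max(0, index - max_context)
    hi = min(len(phones), index + max_context + 1)
    speech = [f for frames in frames_per_phone[lo:hi] for f in frames]
    return {"key": partitioning_key(sequences),
            "sequences": sequences,
            "speech": speech}


if __name__ == "__main__":
    phones = ["ae", "k", "sh", "ih", "n"]
    frames = [[0, 1], [2], [3, 4], [5], [6, 7]]  # made-up frame indices
    message = build_message(phones, frames, index=2)
    module = select_module(message["key"], num_modules=4)
    print(module, message)
```

Only the speech frames for the longest sequence are sent because, in this sketch, the shorter sequences are nested inside it, so their frames are already covered by that single span.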
