×

Training acoustic models using connectionist temporal classification

  • US 10,803,855 B1
  • Filed: 01/25/2019
  • Issued: 10/13/2020
  • Est. Priority Date: 12/31/2015
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • accessing, by data processing hardware, acoustic model training data comprising training audio data and word-level transcriptions for the training audio data;

    flat start training, by data processing hardware, a first connectionist temporal classification (CTC) acoustic model on the acoustic model training data to generate phonetic sequences corresponding to the word-level transcriptions, the first CTC acoustic model trained without using any previously determined fixed alignment targets between the training audio data and the word-level transcriptions;

    generating, by the data processing hardware using the trained first CTC acoustic model, a context-dependent state inventory from approximate phonetic alignments between the training audio data and the phonetic sequences corresponding to the word-level transcriptions; and

    training, by the data processing hardware, a second CTC acoustic model using the context-dependent state inventory to generate outputs corresponding to one or more context-dependent states.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×