×

Constructing speech decoding network for numeric speech recognition

  • US 10,699,699 B2
  • Filed: 05/30/2018
  • Issued: 06/30/2020
  • Est. Priority Date: 03/29/2016
  • Status: Active Grant
First Claim
Patent Images

1. A method for constructing a speech decoding network for recognizing digits in speech, comprising:

  • acquiring primary training data comprising a plurality of speech segments, and each speech segment comprising a plurality of digits;

    performing acoustic feature extraction on the primary training data to obtain a plurality of feature sequences from the plurality of speech segments;

    performing progressive training to obtain a tri-phoneme acoustic model based on the plurality of feature sequences and a plurality of phonemes corresponding to the digits in the speech segments in the primary training data, including;

    obtaining a mono-phoneme acoustic model by training a model with the plurality of feature sequences according to divided states of a plurality of mono-phonemes corresponding to the digits in the plurality of speech segments;

    decoding the primary training data with the mono-phoneme acoustic model to obtain secondary training data;

    obtaining the tri-phoneme acoustic model by training a model with a plurality of feature sequences in the secondary training data according to divided states of a plurality of tri-phonemes corresponding to digits in a plurality of speech segments in the secondary training data;

    acquiring a language model by modeling matching relations of the plurality of digits in the primary training data; and

    constructing a speech decoding network by using the language model and the tri-phoneme acoustic model obtained by training.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×