
Method and apparatus for improving spontaneous speech recognition performance

  • US 10,388,275 B2
  • Filed: 09/07/2017
  • Issued: 08/20/2019
  • Est. Priority Date: 02/27/2017
  • Status: Active Grant
First Claim

1. An apparatus for improving spontaneous speech recognition performance, the apparatus comprising a computer including a processor and memory, the processor comprising:

    a frequency transformer that divides a voice signal into frames and applies a discrete Fourier transform (DFT) to transform the voice signal from the time domain to the frequency domain;

    a magnitude feature extractor that extracts a magnitude feature from a magnitude of the voice signal transformed to the frequency domain;

    a phase feature extractor that extracts a phase feature from a phase of the voice signal transformed to the frequency domain;

    a syllabic nucleus detector that detects a syllabic nucleus by using the magnitude feature and the phase feature as an input of a deep neural network;

    a voice detector that detects a voice section and a non-voice section from the voice signal;

    a speaking rate determiner that determines a speaking rate by using the detected syllabic nucleus and an interval of the detected voice section;

    a calculator that calculates a degree of time scale modification by using the speaking rate; and

    a time scale modifier that converts a voice into a length appropriate for an acoustic model by using the degree of time scale modification, and the deep neural network of the computer detects a syllabic nucleus from the syllabic nucleus detector and outputs a phoneme classification item as a multi-frame output.
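
The signal-processing steps recited in the claim can be illustrated with a minimal sketch. This is not the patented implementation. The claim does not specify a frame length, hop size, window, or the formula that maps speaking rate to a degree of time scale modification, so every function name, parameter, and default below (including `target_rate`, `lo`, `hi`, and the ratio-based mapping) is a hypothetical choice made only for illustration. The deep-neural-network syllabic nucleus detector, the voice detector, and the time scale modifier itself are left out; their outputs are assumed to be available as a nucleus count and a voiced duration.

```python
import cmath
import math


def frame_signal(signal, frame_len, hop):
    """Split a sequence of samples into overlapping frames.

    A trailing partial frame is dropped (assumption; the claim does not say).
    """
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, hop)]


def dft(frame):
    """Naive O(N^2) discrete Fourier transform of one frame."""
    n = len(frame)
    return [sum(x * cmath.exp(-2j * math.pi * k * t / n)
                for t, x in enumerate(frame))
            for k in range(n)]


def magnitude_and_phase(spectrum):
    """Return per-bin magnitude and phase (radians) of a complex spectrum."""
    return [abs(c) for c in spectrum], [cmath.phase(c) for c in spectrum]


def speaking_rate(num_nuclei, voiced_seconds):
    """Syllabic nuclei per second of detected voice."""
    if voiced_seconds <= 0:
        raise ValueError("voiced_seconds must be positive")
    return num_nuclei / voiced_seconds


def time_scale_factor(rate, target_rate=4.0, lo=0.5, hi=2.0):
    """Hypothetical mapping from speaking rate to a stretch factor.

    factor = rate / target_rate, clamped to [lo, hi]. A factor above 1
    would lengthen fast speech; a factor below 1 would shorten slow speech.
    """
    return max(lo, min(hi, rate / target_rate))
```

As a usage illustration under the same assumptions, `speaking_rate(12, 2.0)` gives 6.0 nuclei per second, and `time_scale_factor(6.0)` then gives a stretch factor of 1.5 relative to the assumed target rate of 4.0. In practice the magnitude and phase lists produced per frame would be turned into features and fed to the nucleus detector, which this sketch does not attempt to model.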
