×

Speech processing device and method

  • US 9,672,809 B2
  • Filed: 04/24/2014
  • Issued: 06/06/2017
  • Est. Priority Date: 06/17/2013
  • Status: Active Grant
First Claim
Patent Images

1. A speech processing device comprising:

  • a processor; and

    a memory which stores a plurality of instructions, which when executed by the processor, cause the processor to execute;

    obtaining input speech, the input speech including a plurality of vowel segments and a plurality of consonant segments,detecting the vowel segments contained in the input speech,estimating a stress segment among the plurality of vowel segments by comparing pitch variation rate or power variation rate per unit time of the plurality of vowel segments, respectively, the stress segment being a segment that has a strong trend of decrease in the pitch variation rate or the power variation rate per unit time,detecting sound lengths of each of the plurality of vowel segments,transforming the input speech so that a first sound length becomes longer than each of second sound lengths when the input speech includes at least one of the second sound lengths that is longer than the first sound length, the first sound length being a sound length of a vowel segment containing the stress segment, the second sound lengths being sound lengths of vowel segments excluding the stress segment, the transforming including extending the first sound length or shortening at least one of the second sound lengths, the first sound length being extended by inserting a part of segment obtained based on the vowel segment containing the stress segment into the vowel segment containing the stress segment, the at least one of the second sound lengths being shortened by deleting a part of segment from the at least one of the second sound lengths, a length to be inserted or to be shortened being determined based on the detected first sound length and the detected second sound length and a prescribed target scaling factor, andoutputting the transformed input speech in which the first sound length is extended or in which the at least one of the second sound lengths is shortened.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×