Segment information generation device, speech synthesis device, speech synthesis method, and speech synthesis program
Abstract
A segment information generation device includes a waveform cutout unit that cuts out a speech waveform from natural speech at a time period not depending on a pitch frequency of the natural speech. A feature parameter extraction unit extracts a feature parameter of a speech waveform from the speech waveform cut out by the waveform cutout unit. A time domain waveform generation unit generates a time domain waveform based on the feature parameter.
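The three units named in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not say which feature parameter is used, so an amplitude spectrum is assumed here, and the time domain waveform is assumed to be its zero-phase inverse DFT. The function names, the Hann window, and the `period` and `length` parameters (in samples) are all hypothetical.

```python
import cmath
import math


def cut_frames(signal, period, length):
    """Cut frames at a fixed period in samples, without reference to pitch.

    Assumption: each frame is Hann-windowed; the source does not name a window.
    """
    frames = []
    for start in range(0, len(signal) - length + 1, period):
        frame = [
            signal[start + n] * 0.5 * (1.0 - math.cos(2.0 * math.pi * n / (length - 1)))
            for n in range(length)
        ]
        frames.append(frame)
    return frames


def amplitude_spectrum(frame):
    """Assumed feature parameter: magnitude of the DFT of one frame."""
    n = len(frame)
    return [
        abs(sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)))
        for k in range(n)
    ]


def time_domain_waveform(amplitude):
    """Assumed generation step: inverse DFT of the amplitude with zero phase."""
    n = len(amplitude)
    return [
        sum(amplitude[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)).real / n
        for t in range(n)
    ]


# Usage: a synthetic tone stands in for natural speech.
speech = [math.sin(2.0 * math.pi * 5.0 * t / 100.0) for t in range(400)]
frames = cut_frames(speech, period=80, length=64)
segments = [time_domain_waveform(amplitude_spectrum(f)) for f in frames]
```

Because the cutout period is a plain sample count, no pitch estimate is needed before the frames are cut, which is the property the abstract emphasizes.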
12 Claims
1. A segment information generation device comprising:
a waveform cutout unit implemented at least by hardware including a processor that cuts out a speech waveform from natural speech at a time period not depending on a pitch frequency of the natural speech, continuously;
a feature parameter extraction unit implemented at least by hardware including a processor that extracts a feature parameter of a speech waveform from the speech waveform cut out by the waveform cutout unit;
a time domain waveform generation unit implemented at least by hardware including a processor that generates a time domain waveform based on the feature parameter;
a spectrum shape change degree estimation unit implemented at least by hardware including a processor that estimates a degree of change in spectrum shape indicating a degree of change in spectrum shape of natural speech; and
a period control unit implemented at least by hardware including a processor that determines a time period to cut out a speech waveform from the natural speech based on the degree of change in spectrum shape.
5. A speech synthesis device comprising:
a waveform cutout unit implemented at least by hardware including a processor that cuts out a speech waveform from natural speech at a time period not depending on a pitch frequency of the natural speech, continuously;
a feature parameter extraction unit implemented at least by hardware including a processor that extracts a feature parameter of a speech waveform from the speech waveform cut out by the waveform cutout unit;
a time domain waveform generation unit implemented at least by hardware including a processor that generates a time domain waveform based on the feature parameter;
a segment information storage unit implemented by a storage device that stores segment information indicating a segment and containing the time domain waveform;
a segment information selection unit implemented at least by hardware including a processor that selects segment information corresponding to an input character string;
a waveform generation unit implemented at least by hardware including a processor and that generates a speech synthesis waveform by use of the segment information selected by the segment information selection unit;
a spectrum shape change degree estimation unit implemented at least by hardware including a processor that estimates a degree of change in spectrum shape indicating a degree of change in spectrum shape of natural speech; and
a period control unit implemented at least by hardware including a processor that determines a time period to cut out a speech waveform from the natural speech based on the degree of change in spectrum shape.
6. A segment information generating method, implemented by a processor, comprising:
cutting out a speech waveform from natural speech at a time period not depending on a pitch frequency of the natural speech, continuously;
extracting a feature parameter of the speech waveform from the speech waveform;
generating a time domain waveform based on the feature parameter;
estimating a degree of change in spectrum shape indicating a degree of change in spectrum shape of natural speech; and
determining a time period to cut out a speech waveform from the natural speech based on the degree of change in spectrum shape.
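The last two steps of the method, estimating the degree of change in spectrum shape and determining the cutout period from it, might be sketched as follows. The claims do not define the change measure or the mapping, so both are assumptions made only for illustration: the measure is the mean absolute difference between energy-normalized log spectra of two adjacent frames, and the mapping shortens the period linearly as the change grows. The names `spectral_change` and `choose_period` and the parameters `min_period`, `max_period`, and `scale` are hypothetical.

```python
import math


def spectral_change(spec_a, spec_b, eps=1e-12):
    """Assumed change measure between two amplitude spectra of equal length.

    Each spectrum is normalized by its sum so that a pure gain change
    does not count as a change in spectrum shape.
    """
    def normalise(spec):
        total = sum(spec) + eps
        return [math.log(s / total + eps) for s in spec]

    a, b = normalise(spec_a), normalise(spec_b)
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def choose_period(change, min_period=40, max_period=160, scale=1.0):
    """Assumed period control: a larger change gives a shorter period (samples)."""
    weight = min(max(change / scale, 0.0), 1.0)  # clamp to [0, 1]
    return round(max_period - weight * (max_period - min_period))


# Usage: identical shape at different gain keeps the longest period,
# while a different shape shortens it.
steady = choose_period(spectral_change([1.0, 2.0, 3.0], [2.0, 4.0, 6.0]))
moving = choose_period(spectral_change([1.0, 2.0, 3.0], [3.0, 2.0, 1.0]))
```

Under these assumptions, stretches of speech with a stable spectrum are cut out sparsely and rapidly changing stretches densely. The direction of that trade-off is a design choice of this sketch, not something the claim language fixes.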
Specification