Speech information processing method and apparatus and storage meidum
First Claim
Patent Images
1. A speech information processing method comprising:
- a step of obtaining a duration of a predetermined unit of phonological series based on a duration model for an entire segment;
a step of obtaining a duration of each or phonemes constructing said phonological series based on a duration model for a partial segment;
a setting step of setting a duration of each of said phonemes based on said duration of the phonological series and said duration of each of said phonemes; and
a speech synthesis step of synthesizing speech based on said duration of each of said phonemes set at said setting step.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech information processing apparatus which sets the duration of phonological series with accuracy, and sets a natural phoneme duration in accordance with phonemic/linguistic environment. For this purpose, the duration of predetermined unit of phonological series is obtained based on a duration model for entire segment (S302). Then duration of each of phonemes constructing the phonological series is obtained based on the duration model for the entire segment (S303). Then duration of each phoneme is set based on the duration of the phonological series and the duration of each phoneme (S304).
161 Citations
11 Claims
-
1. A speech information processing method comprising:
-
a step of obtaining a duration of a predetermined unit of phonological series based on a duration model for an entire segment;
a step of obtaining a duration of each or phonemes constructing said phonological series based on a duration model for a partial segment;
a setting step of setting a duration of each of said phonemes based on said duration of the phonological series and said duration of each of said phonemes; and
a speech synthesis step of synthesizing speech based on said duration of each of said phonemes set at said setting step. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A speech information processing apparatus comprising:
-
means for obtaining a duration of a predetermined unit of phonological series based on a duration model for an entire segment;
means for obtaining a duration of each or phonemes constructing said phonological series based on a duration model for a partial segment;
setting means for setting a duration of each of said phonemes based on said duration of the phonological series and said duration of each of said phonemes; and
speech synthesis means for synthesizing speech based on said duration of each of said phonemes set by said setting means. - View Dependent Claims (8, 9, 10, 11)
-
Specification