STREAMING ENCODER, PROSODY INFORMATION ENCODING DEVICE, PROSODY-ANALYZING DEVICE, AND DEVICE AND METHOD FOR SPEECH SYNTHESIZING
First Claim
1. A speech-synthesizing device, comprising:
- a hierarchical prosodic module generating at least a first hierarchical prosodic model;
a prosody-analyzing device, receiving a low-level linguistic feature, a high-level linguistic feature and a first prosodic feature, and generating at least a prosodic tag based on the low-level linguistic feature, the high-level linguistic feature, the first prosodic feature and the first hierarchical prosodic model; and
a prosody-synthesizing unit synthesizing a second prosodic feature based on the hierarchical prosodic module, the low-level linguistic feature and the prosodic tag.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech-synthesizing device includes a hierarchical prosodic module, a prosody-analyzing device, and a prosody-synthesizing unit. The hierarchical prosodic module generates at least a first hierarchical prosodic model. The prosody-analyzing device receives a low-level linguistic feature, a high-level linguistic feature and a first prosodic feature, and generates at least a prosodic tag based on the low-level linguistic feature, the high-level linguistic feature, the first prosodic feature and the first hierarchical prosodic model. The prosody-synthesizing unit synthesizes a second prosodic feature based on the hierarchical prosodic module, the low-level linguistic feature and the prosodic tag.
15 Citations
20 Claims
-
1. A speech-synthesizing device, comprising:
-
a hierarchical prosodic module generating at least a first hierarchical prosodic model; a prosody-analyzing device, receiving a low-level linguistic feature, a high-level linguistic feature and a first prosodic feature, and generating at least a prosodic tag based on the low-level linguistic feature, the high-level linguistic feature, the first prosodic feature and the first hierarchical prosodic model; and a prosody-synthesizing unit synthesizing a second prosodic feature based on the hierarchical prosodic module, the low-level linguistic feature and the prosodic tag. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A prosodic information encoding apparatus, comprising:
-
a speech segmentation and prosodic feature extracting device receiving a speech input and a low-level linguistic feature to generate a first prosodic feature; a prosodic structure analysis unit receiving the first prosodic feature, the low-level linguistic feature and a high-level linguistic feature, and generating a prosodic tag based on the first prosodic feature, the low-level linguistic feature and the high-level linguistic feature; and an encoder receiving the prosodic tag and the low-level linguistic feature to generate a code stream.
-
-
11. A code stream generating apparatus, comprising:
-
a prosodic feature extractor generating a first prosodic feature; a hierarchical prosodic module providing a prosodic structure meaning for the first prosodic feature; and an encoder generating a code stream based on the first prosodic feature having the prosodic structure meaning, wherein the hierarchical prosodic module has at least two parameters being ones selected from the group consisting of a syllable duration, a pitch contour, a pause timing, a pause frequency, a pause duration and a combination thereof.
-
-
12. A method for synthesizing a speech, comprising steps of:
-
providing a hierarchical prosodic module, a low-level linguistic feature, a high-level linguistic feature and a first prosodic feature; generating at least a prosodic tag based on the low-level linguistic feature, the high-level linguistic feature, the first prosodic feature and the hierarchical prosodic module; and outputting the speech according to the prosodic tag. - View Dependent Claims (13)
-
-
14. A prosodic structure analysis unit, comprising:
-
a first input terminal receiving a first prosodic feature; a second input terminal receiving a low-level linguistic feature; a third input terminal receiving a high-level linguistic feature; and an output terminal, wherein the prosodic structure analysis unit generates a prosodic tag at the output terminal based on the first prosodic feature, the low-level and the high-level linguistic features.
-
-
15. A speech-synthesizing device, comprising:
-
a decoder receiving a code stream and restoring the code stream to generate a low-level linguistic feature and a prosodic tag; a hierarchical prosodic module receiving the low-level linguistic feature and the prosodic tag to generate a second prosodic feature; and a speech synthesizer generating a synthesized speech based on the low-level linguistic feature and the second prosodic feature.
-
-
16. A prosodic structure analysis apparatus, comprising:
-
a hierarchical prosodic module generating a hierarchical prosodic model; and a prosodic structure analysis unit receiving a first prosodic feature, a low-level linguistic feature and a high-level linguistic feature, and generating a prosodic tag based on the first prosodic feature, the low-level and the high-level linguistic features and the hierarchical prosodic model. - View Dependent Claims (17, 18, 19, 20)
-
Specification