Unit selection module and method for Chinese text-to-speech synthesis
First Claim
1. A Chinese Text-To-Speech (TTS) synthesis system comprising:
- a word pre-processing module, a unit selection module, a speech generation module, and a corpus;
characterized in that;
said unit selection module comprises;
a probabilistic context free grammar (PCFG) parser, a latent semantic indexing (LSI) module, and a modified variable-length unit selection scheme;
said PCFG parser parses a Chinese sentence to obtain the CFG of said Chinese sentence as its target unit;
said LSI module estimates the structural distance between the candidate synthesis units and the target unit in said corpus; and
through said modified variable-length unit selection scheme, tagged with a dynamic program algorithm, the units are searched to find the best synthesis unit concatenation sequence of said Chinese sentence.
1 Assignment
0 Petitions
Accused Products
Abstract
This invention relates to a unit selection module for Chinese Text-to-Speech (TTS) synthesis, mainly comprising a probabilistic context free grammar (PCFG) parser, a latent semantic indexing (LSI) module, and a modified variable-length unit selection scheme; any Chinese sentence is firstly input and then parsed into a context-free grammar (CFG) by the PCFG parser; wherein there are several possible CFGs for every Chinese sentence, and the CFG (or the syntactic structure) with the highest probability is then taken as the best CFG (or the syntactic structure) of the Chinese sentence; the LSI module is then used to calculate the structural distance between all the candidate synthesis units and the target unit in a corpus; through the modified variable-length unit selection scheme, tagged with the dynamic programming algorithm, the units are searched to find the best synthesis unit concatenation sequence.
-
Citations
17 Claims
-
1. A Chinese Text-To-Speech (TTS) synthesis system comprising:
-
a word pre-processing module, a unit selection module, a speech generation module, and a corpus;
characterized in that;
said unit selection module comprises;
a probabilistic context free grammar (PCFG) parser, a latent semantic indexing (LSI) module, and a modified variable-length unit selection scheme;
said PCFG parser parses a Chinese sentence to obtain the CFG of said Chinese sentence as its target unit;
said LSI module estimates the structural distance between the candidate synthesis units and the target unit in said corpus; and
through said modified variable-length unit selection scheme, tagged with a dynamic program algorithm, the units are searched to find the best synthesis unit concatenation sequence of said Chinese sentence. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method for Chinese Text-To-Speech (TTS) synthesis comprising:
-
a word pre-processing module, a unit selection module, and a speech generation module;
said unit selection procedure comprising the following steps;
parsing the CFG of Chinese sentences after they have been subject to said word pre-processing;
building the target unit structural tree of said CFG;
from a corpus, building a plurality of candidate unit structural trees;
said LSI module is used to estimate the structural distance between the target unit structural tree and said plurality of candidate synthesis unit structural trees; and
said dynamic program algorithm is used to search the units so as to find the best synthesis unit concatenation sequence of said Chinese sentence. - View Dependent Claims (10)
-
-
11. A unit selection module used in the Chinese Text-To-Speech (TTS) synthesis system comprising:
-
a probabilistic context free grammar (PCFG) parser, a latent semantic indexing (LSI) module, and a modified variable-length unit selection scheme;
said PCFG parser parses a Chinese sentence to obtain the CFG of said Chinese sentence as its target unit;
said LSI module estimates the structural distance between the candidate synthesis units and the target unit in said corpus; and
through said modified variable-length unit selection scheme, tagged with a dynamic program algorithm, the units are searched to find the best synthesis unit concatenation sequence of said Chinese sentence. - View Dependent Claims (12, 13, 14)
-
-
15. A unit selection method for the Chinese Text-To-Speech (TTS) synthesis system comprising:
-
parsing the CFG of a Chinese sentence;
building the target unit structural tree of said CFG of said Chinese sentence;
from a corpus, building a plurality of candidate unit structural trees;
said LSI module is used to estimate the structural distance between said target unit structural tree and a plurality of said candidate synthesis unit structural trees; and
said dynamic program algorithm is used to search the units so as to find the best synthesis unit concatenation sequence of said Chinese sentence. - View Dependent Claims (16, 17)
-
Specification