Method and apparatus of generating text script for a corpus-based text-to speech system
First Claim
1. A method of text script generation for a corpus-based text-to-speech system configured with a computing device for text script searching and processing and a memory device for corpus storage, comprising:
- (a) searching in a source corpus being stored in said memory device and having L sentences, selecting N sentences with a best integrated efficiency as N best cases, and setting iteration k to be 1, k, L and N being natural numbers, N≦
L;
(b) for each case n of the N best cases, 1≦
n≦
N, searching in said source corpus and selecting by the computing device, Mk+1 best sentences with the best integrated efficiency from the unselected sentences in said source corpus, 1≦
Mk+1≦
L;
(c) searching in said source corpus and keeping N best cases out of the total unselected sentences for next iteration, and increasing iteration k by 1; and
(d) if a termination criterion being reached, setting the best case in the N traced cases as the text script, otherwise, returning to step (b);
wherein said best integrated efficiency depends on a function of combining the covering rate efficiency of unit types, the hit rate efficiency of unit types, and the text script size.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of text script generation for a corpus-based text-to-speech system includes searching in a source corpus having L sentences, selecting N sentences with a best integrated efficiency as N best cases, and setting iteration k to be 1; for each case n of the N best cases, selecting Mk+1 best sentences with the best integrated efficiency from the unselected sentences in the source corpus; keeping N best cases out of the total unselected sentences for next iteration, and increasing iteration k by 1; and if a termination criterion being reached, setting the best case in the N traced cases as the text script, otherwise, returning to the (k+1)th iteration of searching in the unselected sentences for (k+1)th sentence; wherein the best integrated efficiency depends on a function of combining the covering rate of the synthesis unit type, the hit rate of the synthesis unit type, and the text script size.
1 Citation
14 Claims
-
1. A method of text script generation for a corpus-based text-to-speech system configured with a computing device for text script searching and processing and a memory device for corpus storage, comprising:
-
(a) searching in a source corpus being stored in said memory device and having L sentences, selecting N sentences with a best integrated efficiency as N best cases, and setting iteration k to be 1, k, L and N being natural numbers, N≦
L;(b) for each case n of the N best cases, 1≦
n≦
N, searching in said source corpus and selecting by the computing device, Mk+1 best sentences with the best integrated efficiency from the unselected sentences in said source corpus, 1≦
Mk+1≦
L;(c) searching in said source corpus and keeping N best cases out of the total unselected sentences for next iteration, and increasing iteration k by 1; and (d) if a termination criterion being reached, setting the best case in the N traced cases as the text script, otherwise, returning to step (b); wherein said best integrated efficiency depends on a function of combining the covering rate efficiency of unit types, the hit rate efficiency of unit types, and the text script size. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A text script generator for a corpus-based text-to-speech system configured with a computing device for text script searching and processing and a memory device for corpus storage, comprising:
-
a search criteria selector constructed in said computing device for searching in a source corpus being stored in said memory device and having L sentences, and selecting N sentences with a best integrated efficiency as N best cases, L and N being natural numbers, N≦
L;a performance index constructor constructed in said computing device and coupled to said search criteria selector, for providing covering rate and hit rate corresponding to all unit types in said source corpus; and a termination criteria detector constructed in said computing device and coupled to said search criteria selector, for generating a best case in the N traced cases as a text script upon detecting a termination criterion is reached; wherein said best integrated efficiency depends on a function of combining the covering rate efficiency of unit types, the hit rate efficiency of unit types, and the size of said text script. - View Dependent Claims (12, 13, 14)
-
Specification