Method for generating text script of high efficiency
First Claim
1. A method of generation text script of high efficiency, said method comprising:
- selecting N1 sentences with best integrated efficiency from a source corpus comprised by at least a sentence and resulting N1 sets, wherein each set of said N1 sets has at least a sentence;
repeating procedures for generating text script of high efficiency until satisfying a termination criterion, said procedures comprising;
deleting the sentences in said Ni sets from said source corpus and resulting in Ni corpora, wherein Ni is equal to or greater than two;
correspondingly selecting Mi+l sentences with best integrated efficiency from each of said Ni corpora and resulting in Ni×
Mi+1 sets, wherein each of the Ni×
Mi+1 sets is generated by placing each of the Mi+1 sentences into a corresponding set of the Ni sets of a previous procedure;
selecting Ni+1 sets with best integrated efficiency from said Ni×
Mi+1 sets;
replacing said Ni sets with said Ni+1 sets when a termination criterion is satisfied and the set with best integrated efficiency among said Ni+1 sets is said text script of high efficiency; and
storing said text script in a memory, and said text script being used as text script for corpus of TTS (text to speech);
wherein i meaning an ith procedure, i=1, 2, . . . ;
Ni+1 being a number of said selected sets with best integrated efficiency in said ith procedure;
Mi+1 being a number of said selected sentences with best integrated efficiency from one of the Ni corpora;
Mi and Ni being an integer and greater than or equal to one, j=1, 2, . . . ; and
said integrated efficiency being decided upon an integrated efficiency function that comprising reciprocals of total unit instances of said selected sentence or set of sentences.
2 Assignments
0 Petitions
Accused Products
Abstract
This proposal presents performance indices and search criteria for the text script generation in the design of corpus-based TTS systems. Based on our criteria a new search method is presented to solve the text selection problem more systematically and efficiently, unlike previous researches either concentrated on covering rate or on hit rate. By control a weighting factor, the covering rate of unit types can be increased to improve the robustness of the TTS system. Finally, the scalable and controllable design of the multi-stage search can produce various kinds of text scripts ideally suitable for the requirement of various kinds of corpus-based TTS systems.
2 Citations
18 Claims
-
1. A method of generation text script of high efficiency, said method comprising:
-
selecting N1 sentences with best integrated efficiency from a source corpus comprised by at least a sentence and resulting N1 sets, wherein each set of said N1 sets has at least a sentence; repeating procedures for generating text script of high efficiency until satisfying a termination criterion, said procedures comprising; deleting the sentences in said Ni sets from said source corpus and resulting in Ni corpora, wherein Ni is equal to or greater than two; correspondingly selecting Mi+l sentences with best integrated efficiency from each of said Ni corpora and resulting in Ni×
Mi+1 sets, wherein each of the Ni×
Mi+1 sets is generated by placing each of the Mi+1 sentences into a corresponding set of the Ni sets of a previous procedure;selecting Ni+1 sets with best integrated efficiency from said Ni×
Mi+1 sets;replacing said Ni sets with said Ni+1 sets when a termination criterion is satisfied and the set with best integrated efficiency among said Ni+1 sets is said text script of high efficiency; and storing said text script in a memory, and said text script being used as text script for corpus of TTS (text to speech); wherein i meaning an ith procedure, i=1, 2, . . . ;
Ni+1 being a number of said selected sets with best integrated efficiency in said ith procedure;
Mi+1 being a number of said selected sentences with best integrated efficiency from one of the Ni corpora;
Mi and Ni being an integer and greater than or equal to one, j=1, 2, . . . ; and
said integrated efficiency being decided upon an integrated efficiency function that comprising reciprocals of total unit instances of said selected sentence or set of sentences. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 18)
-
-
10. A method of scalably generating text script of high efficiency, said method comprising:
-
selecting N1 sentences aimed at a unit-class with best N1 integrated efficiency from a source corpus comprised by at least a sentence and resulting N1 sets, wherein said source corpus comprising by at least a unit instance corresponding to at least a unit type, said unit-class separated different classes according to said unit types, and each set of said N1 sets comprised by at least a sentence; repeating procedures for generating text script of high efficiency until satisfying a termination criterion of unit-class, said procedures comprising; selecting N1 sentences with best integrated efficiency from a source corpus comprised by at least a sentence and resulting N1 sets, wherein each set of said N1 sets comprised by at least a sentence; repeating procedures for generating text script of high efficiency until satisfying a termination criterion, said procedures comprising; deleting the sentences in said Ni sets from said source corpus and resulting in Ni corpora, wherein Ni is equal to or greater than two; correspondingly selecting Mi+l sentences with best integrated efficiency from each of said Ni corpora and resulting in Ni×
Mi+1 sets, wherein each of the Ni×
Mi+1 sets is generated by placing each of the Mi+1 sentences into a corresponding set of the Ni sets of a previous procedure;selecting Ni+1 sets with best integrated efficiency from said Ni×
Mi+1 sets;replacing said Ni sets with said Ni sets when a termination criterion is satisfied and the set with best integrated efficiency among said Ni+1 sets is said text script of high efficiency; and storing said text script in a memory, and said text script being used as text script for corpus of TTS (text to speech); wherein i meaning an ith procedure, i=1, 2, . . . ;
Ni+1 being a number of said selected sets with best integrated efficiency in said ith procedure;
Mi+1 being a number of said selected sentences with best integrated efficiency from one of the Ni corpora;
Mi and Ni being an integer and greater than or equal to one, j=1, 2, . . . ; and
said integrated efficiency being decided upon an integrated efficiency function that comprising reciprocals of total unit instances of said selected sentence or set of sentences. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17)
-
Specification