Apparatus, medium, and method for generating record sentence for corpus and apparatus, medium, and method for building corpus using the same
First Claim
Patent Images
1. A method for generating a record sentence to establish a speech corpus, comprising:
- generating a synthesized sentence of speech and synthesis information related to speech synthesis by performing speech synthesis for a predetermined sentence of text using candidate synthesis units transmitted from synthesis database;
selecting an unseen sentence including an unseen unit according to the synthesis information;
generating a weight indicating a recording priority of the unseen unit included in the selected unseen sentence;
generating a record sentence by combining the unseen unit with the speech synthesis information according to the generated weight; and
updating the speech corpus by storing the record sentence including the unseen unit,wherein the synthesis database is updated based on the updated speech corpus,wherein the unseen unit is selected as a synthesis unit when a speech unit of satisfactory quality cannot be obtained from candidate synthesis units extracted from the synthesis database, and is updated based on the updated synthesis database.
1 Assignment
0 Petitions
Accused Products
Abstract
A method, medium, and apparatus for generating a record sentence to establish a speech corpus, including generating a synthesized sentence of speech and synthesis information related to speech synthesis by performing speech synthesis for a predetermined sentence of text, selecting an unseen sentence including an unseen unit according to the synthesis information, generating a weight indicating a recording priority of the unseen unit included in the selected unseen sentence, and generating a record sentence by combining the unseen unit with the speech synthesis information according to the generated weight.
8 Citations
44 Claims
-
1. A method for generating a record sentence to establish a speech corpus, comprising:
-
generating a synthesized sentence of speech and synthesis information related to speech synthesis by performing speech synthesis for a predetermined sentence of text using candidate synthesis units transmitted from synthesis database; selecting an unseen sentence including an unseen unit according to the synthesis information; generating a weight indicating a recording priority of the unseen unit included in the selected unseen sentence; generating a record sentence by combining the unseen unit with the speech synthesis information according to the generated weight; and updating the speech corpus by storing the record sentence including the unseen unit, wherein the synthesis database is updated based on the updated speech corpus, wherein the unseen unit is selected as a synthesis unit when a speech unit of satisfactory quality cannot be obtained from candidate synthesis units extracted from the synthesis database, and is updated based on the updated synthesis database. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A method of establishing a speech corpus, comprising:
-
performing speech synthesis for a predetermined sentence of text using candidate synthesis units transmitted from a synthesis database; extracting an unseen unit from an unseen sentence by using synthesis information related to the speech synthesis; generating a record sentence according to the extracted unseen unit; converting the record sentence including the unseen unit into a speech signal; and updating by storing the record sentence converted into the speech signal in the speech corpus, wherein the synthesis database is updated based on the updated speech corpus, wherein the unseen unit is selected as a synthesis unit when a speech unit of satisfactory quality cannot be obtained from candidate synthesis units extracted from the synthesis database, and is updated based on the updated synthesis database, the generating of the record sentence is performed by combining the selected unseen unit with the speech synthesis information, and the combining of the selected unseen unit with the speech synthesis information comprises generating a weight according to a linguistic criterion for the unseen unit and extracting the unseen unit in order according to the generated weight. - View Dependent Claims (24, 25, 26, 27)
-
-
28. An apparatus for generating a record sentence for establishing a speech corpus, the apparatus comprising:
-
a speech synthesis unit that generates a synthesized sentence of speech and synthesis information indicating information related to speech synthesis by performing speech synthesis for a predetermined sentence of text using candidate synthesis units transmitted from a synthesis database; an unseen sentence selection unit that selects an unseen sentence including an unseen unit according to the generated synthesis information; a generation unit extraction unit that generates a weight indicating a recording priority of an unseen unit included in the selected unseen sentence; and a record sentence generation unit that generates a record sentence by combining an unseen unit with the speech synthesis information according to the generated weight and automatically updating the speech corpus by storing the record sentence including the unseen unit, wherein the synthesis database is updated based on the updated speech corpus, wherein the unseen unit is selected as a synthesis unit when a speech unit of satisfactory quality cannot be obtained from candidate synthesis units extracted from the synthesis database, and is updated based on the updated synthesis database. - View Dependent Claims (29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44)
-
Specification