System And Method For Supporting Text-To-Speech
First Claim
1. A system for supporting text-to-speech, comprising:
- a learning data generating unit which recognizes inputted speech, and generates first learning data in which wordings of phrases are associated with readings thereof;
a frequency data generating unit which generates, on the basis of the first learning data, frequency data indicating appearance frequencies of both wordings and readings of phrases;
a language processing unit; and
a setting unit which sets frequency data in the language processing unit for generating, from a wording of text, a reading corresponding to the wording, on the basis of appearance frequencies of readings corresponding to the wording in order to approximate outputted speech of text-to-speech to the inputted speech.
8 Assignments
0 Petitions
Accused Products
Abstract
A system for generating high-quality synthesized text-to-speech includes a learning data generating unit, a frequency data generating unit, and a setting unit. The learning data generating unit recognizes inputted speech, and then generates first learning data in which wordings of phrases are associated with readings thereof. The frequency data generating unit generates, based on the first learning data, frequency data indicating appearance frequencies of both wordings and readings of phrases. The setting unit sets the thus generated frequency data for a language processing unit in order to approximate outputted speech of text-to-speech to the inputted speech. Furthermore, the language processing unit generates, from a wording of text, a reading corresponding to the wording, on the basis of the appearance frequencies.
12 Citations
35 Claims
-
1. A system for supporting text-to-speech, comprising:
-
a learning data generating unit which recognizes inputted speech, and generates first learning data in which wordings of phrases are associated with readings thereof; a frequency data generating unit which generates, on the basis of the first learning data, frequency data indicating appearance frequencies of both wordings and readings of phrases; a language processing unit; and a setting unit which sets frequency data in the language processing unit for generating, from a wording of text, a reading corresponding to the wording, on the basis of appearance frequencies of readings corresponding to the wording in order to approximate outputted speech of text-to-speech to the inputted speech. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system for supporting text-to-speech, comprising:
-
a learning data generating unit which recognizes inputted speech, and generates first learning data in which wordings of phrases are associated with readings thereof; a language processing unit; and a learning unit which causes the language processing unit to learn on the basis of the first learning data, the language processing unit generating, from a wording of text, a reading corresponding to the wording, on the basis of appearance frequencies in the first learning data in order to approximate outputted speech of text-to-speech to the inputted speech. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A method of supporting text-to-speech, comprising the steps of:
-
recognizing inputted speech, and generating first learning data in which wordings of phrases are associated with readings thereof; generating, on the basis of the first learning data, frequency data indicating appearance frequencies of both wordings, and readings of phrases; and setting frequency data in a language processing unit which generates, from a wording of text, a reading corresponding to the wording, on the basis of appearance frequencies of readings corresponding to the wording in order to approximate outputted speech of text-to-speech to the inputted speech. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
-
22. A program product for allowing an information processing apparatus to function as a system for supporting text-to-speech, the program product causing the information system to function as:
-
a learning data generating unit which recognizes inputted speech, and generates first learning data in which wordings of phrases are associated with readings thereof; a frequency data generating unit which generates, on the basis of the first learning data, frequency data indicating appearance frequencies of both wordings, and readings of phrases; and a setting unit which, in order to approximate outputted speech of text-to-speech to the inputted speech, sets frequency data in a language processing unit for generating, from a wording of text, a reading corresponding the wording, on the basis of appearance frequencies of readings corresponding to the wording. - View Dependent Claims (23, 24, 25, 26, 27, 28)
-
-
29. An article of manufacture comprising a computer usable medium having computer readable program code means embodied therein for supporting text-to-speech, the computer readable program code means in said article of manufacture comprising:
-
computer readable program code means for causing a computer to effect recognizing inputted speech and generating first learning data in which wordings of phrases are associated with readings thereof; computer readable program code means for causing a computer to effect generating, on the basis of the first learning data, frequency data indicating appearance frequencies of both wordings, and readings of phrases; and computer readable program code means for causing a computer to effect setting frequency data in a language processing unit which generates, from a wording of text, a reading corresponding to the wording, on the basis of appearance frequencies of readings corresponding to the wording in order to approximate outputted speech of text-to-speech to the inputted speech. - View Dependent Claims (30, 31, 32, 33, 34, 35)
-
Specification