APPARATUS FOR SYNCHRONOUSLY PROCESSING TEXT DATA AND VOICE DATA
First Claim
1. An apparatus for synchronously processing text data and voice data,comprising:
- a storing unit for storing text data constituted by a plurality of phrases and voice data of the text data;
a text data dividing section for dividing the text data stored in the storing unit into the phrases and storing the divided text data, with identifiers which respectively correspond to the divided text data and indicate the division order, in the storing unit;
a text data phoneme converting section for phonemically converting the divided text data, phrase by phrase, to obtain text data phoneme conversion values and storing the text data phoneme conversion values, which respectively correspond to the phrases, in the storing unit;
a text data phoneme conversion accumulated value calculating section for calculating accumulated values of the text data phoneme conversion value of each phrase of the divided text data and storing the accumulated values, which respectively correspond to the phrases of the divided text data, in the storing unit;
a voice data dividing section for extracting a silent segment, from the voice data, on the basis of a predetermined silent segment decision datum, dividing the voice data in the extracted silent segment, and storing the divided voice data, with identifiers which respectively correspond to the divided voice data and indicate the division order, in the storing unit;
a reading data phoneme converting section for phonemically converting the divided voice data, which have been divided division range by division range, to obtain voice data phoneme conversion values and storing the voice data phoneme conversion values, which respectively correspond to the division ranges, in the storing unit;
a voice data phoneme conversion accumulated value calculating section for calculating accumulated values of the voice data phoneme conversion value of each division range of the divided voice data and storing the accumulated values, which respectively correspond to the division ranges of the divided voice data, in the storing unit;
a phrase corresponding data producing section for extracting the nearest approximate values of the voice data phoneme accumulated values with respect to the text data phoneme conversion accumulated values corresponding to the phrases of the divided text data, and producing phrase corresponding data, in which the voice data phoneme conversion accumulated values respectively corresponding to the phrases of the divided text data are associated with identifiers indicating playback order of the phrases of the divided text data; and
an output section for outputting the corresponding phrases of the text data and the divided voice data, which correspond to each other, on the basis of the phrase corresponding data.
1 Assignment
0 Petitions
Accused Products
Abstract
The apparatus for synchronously processing text data and voice data, comprises: a storing unit for storing text data and voice data; a text data dividing section for dividing the text data; a text data phoneme converting section for phonemically converting the divided text data; a text data phoneme conversion accumulated value calculating section for calculating accumulated values of text data phoneme conversion values; a voice data dividing section for dividing the voice data; a reading data phoneme converting section for phonemically converting the divided voice data; a voice data phoneme conversion accumulated value calculating section for calculating accumulated values of voice data phoneme conversion values; a phrase corresponding data producing section for producing phrase corresponding data; and an output section for synchronously outputting the text data and the divided voice data.
16 Citations
6 Claims
-
1. An apparatus for synchronously processing text data and voice data,
comprising: -
a storing unit for storing text data constituted by a plurality of phrases and voice data of the text data; a text data dividing section for dividing the text data stored in the storing unit into the phrases and storing the divided text data, with identifiers which respectively correspond to the divided text data and indicate the division order, in the storing unit; a text data phoneme converting section for phonemically converting the divided text data, phrase by phrase, to obtain text data phoneme conversion values and storing the text data phoneme conversion values, which respectively correspond to the phrases, in the storing unit; a text data phoneme conversion accumulated value calculating section for calculating accumulated values of the text data phoneme conversion value of each phrase of the divided text data and storing the accumulated values, which respectively correspond to the phrases of the divided text data, in the storing unit; a voice data dividing section for extracting a silent segment, from the voice data, on the basis of a predetermined silent segment decision datum, dividing the voice data in the extracted silent segment, and storing the divided voice data, with identifiers which respectively correspond to the divided voice data and indicate the division order, in the storing unit; a reading data phoneme converting section for phonemically converting the divided voice data, which have been divided division range by division range, to obtain voice data phoneme conversion values and storing the voice data phoneme conversion values, which respectively correspond to the division ranges, in the storing unit; a voice data phoneme conversion accumulated value calculating section for calculating accumulated values of the voice data phoneme conversion value of each division range of the divided voice data and storing the accumulated values, which respectively correspond to the division ranges of the divided voice data, in the storing unit; a phrase corresponding data producing section for extracting the nearest approximate values of the voice data phoneme accumulated values with respect to the text data phoneme conversion accumulated values corresponding to the phrases of the divided text data, and producing phrase corresponding data, in which the voice data phoneme conversion accumulated values respectively corresponding to the phrases of the divided text data are associated with identifiers indicating playback order of the phrases of the divided text data; and an output section for outputting the corresponding phrases of the text data and the divided voice data, which correspond to each other, on the basis of the phrase corresponding data. - View Dependent Claims (2, 3, 4, 5, 6)
-
Specification