×

Apparatus for synchronously processing text data and voice data

  • US 9,679,566 B2
  • Filed: 06/29/2015
  • Issued: 06/13/2017
  • Est. Priority Date: 06/30/2014
  • Status: Active Grant
First Claim
Patent Images

1. An apparatus for synchronously processing text data and voice data, comprising:

  • a storing unit that stores text data constituted by a plurality of phrases and voice data of the text data; and

    a central processing unit (CPU) which performs;

    dividing the text data stored in the storing unit into the phrases and storing the divided text data, with identifiers which respectively correspond to the divided text data and indicate the division order, in the storing unit;

    phonemically converting the divided text data, phrase by phrase, to obtain text data phoneme conversion values and storing the text data phoneme conversion values, which respectively correspond to the phrases, in the storing unit;

    calculating accumulated values of the text data phoneme conversion value of each phrase of the divided text data by calculating percentage of the text data phoneme conversion accumulated value TN of each of the phrases to the text data phoneme conversion accumulated value TN of the final phrase of the text data TD, to the second decimal point and by multiplying the percentage of the text data phoneme conversion accumulated value TN of each of the divided text data DTD by 100 and storing the accumulated values, which respectively correspond to the phrases of the divided text data, in the storing unit;

    extracting a silent segment, from the voice data, on the basis of a predetermined silent segment decision datum, dividing the voice data in the extracted silent segment, and storing the divided voice data, with identifiers which respectively correspond to the divided voice data and indicate the division order, in the storing unit;

    phonemically converting the divided voice data, which have been divided division range by division range, to obtain voice data phoneme conversion values and storing the voice data phoneme conversion values, which respectively correspond to the division ranges, in the storing unit;

    calculating accumulated values of the voice data phoneme conversion value of each division range of the divided voice data by calculating percentage of the calculated voice data phoneme conversion accumulated values SN to a total value of the calculated voice data phoneme conversion accumulated values SN, to the second decimal point and multiplying the percentage by 100 and storing the accumulated values, which respectively correspond to the division ranges of the divided voice data, in the storing unit;

    extracting the nearest approximate values of the voice data phoneme accumulated values with respect to the text data phoneme conversion accumulated values corresponding to the phrases of the divided text data, and producing phrase corresponding data, in which the voice data phoneme conversion accumulated values respectively corresponding to the phrases of the divided text data are associated with identifiers indicating playback order of the phrases of the divided text data; and

    outputting the corresponding phrases of the text data and the divided voice data, which correspond to each other, on the basis of the phrase corresponding data.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×