Speech translation device and method
First Claim
Patent Images
1. A speech translation device comprising:
- a speech input unit configured to acquire speech data of an arbitrary language;
a speech recognition unit configured to obtain recognition data by performing a recognition processing of the speech data of the arbitrary language and to obtain a recognition likelihood of each of segments of the recognition data;
a translation unit configured to translate the recognition data into translation data of another language other than the arbitrary language and to obtain a translation likelihood of each of segments of the translation data;
a parameter setting unit configured to set a parameter necessary for performing speech synthesis from the translation data by using the recognition likelihood and the translation likelihood;
a speech synthesis unit configured to convert the translation data into speech data for speaking in the another language by using the parameter for each of the segments; and
a speech output unit configured to output a speech sound from the speech data of the another language.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech translation device includes a speech input unit, a speech recognition unit, a machine translation unit, a parameter setting unit, a speech synthesis unit, and a speech output unit, and a speech volume value of speech data to be outputted is determined from plural likelihoods obtained by the speech recognition/machine translation. With respect to a word with a low likelihood, the speech volume value is made small and is made hard to transmit to the user, and on the other hand, with respect to a word with a high likelihood, the speech volume value is made large and is especially emphasized and is transmitted to the user.
34 Citations
12 Claims
-
1. A speech translation device comprising:
-
a speech input unit configured to acquire speech data of an arbitrary language; a speech recognition unit configured to obtain recognition data by performing a recognition processing of the speech data of the arbitrary language and to obtain a recognition likelihood of each of segments of the recognition data; a translation unit configured to translate the recognition data into translation data of another language other than the arbitrary language and to obtain a translation likelihood of each of segments of the translation data; a parameter setting unit configured to set a parameter necessary for performing speech synthesis from the translation data by using the recognition likelihood and the translation likelihood; a speech synthesis unit configured to convert the translation data into speech data for speaking in the another language by using the parameter for each of the segments; and a speech output unit configured to output a speech sound from the speech data of the another language. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A speech translation method comprising:
-
acquiring speech data of an arbitrary language; obtaining recognition data by performing a recognition processing of the speech data of the arbitrary language and obtaining a recognition likelihood of each of segments of the recognition data; translating the recognition data into translation data of another language other than the arbitrary language and obtaining a translation likelihood of each of segments of the translation data; setting a parameter necessary for performing speech synthesis from the translation data by using the recognition likelihood and the translation likelihood; converting the translation data into speech data for speaking in the another language by using the parameter for each of the segments; and outputting a speech sound from the speech data of the another language.
-
-
12. A program product stored in a computer readable medium for speech translation, the program product comprising instructions of:
-
acquiring speech data of an arbitrary language; obtaining recognition data by performing a recognition processing of the speech data of the arbitrary language and obtaining a recognition likelihood of each of segments of the recognition data; translating the recognition data into translation data of another language other than the arbitrary language and obtaining a translation likelihood of each of segments of the translation data; setting a parameter necessary for performing speech synthesis from the translation data by using the recognition likelihood and the translation likelihood o; converting the translation data into speech data for speaking in the another language by using the parameter for each of the segments; and outputting a speech sound from the speech data of the another language.
-
Specification