Videophone with continuous speech-to-subtitles translation
First Claim
1. An apparatus for providing continuous speech-to-subtitle translation of a signal containing a video portion, and an audio portion comprising:
- (a) means for converting said audio portion to a corresponding first textual signal, wherein said converting means is located at a sending party'"'"'s location;
(b) means for translating said corresponding first textual signal to a second textual signal wherein said second textual signal is in a target language and wherein said translating means is located remotely from said sending party'"'"'s location;
(c) means for combining said video portion with said second textual signal to form a display signal, wherein said display signal displays said second textual signal as subtitles; and
(d) means for simultaneously displaying said display signal and outputting said audio portion.
7 Assignments
0 Petitions
Accused Products
Abstract
There is disclosed a method and apparatus for providing continuous speech-to-subtitles translation utilizing a video-based communications device but without speech synthesis at the output. Instead, a translation of each user'"'"'s speech is displayed continuously in text form on the other user'"'"'s screen. In the preferred embodiment, the sending party speaks into a conventional videophone. Speech recognition and translation of the transmitted signal are performed by a remote device at the receiving party'"'"'s location. The audio portion of the signal is sent both to a speaker for audio output and to a speech recognizer and text-based translation system, the output of which is text translated into the target language. The video portion of the signal and the translated text are combined in a subtitle generator and sent to a display device for viewing by the receiving party.
150 Citations
9 Claims
-
1. An apparatus for providing continuous speech-to-subtitle translation of a signal containing a video portion, and an audio portion comprising:
-
(a) means for converting said audio portion to a corresponding first textual signal, wherein said converting means is located at a sending party'"'"'s location; (b) means for translating said corresponding first textual signal to a second textual signal wherein said second textual signal is in a target language and wherein said translating means is located remotely from said sending party'"'"'s location; (c) means for combining said video portion with said second textual signal to form a display signal, wherein said display signal displays said second textual signal as subtitles; and (d) means for simultaneously displaying said display signal and outputting said audio portion. - View Dependent Claims (2, 3, 4)
-
-
5. A method of providing continuous speech-to-subtitle translation of a signal containing a video portion and an audio portion, comprising the steps of:
-
(e) converting said audio portion to a corresponding first textual signal at a sending party'"'"'s location; (f) translating said corresponding first textual signal to a second textual signal at a location remote from said sending party'"'"'s location, wherein said second textual signal is in a target language; and (g) combining said video portion with said second textual signal to form a display signal, wherein said display signal displays said second textual signal as subtitles. - View Dependent Claims (6, 7, 8, 9)
-
Specification