In-Call Translation
First Claim
1. A language translation relay system for use in a communication system, the communication system for effecting a voice or video call between at least a source user speaking a source language and a target user speaking a target language, the relay system comprising:
- an input configured to receive call audio of the call from a remote source user device of the source user via a communication network of the communication system, the call audio comprising speech of the source user in the source language;
a speech recognition component configured to perform an automatic speech recognition procedure on the call audio;
a translation component configured to generate a translation of the source user'"'"'s speech in the target language using the results of the speech recognition procedure, the translation comprising a translated synthetic speech audio version of the source user'"'"'s speech in the target language for playing out at the target user device, the synthetic speech generated based on the results of the speech recognition procedure;
a mixing component configured to mix the synthetic speech with the source user'"'"'s call audio and/or with translated audio of the target user'"'"'s speech in the source language, thereby generating a mixed audio signal; and
an output configured to transmit the mixed audio signal to at least a remote target user device of the target user via the communication network for outputting to the target user during the call.
1 Assignment
0 Petitions
Accused Products
Abstract
Call audio of a call between a source user speaking a source language and a target user speaking a target language is received from a remote source user device of a source user via a communication network of a communication system, the call audio comprising speech of the source user in the source language. An automatic speech recognition procedure is performed on the call audio. A translation of the source user'"'"'s speech is generated in the target language using the results of the speech recognition procedure. A translated synthetic speech audio version of the source user'"'"'s speech is mixed with the source user'"'"'s call audio and/or with translated audio of the target user'"'"'s speech in the source language. The mixed audio signal is transmitted to a remote target user device of the target user via the communication network for outputting to at least the target user during the call.
69 Citations
20 Claims
-
1. A language translation relay system for use in a communication system, the communication system for effecting a voice or video call between at least a source user speaking a source language and a target user speaking a target language, the relay system comprising:
-
an input configured to receive call audio of the call from a remote source user device of the source user via a communication network of the communication system, the call audio comprising speech of the source user in the source language; a speech recognition component configured to perform an automatic speech recognition procedure on the call audio; a translation component configured to generate a translation of the source user'"'"'s speech in the target language using the results of the speech recognition procedure, the translation comprising a translated synthetic speech audio version of the source user'"'"'s speech in the target language for playing out at the target user device, the synthetic speech generated based on the results of the speech recognition procedure; a mixing component configured to mix the synthetic speech with the source user'"'"'s call audio and/or with translated audio of the target user'"'"'s speech in the source language, thereby generating a mixed audio signal; and an output configured to transmit the mixed audio signal to at least a remote target user device of the target user via the communication network for outputting to the target user during the call. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A method performed at a language translation relay system of a communication system, the communication system for effecting a voice or video call between at least a source user speaking a source language and a target user speaking a target language, the method comprising:
-
receiving call audio of the call from a remote source user device of the source user via a communication network of the communication system, the call audio comprising speech of the source user in the source language; performing an automatic speech recognition procedure on the call audio; generating a translation of the source user'"'"'s speech in the target language using the results of the speech recognition procedure, the translation comprising a translated synthetic speech audio version of the source user'"'"'s speech in the target language for playing out at the target user device, the synthetic speech generated based on the results of the speech recognition procedure; mixing the synthetic speech with the source user'"'"'s call audio and/or with translated audio of the target user'"'"'s speech in the source language, thereby generating a mixed audio signal; and transmitting the mixed audio signal to a remote target user device of the target user via the communication network for outputting to at least the target user during the call. - View Dependent Claims (14, 15, 16, 17, 18, 19)
-
-
20. A computer program product comprising computer code stored on a computer readable storage medium for execution on a language translation relay system of a communication system, the communication system for effecting a voice or video call between at least a source user speaking a source language and a target user speaking a target language, the code configured when executed to cause operations of:
-
receiving call audio of the call from a remote source user device of the source user via a communication network of the communication system, the call audio comprising speech of the source user in the source language; performing an automatic speech recognition procedure on the call audio; generating a translation of the source user'"'"'s speech in the target language using the results of the speech recognition procedure, the translation comprising a translated synthetic speech audio version of the source user'"'"'s speech in the target language for playing out at the target user device, the synthetic speech audio version generated based on the results of the speech recognition procedure; mixing the synthetic speech with the source user'"'"'s call audio and/or with translated audio of the target user'"'"'s speech in the source language, thereby generating a mixed audio signal; and transmitting the mixed audio signal to at least a remote target user device of the target user via the communication network for outputting to the target user during the call.
-
Specification