AUTO-TRANSLATION FOR MULTI USER AUDIO AND VIDEO
First Claim
1. A method comprising:
receiving, at data processing hardware, an output data stream from a user device, the output data stream comprising a language preference indicator and first audio signals representing speech in a first language, the language preference indicator comprising a target language specified by a user of the user device for translating the speech in the first language;
converting, by the data processing hardware, the first audio signals into text in the first language;
translating, by the data processing hardware, the text in the first language into text in the target language using the language preference indicator;
converting, by the data processing hardware, the text in the target language into second audio signals representing a spoken version of the text in the target language; and
transmitting, by the data processing hardware, the second audio signals representing the speech in the target language to the user device.
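The steps recited in the claim (receive, speech-to-text, translate, text-to-speech, transmit) can be pictured as a small pipeline. The following is only an illustrative sketch under stated assumptions, not the patented implementation: the names `OutputDataStream`, `TranslationEngines`, and `handle_stream` are hypothetical, and the "engines" are toy stand-ins rather than real speech or translation services.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class OutputDataStream:
    """What the user device sends: a language preference plus captured audio."""
    target_language: str  # language preference indicator, e.g. "es"
    audio: bytes          # first audio signals (speech in the first language)


@dataclass(frozen=True)
class TranslationEngines:
    """Hypothetical pluggable back ends; these names do not come from the patent."""
    speech_to_text: Callable[[bytes], str]
    translate_text: Callable[[str, str], str]    # (text, target_language) -> text
    text_to_speech: Callable[[str, str], bytes]  # (text, language) -> audio


def handle_stream(stream: OutputDataStream, engines: TranslationEngines) -> bytes:
    """Run the claimed conversion steps and return the audio for the user device."""
    source_text = engines.speech_to_text(stream.audio)
    target_text = engines.translate_text(source_text, stream.target_language)
    return engines.text_to_speech(target_text, stream.target_language)


# Toy stand-ins (assumptions) so the sketch runs without any real service.
_PHRASEBOOK = {("hello", "es"): "hola"}

toy_engines = TranslationEngines(
    speech_to_text=lambda audio: audio.decode("utf-8"),
    translate_text=lambda text, lang: _PHRASEBOOK.get((text, lang), text),
    text_to_speech=lambda text, lang: f"{lang}:{text}".encode("utf-8"),
)

reply = handle_stream(OutputDataStream("es", b"hello"), toy_engines)
```

Passing the engines in as callables keeps the sketch independent of any particular speech or translation provider; the target language travels with the stream, as the claim describes.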
Abstract
The disclosed subject matter provides a system, a computer-readable storage medium, and a method for providing an audio and textual transcript of a communication. A conferencing service may receive audio or audiovisual signals from a plurality of different devices that receive voice communications from participants in a communication, such as a chat or teleconference. The audio signals represent voice (speech) communications input into the respective devices by the participants. A translation services server may receive the audio signals over a separate communication channel for translation into a second language. As managed by the translation services server, the audio signals may be converted into textual data. The textual data may be translated into text of different languages based on the language preferences of the end user devices in the teleconference. The translated text may be further converted into audio signals.
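The abstract's multi-participant case, where one transcript is translated into different languages according to each end user device's preference, might be sketched as follows. This is a hedged illustration only: `fan_out_translations` and `toy_translate` are hypothetical names, and two design choices are assumptions not stated in the source, namely translating once per distinct language and passing the transcript through untranslated to devices that already use the speaker's language.

```python
from collections import defaultdict
from typing import Callable, Dict, List


def fan_out_translations(
    transcript: str,
    source_language: str,
    device_preferences: Dict[str, str],
    translate_text: Callable[[str, str], str],
) -> Dict[str, str]:
    """Return the text each device should receive, translating once per language."""
    # Group devices by preferred language so each language is translated only once.
    devices_by_language: Dict[str, List[str]] = defaultdict(list)
    for device_id, language in device_preferences.items():
        devices_by_language[language].append(device_id)

    delivered: Dict[str, str] = {}
    for language, device_ids in devices_by_language.items():
        # Assumption: devices already using the speaker's language need no translation.
        if language == source_language:
            text = transcript
        else:
            text = translate_text(transcript, language)
        for device_id in device_ids:
            delivered[device_id] = text
    return delivered


# Toy translator (assumption) that records which languages were requested.
calls: List[str] = []


def toy_translate(text: str, language: str) -> str:
    calls.append(language)
    return f"[{language}] {text}"


result = fan_out_translations(
    "hello",
    "en",
    {"alice": "en", "bob": "es", "carol": "es", "dan": "fr"},
    toy_translate,
)
```

In this toy run, two devices share Spanish, so the translator is invoked once for Spanish and once for French, while the English-preferring device receives the original transcript.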
1 Citation
20 Claims
1. A method comprising:
receiving, at data processing hardware, an output data stream from a user device, the output data stream comprising a language preference indicator and first audio signals representing speech in a first language, the language preference indicator comprising a target language specified by a user of the user device for translating the speech in the first language;
converting, by the data processing hardware, the first audio signals into text in the first language;
translating, by the data processing hardware, the text in the first language into text in the target language using the language preference indicator;
converting, by the data processing hardware, the text in the target language into second audio signals representing a spoken version of the text in the target language; and
transmitting, by the data processing hardware, the second audio signals representing the speech in the target language to the user device.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A system comprising:
data processing hardware; and
memory hardware in communication with the data processing hardware and storing instructions that when executed by the data processing hardware cause the data processing hardware to perform operations comprising:
receiving an output data stream from a user device, the output data stream comprising a language preference indicator and first audio signals representing speech in a first language, the language preference indicator comprising a target language specified by a user of the user device for translating the speech in the first language;
converting the first audio signals into text in the first language;
translating the text in the first language into text in the target language using the language preference indicator;
converting the text in the target language into second audio signals representing a spoken version of the text in the target language; and
transmitting the second audio signals representing the speech in the target language to the user device.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19, 20
Specification