Network based speech to speech translation
First Claim
Patent Images
1. A computer-implemented method comprising:
- receiving, from a server and at a first client device, during a translation process, first audio data corresponding to a translation of first user input speech in a first language to a second language, wherein the first language is determined based at least on a determined geographical location of a second client device and the second language is determined based at least on a determined geographical location of the first client device;
providing, for output at the first client device and during the translation process, the first audio data corresponding to the translation of the first user input speech in the first language to the second language;
receiving, at the first client device and during the translation process, second audio data corresponding to second user input speech in the second language;
determining, by the first client device, to initiate translation of the second audio data corresponding to the second user input speech in the second language; and
based on the determination to initiate translation of the second audio data corresponding to the second user input speech in the second language, transmitting the second audio data corresponding to the second user input speech in the second language from the first client device to the server for translation from the second language to the first language, wherein the server is configured to;
generate third audio data corresponding to a translation of the second user input speech in the second language to the first language;
store the third audio data corresponding to the translation of the second user input speech; and
in response to receiving a request from the second client device to retrieve the third audio data corresponding to the translation of the second user input speech, transmit, to the second client device, the third audio data corresponding to the translation of the second user input speech.
2 Assignments
0 Petitions
Accused Products
Abstract
A method, performed on a server, of translating between languages includes receiving first audio data for a first language from a mobile device, translating the first audio data to second audio data for a second language, receiving an indication that the mobile device has moved between two locations, and sending the second audio data to the mobile device in response to the indication.
-
Citations
20 Claims
-
1. A computer-implemented method comprising:
-
receiving, from a server and at a first client device, during a translation process, first audio data corresponding to a translation of first user input speech in a first language to a second language, wherein the first language is determined based at least on a determined geographical location of a second client device and the second language is determined based at least on a determined geographical location of the first client device; providing, for output at the first client device and during the translation process, the first audio data corresponding to the translation of the first user input speech in the first language to the second language; receiving, at the first client device and during the translation process, second audio data corresponding to second user input speech in the second language; determining, by the first client device, to initiate translation of the second audio data corresponding to the second user input speech in the second language; and based on the determination to initiate translation of the second audio data corresponding to the second user input speech in the second language, transmitting the second audio data corresponding to the second user input speech in the second language from the first client device to the server for translation from the second language to the first language, wherein the server is configured to; generate third audio data corresponding to a translation of the second user input speech in the second language to the first language; store the third audio data corresponding to the translation of the second user input speech; and in response to receiving a request from the second client device to retrieve the third audio data corresponding to the translation of the second user input speech, transmit, to the second client device, the third audio data corresponding to the translation of the second user input speech. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A system comprising:
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; receiving, from a server and at a first client device, during a translation process, first audio data corresponding to a translation of first user input speech in a first language to a second language, wherein the first language is determined based at least on a determined geographical location of a second client device and the second language is determined based at least on a determined geographical location of the first client device; providing, for output at the first client device and during the translation process, the first audio data corresponding to the translation of the first user input speech in the first language to the second language; receiving, at the first client device and during the translation process, second audio data corresponding to second user input speech in the second language; determining, by the first client device, to initiate translation of the second audio data corresponding to the second user input speech in the second language; and based on the determination to initiate translation of the second audio data corresponding to the second user input speech in the second language, transmitting the second audio data corresponding to the second user input speech in the second language from the first client device to the server for translation from the second language to the first language, wherein the server is configured to; generate third audio data corresponding to a translation of the second user input speech in the second language to the first language; store the third audio data corresponding to the translation of the second user input speech; and in response to receiving a request from the second client device to retrieve the third audio data corresponding to the translation of the second user input speech, transmit, to the second client device, the third audio data corresponding to the translation of the second user input speech. - View Dependent Claims (13, 14, 15)
-
16. A computer-readable storage device encoded with a computer program, the program comprising instructions that if executed by one or more computers cause the one or more computers to perform operations comprising:
-
receiving, from a server and at a first client device, during a translation process, first audio data corresponding to a translation of first user input speech in a first language to a second language, wherein the first language is determined based at least on a determined geographical location of a second client device and the second language is determined based at least on a determined geographical location of the first client device; providing, for output at the first client device and during the translation process, the first audio data corresponding to the translation of the first user input speech in the first language to the second language; receiving, at the first client device and during the translation process, second audio data corresponding to second user input speech in the second language; determining, by the first client device, to initiate translation of the second audio data corresponding to the second user input speech in the second language; and based on the determination to initiate translation of the second audio data corresponding to the second user input speech in the second language, transmitting the second audio data corresponding to the second user input speech in the second language from the first client device to the server for translation from the second language to the first language, wherein the server is configured to; generate third audio data corresponding to a translation of the second user input speech in the second language to the first language; store the third audio data corresponding to the translation of the second user input speech; and in response to receiving a request from the second client device to retrieve the third audio data corresponding to the translation of the second user input speech, transmit, to the second client device, the third audio data corresponding to the translation of the second user input speech. - View Dependent Claims (17, 18, 19)
-
-
20. A computer-implemented method comprising:
-
transmitting, from a server and to a first client device, during a translation process, first audio data corresponding to a translation of first user input speech in a first language to a second language, wherein the first language is determined based at least on a determined geographical location of a second client device and the second language is determined based at least on a determined geographical location of the first client device; receiving, at the server and from the first client device, and during the translation process, second audio data corresponding to second user input speech in the second language; generating, at the server, third audio data corresponding to a translation of the second user input speech in the second language to the first language; storing, at the server, the third audio data corresponding to the translation of the second user input speech; receiving, at the server and from the second client device after having stored the third audio data corresponding to the translation of the second user input speech, a request to provide the second client device with the third audio data; and in response to receiving the request to provide the second client device with the third audio data corresponding to the translation of the second user input speech, transmitting, from the server to the second client device, the third audio data corresponding to the translation of the second user input speech.
-
Specification