Language translation of visual and audio input
First Claim
1. A method comprising:
- receiving audio input;
receiving visual input comprising a captured image of a target scene; and
translating the audio input from a first language to a second language based upon a contextual hint, not indicative of the first language, determined based upon a non-textual element identified based upon the visual input.
1 Assignment
0 Petitions
Accused Products
Abstract
The present translation system translates visual input and/or audio input from one language into another language. Some implementations incorporate a context-based translation that uses information obtained from visual input or audio input to aid in the translation of the other input. Other implementations combine the visual and audio translation. The translation system includes visual components and/or audio components. The visual components analyze visual input to identify a textual element and translate the textual element into a translated textual element. The visual image represents a captured image of a target scene. The visual components may further substitute the translated textual element for the textual element in the captured image. The audio components convert audio input into translated audio.
40 Citations
20 Claims
-
1. A method comprising:
-
receiving audio input; receiving visual input comprising a captured image of a target scene; and translating the audio input from a first language to a second language based upon a contextual hint, not indicative of the first language, determined based upon a non-textual element identified based upon the visual input. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system comprising:
-
one or more processing units; and memory comprising instructions that when executed by at least one of the one or more processing units, perform a method comprising; receiving audio input; receiving visual input comprising a captured image; and translating the audio input from a first language to a second language based upon a contextual hint, not indicative of the first language, determined based upon a non-textual element identified based upon the visual input. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A computer-readable storage medium comprising instructions which when executed perform actions, comprising:
-
receiving audio input; receiving visual input comprising a captured image of a target scene; analyzing the visual input to identify a non-textual element; and translating the audio input from a first language to a second language based upon a contextual hint, not indicative of the first language, determined based upon the non-textual element. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification