Systems and methods for determining and displaying multi-line foreign language translations in real time on mobile devices
Abstract
The present invention relates to systems and methods for translating language text on a mobile camera device offline, without access to the Internet. More specifically, it relates to systems and methods for displaying text of a first language together with a translation of that text into a second language, shown in real time in augmented reality on the mobile device. The processing can use a single-line or a multiline algorithm designed with a plurality of processing innovations to ensure accurate real-time translations without motion jitter. The invention may be used to help travelers in a foreign country who have difficulty reading and understanding text written in the local language of that country. It may also be used with wearable computers or glasses to produce seamless augmented reality foreign language translations. Some embodiments are particularly useful for translating Asian languages into English. Some embodiments are especially optimized to translate multiple lines of foreign language text.
30 Claims
1. A method for providing one or more translations in a real-time video feed of a first language into a second language, comprising:
- cropping a frame of the real-time video feed of one or more words of the first language to fit inside a bounding box to produce a cropped frame;
- performing character segment detection on the cropped frame to produce a plurality of character segments;
- performing character merging on the character segments to produce a plurality of merged character segments while determining at least a shape score for at least one merged character segment;
- performing character recognition on the merged character segments by utilizing at least the shape score of the at least one merged character segment to produce a plurality of recognized characters with high scores;
- performing one or more translations on the recognized characters of the first language into one or more translated words of the second language; and
- displaying the translated words of the second language.
Dependent claims: 2 to 23.
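The order of the steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. The frame representation (a row-major list of grayscale rows), the `Box` type, and the names `crop_frame` and `translate_frame` are hypothetical, and the detection, merging, recognition, and translation stages are passed in as placeholder callables because this section does not describe their internals.

```python
from dataclasses import dataclass
from typing import Any, Callable, List

# Hypothetical frame representation: row-major grid of grayscale pixel values.
Frame = List[List[int]]


@dataclass(frozen=True)
class Box:
    """Bounding box in pixel coordinates; right and bottom are exclusive."""
    left: int
    top: int
    right: int
    bottom: int


def crop_frame(frame: Frame, box: Box) -> Frame:
    """Keep only the pixels of the frame that fall inside the bounding box."""
    return [row[box.left:box.right] for row in frame[box.top:box.bottom]]


def translate_frame(
    frame: Frame,
    box: Box,
    detect_segments: Callable[[Frame], Any],
    merge_segments: Callable[[Any], Any],
    recognize: Callable[[Any], Any],
    translate: Callable[[Any], Any],
) -> Any:
    """Run the claimed steps in order; every stage is a caller-supplied stand-in."""
    cropped = crop_frame(frame, box)      # crop to the bounding box
    segments = detect_segments(cropped)   # character segment detection
    merged = merge_segments(segments)     # merging, with shape scores attached
    characters = recognize(merged)        # recognition that uses the shape scores
    return translate(characters)          # translated words, ready for display
```

Passing the stages as functions keeps the sketch independent of any particular detector, recognizer, or dictionary; a real system would also repeat this per frame of the video feed and render the result over the original text.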
24. A mobile device for providing one or more translations in a real-time video feed of a first language into a second language, the mobile device comprising:
- a video camera for capturing the video feed of one or more words in the first language which need translation;
- a display for displaying the words of the first language and one or more translated words of the second language;
- a processor for processing program code; and
- one or more memories operatively connected to the processor for storing the program code, which when executed by the processor causes the processor to execute a process to:
- crop a frame of the real-time video feed of the words of the first language to fit inside a bounding box to produce a cropped frame;
- perform character segment detection on the cropped frame to produce a plurality of character segments;
- perform character merging on the character segments to produce a plurality of merged character segments while determining at least a shape score for at least one merged character segment;
- perform character recognition on the merged character segments by utilizing at least the shape score of the at least one merged character segment to produce a plurality of recognized characters with high scores;
- perform one or more translations on the recognized characters of the first language into the one or more translated words of the second language; and
- display the translated words of the second language.
Dependent claims: 25 to 27.
28. A non-transitory, computer-readable digital storage medium for storing program code for translating a video feed in real-time from a first language to a second language, the program code comprising:
- program code to crop a frame of the real-time video feed of one or more words of the first language to fit inside a bounding box to produce a cropped frame;
- program code to perform character segment detection on the cropped frame to produce a plurality of character segments;
- program code to perform character merging on the character segments to produce a plurality of merged character segments while determining at least a shape score for at least one merged character segment;
- program code to perform character recognition on the merged character segments by utilizing at least the shape score of the at least one merged character segment to produce a plurality of recognized characters with high scores;
- program code to perform one or more translations on the recognized characters of the first language into one or more translated words of the second language; and
- program code to display the translated words of the second language.
Dependent claims: 29 and 30.
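The independent claims recite character merging that determines a shape score, and recognition that uses that score, but this section does not define how the score is computed. The Python sketch below is therefore an assumption throughout: it scores a segment by how close its bounding box is to a square (a plausible heuristic for the Asian scripts mentioned in the abstract), merges neighbouring segments greedily when the merged box scores higher than either part, and weights a recognizer confidence by the shape score. The names `Segment`, `shape_score`, `merge_segments`, and `combined_score` are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Segment:
    """Bounding box of a detected character fragment; right/bottom exclusive."""
    left: int
    top: int
    right: int
    bottom: int


def shape_score(seg: Segment) -> float:
    """Assumed heuristic: 1.0 for a square box, lower for elongated boxes."""
    width = seg.right - seg.left
    height = seg.bottom - seg.top
    if width <= 0 or height <= 0:
        return 0.0
    return min(width, height) / max(width, height)


def union(a: Segment, b: Segment) -> Segment:
    """Smallest box that contains both segments."""
    return Segment(min(a.left, b.left), min(a.top, b.top),
                   max(a.right, b.right), max(a.bottom, b.bottom))


def merge_segments(segments: List[Segment]) -> List[Tuple[Segment, float]]:
    """Greedy left-to-right merge; returns (segment, shape score) pairs."""
    merged: List[Tuple[Segment, float]] = []
    for seg in sorted(segments, key=lambda s: s.left):
        score = shape_score(seg)
        if merged:
            prev, prev_score = merged[-1]
            candidate = union(prev, seg)
            candidate_score = shape_score(candidate)
            # Merge only when the combined box is squarer than both parts.
            if candidate_score > max(prev_score, score):
                merged[-1] = (candidate, candidate_score)
                continue
        merged.append((seg, score))
    return merged


def combined_score(recognizer_confidence: float, shape: float) -> float:
    """Assumed way to let the shape score influence recognition ranking."""
    return recognizer_confidence * shape
```

With this heuristic, two narrow fragments that together form a square box are joined into one character, while a neighbouring character that is already square is left alone because merging it would make the box more elongated.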
Specification