Presenting translations of text depicted in images
First Claim
1. A method performed by data processing apparatus, the method comprising:
- receiving an image from a camera of a user device;
identifying text depicted in the image, the identified text being in two or more text blocks identified in the image, the two or more text blocks including a first text block and a second text block distinct from the first text block, the identified text being in a first language;
processing, by the data processing apparatus, the image to determine a relative prominence between the two or more text blocks and to determine, from a plurality of different prominence presentation contexts, a prominence presentation context for presenting a translation of text depicted in the image based on the relative prominence, wherein each prominence presentation context corresponds to a relative prominence of each text block in which text is presented within images to other text blocks identified in the images, and each prominence presentation context has a corresponding graphical user interface for presenting a translation of a different portion of the identified text than each other prominence presentation context;
determining, based on the selected prominence presentation context, that a translation of a single text block, of the two or more text blocks, will be presented using the graphical user interface corresponding to the selected prominence presentation context;
in response to determining that a translation of a single block of text will be presented using the graphical user interface corresponding to the selected prominence presentation context, selecting, between the first text block and the second text block and based on a size of the text in the first text block and a location of the first text block within the image relative to a size of the text in the second text block and a location of the second text block within the image, first text block as the single text block for which a translation will be presented using the graphical user interface corresponding to the selected prominence presentation context;
presenting, at a display of the user device, the translation of the text in the first text block in an overlay over the image using the graphical user interface corresponding to the selected prominence presentation context, while presenting the text in the second text block in the first language and in the image.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for presenting additional information for text depicted by an image. In one aspect, a method includes receiving an image. Text depicted in the image is identified. The identified text can be in one or more text blocks. A prominence presentation context is selected for the image based on the relative prominence of the one or more text blocks. Each prominence presentation context corresponds to a relative prominence of each text block in which text is presented within images. Each prominence presentation context has a corresponding user interface for presenting additional information related to the identified text depicted in the image. A user interface is identified that corresponds to the selected prominence presentation context. Additional information is presented for at least a portion of the text depicted in the image using the identified user interface.
-
Citations
12 Claims
-
1. A method performed by data processing apparatus, the method comprising:
-
receiving an image from a camera of a user device; identifying text depicted in the image, the identified text being in two or more text blocks identified in the image, the two or more text blocks including a first text block and a second text block distinct from the first text block, the identified text being in a first language; processing, by the data processing apparatus, the image to determine a relative prominence between the two or more text blocks and to determine, from a plurality of different prominence presentation contexts, a prominence presentation context for presenting a translation of text depicted in the image based on the relative prominence, wherein each prominence presentation context corresponds to a relative prominence of each text block in which text is presented within images to other text blocks identified in the images, and each prominence presentation context has a corresponding graphical user interface for presenting a translation of a different portion of the identified text than each other prominence presentation context; determining, based on the selected prominence presentation context, that a translation of a single text block, of the two or more text blocks, will be presented using the graphical user interface corresponding to the selected prominence presentation context; in response to determining that a translation of a single block of text will be presented using the graphical user interface corresponding to the selected prominence presentation context, selecting, between the first text block and the second text block and based on a size of the text in the first text block and a location of the first text block within the image relative to a size of the text in the second text block and a location of the second text block within the image, first text block as the single text block for which a translation will be presented using the graphical user interface corresponding to the selected prominence presentation context; presenting, at a display of the user device, the translation of the text in the first text block in an overlay over the image using the graphical user interface corresponding to the selected prominence presentation context, while presenting the text in the second text block in the first language and in the image. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system, comprising:
-
a data processing apparatus; and a memory storage apparatus in data communication with the data processing apparatus, the memory storage apparatus storing instructions executable by the data processing apparatus and that upon such execution cause the data processing apparatus to perform operations comprising; receiving an image from a camera of a user device; identifying text depicted in the image, the identified text being in two or more text blocks identified in the image, the two or more text blocks including a first text block and a second text block distinct from the first text block, the identified text being in a first language; processing, by the data processing apparatus, the image to determine a relative prominence between the two or more text blocks and to determine, from a plurality of different prominence presentation contexts, a prominence presentation context for presenting a translation of text depicted in the image based on the relative prominence, wherein each prominence presentation context corresponds to a relative prominence of each text block in which text is presented within images to other text blocks identified in the images, and each prominence presentation context has a corresponding graphical user interface for presenting a translation of a different portion of the identified text than each other prominence presentation context; determining, based on the selected prominence presentation context, that a translation of a single text block, of the two or more text blocks, will be presented using the graphical user interface corresponding to the selected prominence presentation context; in response to determining that a translation of a single block of text will be presented using the graphical user interface corresponding to the selected prominence presentation context, selecting, between the first text block and the second text block and based on a size of the text in the first text block and a location of the first text block within the image relative to a size of the text in the second text block and a location of the second text block within the image, first text block as the single text block for which a translation will be presented using the graphical user interface corresponding to the selected prominence presentation context; presenting, at a display of the user device, the translation of the text in the first text block in an overlay over the image using the graphical user interface corresponding to the selected prominence presentation context, while presenting the text in the second text block in the first language and in the image. - View Dependent Claims (9, 10, 11)
-
-
12. A non-transitory computer storage medium encoded with a computer program, the program comprising instructions that when executed by a data processing apparatus cause the data processing apparatus to perform operations comprising:
-
receiving an image from a camera of a user device; identifying text depicted in the image, the identified text being in two or more text blocks identified in the image, the two or more text blocks including a first text block and a second text block distinct from the first text block, the identified text being in a first language; processing, by the data processing apparatus, the image to determine a relative prominence between the two or more text blocks and to determine, from a plurality of different prominence presentation contexts, a prominence presentation context for presenting a translation of text depicted in the image based on the relative prominence, wherein each prominence presentation context corresponds to a relative prominence of each text block in which text is presented within images to other text blocks identified in the images, and each prominence presentation context has a corresponding graphical user interface for presenting a translation of a different portion of the identified text than each other prominence presentation context; determining, based on the selected prominence presentation context, that a translation of a single text block, of the two or more text blocks, will be presented using the graphical user interface corresponding to the selected prominence presentation context; in response to determining that a translation of a single block of text will be presented using the graphical user interface corresponding to the selected prominence presentation context, selecting, between the first text block and the second text block and based on a size of the text in the first text block and a location of the first text block within the image relative to a size of the text in the second text block and a location of the second text block within the image, first text block as the single text block for which a translation will be presented using the graphical user interface corresponding to the selected prominence presentation context; presenting, at a display of the user device, the translation of the text in the first text block in an overlay over the image using the graphical user interface corresponding to the selected prominence presentation context, while presenting the text in the second text block in the first language and in the image.
-
Specification