Identifying matching canonical documents consistent with visual query structural information
First Claim
1. A method performed by data processing apparatus, the method comprising:
- receiving, from a device, an image query that includes an image;
identifying textual characters in a region of the image and structural information associated with the textual characters in the region of the image, the structural information specifying a position of at least one of the textual characters with respect to one or more reference point elements in the image of the image query;
retrieving, using one or more of the textual characters and the structural information, a canonical document that includes the one or more textual characters at a location in the canonical document that is consistent with the structural information; and
sending, to the device, at least a portion of the canonical document.
2 Assignments
0 Petitions
Accused Products
Abstract
A server system receives a visual query from a client system, performs optical character recognition (OCR) on the visual query to produce text recognition data representing textual characters, including a plurality of textual characters in a contiguous region of the visual query. The server system also produces structural information associated with the textual characters in the visual query. Textual characters in the plurality of textual characters are scored. The method further includes identifying, in accordance with the scoring, one or more high quality textual strings, each comprising a plurality of high quality textual characters from among the plurality of textual characters in the contiguous region of the visual query. A canonical document that includes the one or more high quality textual strings and that is consistent with the structural information is retrieved. At least a portion of the canonical document is sent to the client system.
64 Citations
20 Claims
-
1. A method performed by data processing apparatus, the method comprising:
-
receiving, from a device, an image query that includes an image; identifying textual characters in a region of the image and structural information associated with the textual characters in the region of the image, the structural information specifying a position of at least one of the textual characters with respect to one or more reference point elements in the image of the image query; retrieving, using one or more of the textual characters and the structural information, a canonical document that includes the one or more textual characters at a location in the canonical document that is consistent with the structural information; and sending, to the device, at least a portion of the canonical document. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system comprising:
-
a data processing apparatus; a memory storage apparatus in data communication with the data processing apparatus, the memory storage apparatus storing instructions executable by the data processing apparatus and that upon such execution cause the data processing apparatus to perform operations comprising; receiving, from a device, an image query that includes an image; identifying textual characters in a region of the image and structural information associated with the textual characters in the region of the image, the structural information specifying a position of at least one of the textual characters with respect to one or more reference point elements in the image of the image query; retrieving, using one or more of the textual characters and the structural information, a canonical document that includes the one or more textual characters at a location in the canonical document that is consistent with the structural information; and sending, to the device, at least a portion of the canonical document. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A computer storage device encoded with a computer program, the program comprising instructions that when executed by data processing apparatus cause the data processing apparatus to perform operations comprising:
-
receiving, from a device, an image query that includes an image; identifying textual characters in a region of the image and structural information associated with the textual characters in the region of the image, the structural information specifying a position of at least one of the textual characters with respect to one or more reference point elements in the image of the image query; retrieving, using one or more of the textual characters and the structural information, a canonical document that includes the one or more textual characters at a location in the canonical document that is consistent with the structural information; and
sending, to the device, at least a portion of the canonical document. - View Dependent Claims (18, 19, 20)
-
Specification