Document Scanner
First Claim
1. A computer implemented method, the method comprising:
receiving a scan request, wherein the scan request includes a plurality of text images, each text image representing a portion of a text document, wherein the plurality of text images includes a first text image and a second text image that at least partially overlap;
for each text image of the plurality of text images:
rectifying the text image to generate a text image with parallel image lines,
generating a plurality of word bounding boxes that enclose one or more connected components in the text image, wherein each word bounding box is associated with a respective word, and
generating, for each respective word in the text image, a plurality of points that represent the respective word;
combining the plurality of text images to form a single text document including combining the first text image of the plurality of text images and the second text image of the plurality of text images by matching a first set of points associated with the first text image and a second set of points associated with the second text image; and
providing the combined image as a scannable document.
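The rectifying step can be illustrated with a minimal sketch. The claim does not say how rectification is performed, so the following assumes the simplest possible case: the text lines are skewed by a single rotation, and a set of baseline points for one text line is already known. The function names `skew_angle` and `rotate` and the sample `baseline` are hypothetical and chosen for illustration only; a real rectifier would likely also need to handle perspective and page curvature.

```python
import math

def skew_angle(points):
    """Estimate the angle (radians) of a least-squares line through the points.

    Assumption for this sketch: the points lie along one text baseline.
    """
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    # atan2(sxy, sxx) is the angle of the fitted slope sxy / sxx.
    return math.atan2(sxy, sxx)

def rotate(points, angle):
    """Rotate every point about the origin by the given angle (radians)."""
    c, s = math.cos(angle), math.sin(angle)
    return [(x * c - y * s, x * s + y * c) for x, y in points]

# Hypothetical baseline points sampled from a text line with slope 0.5.
baseline = [(0, 1.0), (2, 2.0), (4, 3.0), (6, 4.0)]
angle = skew_angle(baseline)
# Rotating by the negative skew angle makes the baseline horizontal.
flattened = rotate(baseline, -angle)
```

After the rotation, all points in `flattened` share the same vertical coordinate up to floating-point error, which is the rotation-only analogue of producing parallel image lines.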
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, to generate a scannable document. In one aspect, a method includes receiving a scan request, wherein the scan request includes a plurality of text images; for each text image of the plurality of text images: rectifying the text image to generate a text image with parallel image lines, generating a plurality of word bounding boxes that enclose one or more connected components in the text image, wherein each word bounding box is associated with a respective word, and generating, for each respective word in the text image, a plurality of points that represent the respective word; combining the plurality of text images to form a single text document; and providing the combined image as a scannable document.
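The bounding-box and point-generation steps in the abstract can be sketched as follows. This is an illustration under stated assumptions, not the method of the specification: the image is taken to be a small binary grid, components are 4-connected, components on the same line are grouped into a word when the horizontal gap between them is at most `max_gap` pixels, and the points that represent a word are simply the four corners of its bounding box. The names `connected_components`, `word_boxes`, `word_points`, and `max_gap` are hypothetical.

```python
from collections import deque

def connected_components(grid):
    """Return one bounding box (x0, y0, x1, y1), inclusive, per 4-connected component."""
    h = len(grid)
    w = len(grid[0]) if h else 0
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for y in range(h):
        for x in range(w):
            if not grid[y][x] or seen[y][x]:
                continue
            # Breadth-first flood fill from this unvisited foreground pixel.
            seen[y][x] = True
            queue = deque([(x, y)])
            x0 = x1 = x
            y0 = y1 = y
            while queue:
                cx, cy = queue.popleft()
                x0, x1 = min(x0, cx), max(x1, cx)
                y0, y1 = min(y0, cy), max(y1, cy)
                for nx, ny in ((cx + 1, cy), (cx - 1, cy), (cx, cy + 1), (cx, cy - 1)):
                    if 0 <= nx < w and 0 <= ny < h and grid[ny][nx] and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((nx, ny))
            boxes.append((x0, y0, x1, y1))
    return boxes

def word_boxes(component_boxes, max_gap=1):
    """Merge component boxes on the same line whose horizontal gap is at most max_gap."""
    words = []
    for x0, y0, x1, y1 in sorted(component_boxes):
        for i, (wx0, wy0, wx1, wy1) in enumerate(words):
            same_line = y0 <= wy1 and wy0 <= y1  # vertical extents overlap
            if same_line and x0 - wx1 - 1 <= max_gap:
                words[i] = (min(wx0, x0), min(wy0, y0), max(wx1, x1), max(wy1, y1))
                break
        else:
            words.append((x0, y0, x1, y1))
    return words

def word_points(box):
    """Assumed representation for this sketch: the four corners of the word box."""
    x0, y0, x1, y1 = box
    return [(x0, y0), (x1, y0), (x0, y1), (x1, y1)]

# Tiny hypothetical image: '#' is ink, '.' is background.
rows = [
    "##.#...##",
    "#..#...#.",
    ".........",
    "###......",
]
grid = [[ch == "#" for ch in row] for row in rows]
words = word_boxes(connected_components(grid))
points = {box: word_points(box) for box in words}
```

In this toy image the two components at the top left are one pixel apart and are merged into a single word box, while the component at the top right and the one on the lower line stay separate.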
21 Claims
1. A computer implemented method, the method comprising:
receiving a scan request, wherein the scan request includes a plurality of text images, each text image representing a portion of a text document, wherein the plurality of text images includes a first text image and a second text image that at least partially overlap;
for each text image of the plurality of text images:
rectifying the text image to generate a text image with parallel image lines,
generating a plurality of word bounding boxes that enclose one or more connected components in the text image, wherein each word bounding box is associated with a respective word, and
generating, for each respective word in the text image, a plurality of points that represent the respective word;
combining the plurality of text images to form a single text document including combining the first text image of the plurality of text images and the second text image of the plurality of text images by matching a first set of points associated with the first text image and a second set of points associated with the second text image; and
providing the combined image as a scannable document.
Dependent claims: 2, 3, 4, 5, 6, 7
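The combining step of claim 1, in which two overlapping text images are joined by matching their point sets, can be sketched as follows. The claim does not state how the matching is done, so this sketch makes strong simplifying assumptions: both images are already rectified, they differ only by a translation, and each word is represented by the four corners of its bounding box. The offset is found by letting every pair of points vote for a displacement. The names `corners`, `estimate_offset`, and `combine_boxes` and the sample boxes are hypothetical.

```python
from collections import Counter

def corners(box):
    """Assumed point representation: the four corners of a word box."""
    x0, y0, x1, y1 = box
    return [(x0, y0), (x1, y0), (x0, y1), (x1, y1)]

def estimate_offset(points_a, points_b):
    """Return the displacement that maps the most points of B onto points of A."""
    votes = Counter(
        (ax - bx, ay - by) for ax, ay in points_a for bx, by in points_b
    )
    offset, count = votes.most_common(1)[0]
    return offset, count

def combine_boxes(boxes_a, boxes_b):
    """Shift B's word boxes into A's coordinate frame and merge the two sets."""
    points_a = [p for box in boxes_a for p in corners(box)]
    points_b = [p for box in boxes_b for p in corners(box)]
    (dx, dy), _ = estimate_offset(points_a, points_b)
    merged = set(boxes_a)
    for x0, y0, x1, y1 in boxes_b:
        # Words seen in both images collapse onto the same box after shifting.
        merged.add((x0 + dx, y0 + dy, x1 + dx, y1 + dy))
    return (dx, dy), sorted(merged)

# Hypothetical word boxes: the second image overlaps the right part of the first.
first = [(0, 0, 3, 1), (7, 0, 11, 1), (2, 4, 6, 5)]
second = [(5, 0, 9, 1), (0, 4, 4, 5), (11, 4, 14, 5)]
offset, page = combine_boxes(first, second)
```

Here two words appear in both images, so their eight corner points all vote for the same displacement, and the one word that appears only in the second image is placed to the right of the first image's content. Voting on displacements is one simple choice; it would not cope with rotation or scale differences between the images.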
8. A system, comprising:
one or more computers;
one or more storage devices storing instructions that, when executed by the one or more computers, cause the one or more computers to perform operations comprising:
receiving a scan request, wherein the scan request includes a plurality of text images, each text image representing a portion of a text document, wherein the plurality of text images includes a first text image and a second text image that at least partially overlap;
for each text image of the plurality of text images:
rectifying the text image to generate a text image with parallel image lines,
generating a plurality of word bounding boxes that enclose one or more connected components in the text image, wherein each word bounding box is associated with a respective word, and
generating, for each respective word in the text image, a plurality of points that represent the respective word;
combining the plurality of text images to form a single text document including combining the first text image of the plurality of text images and the second text image of the plurality of text images by matching a first set of points associated with the first text image and a second set of points associated with the second text image; and
providing the combined image as a scannable document.
Dependent claims: 9, 10, 11, 12, 13, 14
15. A computer storage medium encoded with a computer program, the program comprising instructions that when executed by one or more computers cause the one or more computers to perform operations comprising:
receiving a scan request, wherein the scan request includes a plurality of text images, each text image representing a portion of a text document, wherein the plurality of text images includes a first text image and a second text image that at least partially overlap;
for each text image of the plurality of text images:
rectifying the text image to generate a text image with parallel image lines,
generating a plurality of word bounding boxes that enclose one or more connected components in the text image, wherein each word bounding box is associated with a respective word, and
generating, for each respective word in the text image, a plurality of points that represent the respective word;
combining the plurality of text images to form a single text document including combining the first text image of the plurality of text images and the second text image of the plurality of text images by matching a first set of points associated with the first text image and a second set of points associated with the second text image; and
providing the combined image as a scannable document.
Dependent claims: 16, 17, 18, 19, 20, 21
Specification