Optical character recognition of series of images
First Claim
1. A method, comprising:
receiving, by a processing device, a current image of a series of images of an original document, wherein the current image at least partially overlaps with a previous image of the series of images;
performing optical character recognition (OCR) of the current image to produce an OCR text and a corresponding text layout;
identifying, using the OCR text and the corresponding text layout, a plurality of textual artifacts in each of the current image and the previous image, wherein each textual artifact is represented by a sequence of symbols that has a frequency of occurrence within the OCR text falling below a threshold frequency;
identifying, in each of the current image and the previous image, a corresponding plurality of base points, wherein each base point is associated with at least one textual artifact of the plurality of textual artifacts;
identifying, using coordinates of matching base points in the current image and the previous image, parameters of a coordinate transformation converting coordinates of the previous image into coordinates of the current image;
associating, using the coordinate transformation, at least part of the OCR text with a cluster of a plurality of clusters of symbol sequences, wherein the OCR text is produced by processing the current image and wherein the symbol sequences are produced by processing one or more previously received images of the series of images;
identifying an order of clusters of symbol sequences, the order reflecting a layout of the original document; and
producing, in view of the order of clusters, a resulting OCR text representing at least a portion of the original document.
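The artifact and base-point steps of the claim can be illustrated with a minimal Python sketch. It is not taken from the specification: the `Word` record, the count-based `threshold`, the choice of the bounding-box center as the base point, and matching by identical text are all assumptions made for illustration.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Word:
    """Hypothetical OCR output item: recognized text plus its bounding box."""
    text: str
    x: float  # left edge
    y: float  # top edge
    w: float  # width
    h: float  # height


def rare_words(words, threshold=2):
    """Return words whose text occurs fewer than `threshold` times in the OCR text."""
    counts = Counter(word.text for word in words)
    return [word for word in words if counts[word.text] < threshold]


def base_point(word):
    """Use the center of the bounding box as the base point (an assumption)."""
    return (word.x + word.w / 2, word.y + word.h / 2)


def match_base_points(prev_words, cur_words, threshold=2):
    """Pair base points of rare words that appear in both images.

    With the default threshold each rare word is unique within its image,
    so the text alone identifies the match. With a larger threshold a
    repeated word keeps only its last occurrence here; a real matcher
    would have to disambiguate such cases.
    """
    prev = {w.text: base_point(w) for w in rare_words(prev_words, threshold)}
    cur = {w.text: base_point(w) for w in rare_words(cur_words, threshold)}
    return [(prev[t], cur[t]) for t in sorted(prev.keys() & cur.keys())]
```

Frequent words such as "the" are excluded because they cannot be matched unambiguously between overlapping frames, which is the stated reason for the frequency threshold in the claim.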
Abstract
Systems and methods are described for receiving a current image that partially overlaps with a previous image of a series of images of an original document; performing optical character recognition (OCR) of the current image, producing an OCR text and a corresponding text layout; identifying textual artifacts in the current and previous images, each represented by a sequence of symbols having a frequency of occurrence within the OCR text below a threshold frequency; identifying corresponding base points associated with textual artifacts; identifying parameters of a coordinate transformation converting coordinates of the previous image into coordinates of the current image; associating part of the OCR text with a cluster of symbol sequences, wherein the symbol sequences are produced by processing previously received images; identifying an order of clusters of symbol sequences reflecting a layout of the original document; and producing a resulting OCR text representing a portion of the original document.
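The coordinate-transformation step can be sketched as a least-squares fit over matched base points. The abstract does not say which family of transformations is used, so the affine model below is an assumption chosen for brevity. The function names `solve_linear`, `fit_affine`, and `apply_affine` are hypothetical.

```python
def solve_linear(a, b):
    """Solve a*x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("base points are degenerate (for example, collinear)")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [v - factor * p for v, p in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def fit_affine(pairs):
    """Fit x' = a*x + b*y + c and y' = d*x + e*y + f by least squares.

    `pairs` is a list of ((x, y), (x', y')) tuples, mapping a base point in
    the previous image to its match in the current image.
    """
    if len(pairs) < 3:
        raise ValueError("need at least three matched base points")
    rows = [(x, y, 1.0) for (x, y), _ in pairs]
    # Normal equations: (R^T R) * coeffs = R^T * target, solved once per axis.
    normal = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    coeffs = []
    for axis in (0, 1):
        rhs = [sum(r[i] * cur[axis] for r, (_, cur) in zip(rows, pairs))
               for i in range(3)]
        coeffs.append(solve_linear(normal, rhs))
    return coeffs  # [[a, b, c], [d, e, f]]


def apply_affine(coeffs, point):
    """Map a previous-image point into current-image coordinates."""
    x, y = point
    return tuple(a * x + b * y + c for a, b, c in coeffs)
```

Once fitted, the transformation lets the bounding box of a symbol sequence from an earlier frame be projected into the current frame, so that overlapping OCR results can be assigned to the same cluster.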
22 Claims
1. A method, comprising:
receiving, by a processing device, a current image of a series of images of an original document, wherein the current image at least partially overlaps with a previous image of the series of images;
performing optical character recognition (OCR) of the current image to produce an OCR text and a corresponding text layout;
identifying, using the OCR text and the corresponding text layout, a plurality of textual artifacts in each of the current image and the previous image, wherein each textual artifact is represented by a sequence of symbols that has a frequency of occurrence within the OCR text falling below a threshold frequency;
identifying, in each of the current image and the previous image, a corresponding plurality of base points, wherein each base point is associated with at least one textual artifact of the plurality of textual artifacts;
identifying, using coordinates of matching base points in the current image and the previous image, parameters of a coordinate transformation converting coordinates of the previous image into coordinates of the current image;
associating, using the coordinate transformation, at least part of the OCR text with a cluster of a plurality of clusters of symbol sequences, wherein the OCR text is produced by processing the current image and wherein the symbol sequences are produced by processing one or more previously received images of the series of images;
identifying an order of clusters of symbol sequences, the order reflecting a layout of the original document; and
producing, in view of the order of clusters, a resulting OCR text representing at least a portion of the original document.
Dependent claims: 2-14
15. A system, comprising:
a memory;
a processing device, coupled to the memory, the processing device configured to:
receive a current image of a series of images of an original document, wherein the current image at least partially overlaps with a previous image of the series of images;
perform optical character recognition (OCR) of the current image to produce an OCR text and a corresponding text layout;
identify, using the OCR text and the corresponding text layout, a plurality of textual artifacts in each of the current image and the previous image, wherein each textual artifact is represented by a sequence of symbols that has a frequency of occurrence within the OCR text falling below a threshold frequency;
identify, in each of the current image and the previous image, a corresponding plurality of base points, wherein each base point is associated with at least one textual artifact of the plurality of textual artifacts;
identify, using coordinates of matching base points in the current image and the previous image, parameters of a coordinate transformation converting coordinates of the previous image into coordinates of the current image;
associate, using the coordinate transformation, at least part of the OCR text with a cluster of a plurality of clusters of symbol sequences, wherein the OCR text is produced by processing the current image and wherein the symbol sequences are produced by processing one or more previously received images of the series of images;
identify an order of clusters of symbol sequences, the order reflecting a layout of the original document; and
produce, in view of the order of clusters, a resulting OCR text representing at least a portion of the original document.
Dependent claims: 16-18
19. A computer-readable non-transitory storage medium comprising executable instructions that, when executed by a processing device, cause the processing device to:
receive a current image of a series of images of an original document, wherein the current image at least partially overlaps with a previous image of the series of images;
perform optical character recognition (OCR) of the current image to produce an OCR text and a corresponding text layout;
identify, using the OCR text and the corresponding text layout, a plurality of textual artifacts in each of the current image and the previous image, wherein each textual artifact is represented by a sequence of symbols that has a frequency of occurrence within the OCR text falling below a threshold frequency;
identify, in each of the current image and the previous image, a corresponding plurality of base points, wherein each base point is associated with at least one textual artifact of the plurality of textual artifacts;
identify, using coordinates of matching base points in the current image and the previous image, parameters of a coordinate transformation converting coordinates of the previous image into coordinates of the current image;
associate, using the coordinate transformation, at least part of the OCR text with a cluster of a plurality of clusters of symbol sequences, wherein the OCR text is produced by processing the current image and wherein the symbol sequences are produced by processing one or more previously received images of the series of images;
identify an order of clusters of symbol sequences, the order reflecting a layout of the original document; and
produce, in view of the order of clusters, a resulting OCR text representing at least a portion of the original document.
Dependent claims: 20-22
Specification