Low resolution OCR for camera acquired documents
First Claim
1. An optical character recognition system that facilitates text recognition on a low resolution image, comprising:
- at least one processor that executes;
a layout analysis component that determines a set of text lines in the low resolution image, the layout analysis component further segments each text line in the set of lines into individual text words, wherein the layout analysis employs at least two linear filters at each location of the low resolution image;
a character recognition component that segments the individual text words into one or more character portions and provides an observation on the most probable character for each of the one or more character portions; and
a word recognizer that employs dynamic programming mechanisms to ascertain words based upon a series of observations from the character recognition component.
2 Assignments
0 Petitions
Accused Products
Abstract
A global optimization framework for optical character recognition (OCR) of low-resolution photographed documents that combines a binarization-type process, segmentation, and recognition into a single process. The framework includes a machine learning approach trained on a large amount of data. A convolutional neural network can be employed to compute a classification function at multiple positions and take grey-level input which eliminates binarization. The framework utilizes preprocessing, layout analysis, character recognition, and word recognition to output high recognition rates. The framework also employs dynamic programming and language models to arrive at the desired output.
-
Citations
20 Claims
-
1. An optical character recognition system that facilitates text recognition on a low resolution image, comprising:
-
at least one processor that executes; a layout analysis component that determines a set of text lines in the low resolution image, the layout analysis component further segments each text line in the set of lines into individual text words, wherein the layout analysis employs at least two linear filters at each location of the low resolution image; a character recognition component that segments the individual text words into one or more character portions and provides an observation on the most probable character for each of the one or more character portions; and a word recognizer that employs dynamic programming mechanisms to ascertain words based upon a series of observations from the character recognition component. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A computer-implemented method for performing optical character recognition on a low resolution image, comprising:
-
receiving a low resolution image of a text document; a processor for implementing the following steps; identifying lines of text in the low resolution image to generate a set of text lines; partitioning each text lines in the set of text lines into a plurality of text words; segmenting each text word into character portions; determining an observation that includes a probable character imaged in the character portion of the low resolution image, wherein determining the observation comprises employing a convolutional neural network; and employing dynamic programming techniques to evaluate a series of observations related to probable characters to provide a word determination. - View Dependent Claims (17, 18, 19)
-
-
20. An optical character recognition system, comprising:
-
at least one processor coupled to memory, the at least one processor configured to act as; means for receiving a low resolution image of a text document; means for identifying lines of text in the low resolution image to generate a set of text lines; means for partitioning each text lines in the set of text lines into a plurality of text words; means for segmenting each text word into character portions; means for determining an observation that includes a probable character imaged in the character portion of the low resolution image, wherein the means for determining the observation comprises employing a convolutional neural network; and means for employing dynamic programming techniques to evaluate a series of observations related to probable characters to provide a word determination.
-
Specification