Low resolution OCR for camera acquired documents
First Claim
1. A system that facilitates optical character recognition (OCR) of low resolution symbols, comprising:
- a segmentation component that facilitates segmentation of a symbol in an image; and
a recognition component that recognizes the symbol substantially simultaneously with segmentation thereof.
2 Assignments
0 Petitions
Accused Products
Abstract
A global optimization framework for optical character recognition (OCR) of low-resolution photographed documents that combines a binarization-type process, segmentation, and recognition into a single process. The framework includes a machine learning approach trained on a large amount of data. A convolutional neural network can be employed to compute a classification function at multiple positions and take grey-level input which eliminates binarization. The framework utilizes preprocessing, layout analysis, character recognition, and word recognition to output high recognition rates. The framework also employs dynamic programming and language models to arrive at the desired output.
-
Citations
40 Claims
-
1. A system that facilitates optical character recognition (OCR) of low resolution symbols, comprising:
-
a segmentation component that facilitates segmentation of a symbol in an image; and
a recognition component that recognizes the symbol substantially simultaneously with segmentation thereof. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A system that facilitates OCR of low resolution camera-acquired documents, comprising:
-
a segmentation component that facilitates segmentation of a symbol in an image;
a language model that facilitates processing a character in a string of characters; and
a dynamic programming component that facilitates the recognition of the string of characters as a word. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
-
21. A computer-readable medium having computer-executable instructions for a method of performing low resolution OCR of a camera-acquired document, the method comprising:
-
receiving the photographed document having a plurality of imaged symbols;
performing layout analysis to detect an associated arrangement of the imaged symbols on the document;
deconstructing the associated arrangement into one or more sets of the imaged symbols by detecting spaces between the imaged symbols;
segmenting the sets of imaged symbols into separate imaged symbols;
computing a score for each imaged symbol at a horizontal position, at a higher horizontal resolution;
combining the scores of each of the imaged symbols at the horizontal positions into a total score, which total score is used to determine a word; and
outputting the word that is representative of one of the sets of imaged symbols. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28)
-
-
29. A method of performing low resolution OCR of a photographed document, comprising:
-
preprocessing the photographed document to adjust for imperfections introduced into the photographed document;
analyzing a layout of the document to determine lines of text;
breaking the lines of text into individual words;
indicating bounds for each of the individual words;
recognizing characters in each of the individual words using a machine learning classification algorithm; and
recognizing the individual words with a dynamic programming algorithm to determine which individual word is at a given location. - View Dependent Claims (30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40)
-
Specification