SCANNED TEXT WORD RECOGNITION METHOD AND APPARATUS
First Claim
Patent Images
1. A method for converting digital images to words, the method comprising:
- receiving a digital image comprising text;
generating a binary image from the digital image for each of N binarization threshold values to provide N binary images, where N is greater than or equal to 2;
converting each of the N binary images to text; and
aligning the text from the N binary images to provide a word lattice for the digital image.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for converting digital images to words includes receiving a digital image comprising text, generating a binary image from the digital image for each of N binarization threshold values to provide N binary images, converting each of the N binary images to text, and aligning the text from the N binary images to provide a word lattice for the digital image. Aligning the text may include prioritizing the text from the N binary images according to error rates on a training set. The training set may be a synthetic training set. An apparatus corresponding to the above method is also disclosed herein.
20 Citations
20 Claims
-
1. A method for converting digital images to words, the method comprising:
-
receiving a digital image comprising text; generating a binary image from the digital image for each of N binarization threshold values to provide N binary images, where N is greater than or equal to 2; converting each of the N binary images to text; and aligning the text from the N binary images to provide a word lattice for the digital image. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. An apparatus for converting digital images to words, the apparatus comprising:
-
a processor for executing one or more modules; a binarization module configured to receive a digital image comprising text and generate a binary image from the digital image for each of N binarization threshold values to provide N binary images, where N is greater than or equal to 2; an OCR module configured to convert each of the N binary images to text; and an alignment module configured to align the text from the N binary images to provide a word lattice for the digital image. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A computer readable medium comprising executable instructions for converting digital images to words, wherein the executable instructions comprise the operations of:
-
receiving a digital image comprising text; generating a binary image from the digital image for each of N binarization threshold values to provide N binary images, where N is greater than or equal to 2; converting each of the N binary images to text; and aligning the text from the N binary images to provide a word lattice for the digital image. - View Dependent Claims (20)
-
Specification