Verification and correction method and system for optical character recognition
First Claim
1. A method for correcting and verifying results of an optical character recognition process comprising:
- a) performing said optical character recognition on an image including characters to be recognized;
b) performing an automatic context analysis on said image;
c) evaluating results obtained by said automatic context analysis, to identify characters requiring further processing;
d) performing a user-assisted verification and correction of less than all of said characters identified as requiring further processing, thereby producing a partially recognized image, wherein said user-assisted verification and correction is omitted if a predetermined level of confidence of recognition of said image is reached by said evaluating of said results; and
e) repeating steps b-d on said partially recognized image until said predetermined level of confidence of recognition of said image is reached by said evaluating of said results.
1 Assignment
0 Petitions
Accused Products
Abstract
An optical character recognition method and system are provided, employing context analysis and operator input, alternatively and in combination, on the same batch of documents. After automatic character recognition, the context analyzer processes the fields that are good enough to expect resolution. This will accept as many fields as possible without any operator intervention. For some other fields, the process uses operator input to certify the character-level OCR result of, or to enter, a certain percentage of the characters, so that context analysis may accept some of the remaining fields. If the context analyzer successfully identifies a small set of very close hypotheses, the process asks the operator to certify one or two characters to resolve the ambiguity between the hypotheses. For the fields that are still not resolved, the fields and the hypotheses are shown to the operator for acceptance, correction, or entry.
88 Citations
54 Claims
-
1. A method for correcting and verifying results of an optical character recognition process comprising:
-
a) performing said optical character recognition on an image including characters to be recognized; b) performing an automatic context analysis on said image; c) evaluating results obtained by said automatic context analysis, to identify characters requiring further processing; d) performing a user-assisted verification and correction of less than all of said characters identified as requiring further processing, thereby producing a partially recognized image, wherein said user-assisted verification and correction is omitted if a predetermined level of confidence of recognition of said image is reached by said evaluating of said results; and e) repeating steps b-d on said partially recognized image until said predetermined level of confidence of recognition of said image is reached by said evaluating of said results. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 49, 50)
-
-
17. A system for correcting and verifying results of an optical character recognition process comprising:
-
a) means for performing said optical character recognition on an image including characters to be recognized; b) means for performing an automatic context analysis on said image; c) means for evaluating results obtained by said automatic context analysis, to identify characters requiring further processing; d) means for performing a user-assisted verification and correction of less than all of said characters identified as requiring further processing, thereby producing a partially recognized image, wherein said user-assisted verification and correction is omitted if a predetermined level of confidence of recognition said image is reached by said evaluating of said results; and e) means for repeating processing performed by means b-d on said partially recognized image until said predetermined level of confidence of recognition of said image is reached by said evaluating of said results. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 51, 52)
-
-
33. A computer program product, for use with a computer system, for causing the computer system to correct and verify results of an optical character recognition process, the computer program product comprising:
-
a) a computer-readable medium; b) means, provided on the computer-readable medium, for directing the computer system to perform said optical character recognition on an image including characters to be recognized; c) means, provided on the computer-readable medium, for directing the computer system to perform an automatic context analysis on said image; d) means, provided on the computer-readable medium, for directing the computer system to evaluate results obtained by said automatic context analysis performing means, to identify characters requiring further processing; e) means, provided on the computer-readable medium, for directing the computer system to perform a user-assisted verification and correction of less than all of said characters identified as requiring further processing, thereby producing a partially recognized image, wherein said user-assisted verification and correction means is bypassed if a predetermined level of confidence of recognition of said image is reached by said evaluating means; and f) means, provided on the computer-readable medium, for directing the computer system to repeat processing performed by means c-e on said partially recognized image until said predetermined level of confidence of recognition of said image is reached by said evaluating means. - View Dependent Claims (34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 53, 54)
-
Specification