Optical character recognition
First Claim
1. A method comprising:
- receiving, at a processor of a computing system, text outputs from a plurality of optical character recognition (OCR) engines, wherein each of the plurality of OCR engines receives an image of a document and generates an output representative of text depicted in the image of the document;
analyzing, by the processor, the image of the document to identify metadata describing attributes of the documentidentifying, by the processor, a difference among the text outputs of the plurality of OCR engines;
resolving, by the processor, the difference among the text outputs of the plurality of OCR engines, by determining a probability of character recognition accuracy for each of the plurality of OCR engines based on the metadata describing the attributes of the document and selecting a character outputted by one of the OCR engines that has a highest probability of character recognition accuracy to be included in an output character set; and
generating, by the processor, the output character set to represent the text in the document.
2 Assignments
0 Petitions
Accused Products
Abstract
Optical character recognition is described in various implementations. In one example implementation, a method may include receiving a plurality of optical character recognition (OCR) outputs provided by a respective plurality of OCR engines, each of the plurality of OCR outputs being representative of text depicted in a portion of an electronic image. The method may also include identifying a document context associated with the electronic image, and generating an output character set by applying a character resolution model to resolve differences among the plurality of OCR outputs. The character resolution model may define a probability of character recognition accuracy for each of the plurality of OCR engines given the identified document context. The method may also include updating the character resolution model to generate an updated character resolution model such that subsequent generating of output character sets are based on the updated character resolution model.
-
Citations
20 Claims
-
1. A method comprising:
-
receiving, at a processor of a computing system, text outputs from a plurality of optical character recognition (OCR) engines, wherein each of the plurality of OCR engines receives an image of a document and generates an output representative of text depicted in the image of the document; analyzing, by the processor, the image of the document to identify metadata describing attributes of the document identifying, by the processor, a difference among the text outputs of the plurality of OCR engines; resolving, by the processor, the difference among the text outputs of the plurality of OCR engines, by determining a probability of character recognition accuracy for each of the plurality of OCR engines based on the metadata describing the attributes of the document and selecting a character outputted by one of the OCR engines that has a highest probability of character recognition accuracy to be included in an output character set; and generating, by the processor, the output character set to represent the text in the document. - View Dependent Claims (2, 3, 4, 5, 6, 7, 20)
-
-
8. A system comprising:
-
a processor resource; and a memory storing instructions that when executed by the processor resource cause the processor resource to; analyze an image of an input document to identify metadata describing attributes of the input document, receive outputs from a plurality of optical character recognition (OCR) engines, wherein each of the plurality of OCR engines receives the image of the input document and generates an output representative of text depicted in the image of the input document, identify a difference among the outputs of the plurality of OCR engines, resolve the difference among the outputs of the plurality of OCR engines based on a character resolution model that utilizes the metadata describing the attributes of the input document and the outputs of the plurality of OCR engines, the character resolution model causing the processor resource to determine a probability of character recognition accuracy for each of the plurality of OCR engines based on the metadata describing the attributes of the input document, and select a character outputted by one of the OCR engines that has a highest probability of character recognition accuracy to be the character for an output document, and update the character resolution model based on the resolving of the difference to generate an updated character resolution model for subsequent use. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A non-transitory computer-readable storage medium storing instructions that, when executed, cause a processor resource to:
-
receive outputs from a plurality of optical character recognition (OCR) engines, wherein each of the plurality of OCR engines receives an image of a document and generates an output representative of text depicted in the image of the document; analyze the image of the document to identify metadata describing attributes of the document identify a difference among the outputs of the plurality of OCR engines; resolve the difference among the outputs of the plurality of OCR engines, by determining a probability of character recognition accuracy for each of the plurality of OCR engines based on the metadata describing the attributes of the document and selecting a character outputted by one of the OCR engines that has a highest probability of character recognition accuracy to be included in an output character set; and generate the output character set to represent the text in the document. - View Dependent Claims (16, 17, 18, 19)
-
Specification