×

Optical character recognition

  • US 10,176,392 B2
  • Filed: 01/31/2014
  • Issued: 01/08/2019
  • Est. Priority Date: 01/31/2014
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • receiving, at a processor of a computing system, text outputs from a plurality of optical character recognition (OCR) engines, wherein each of the plurality of OCR engines receives an image of a document and generates an output representative of text depicted in the image of the document;

    analyzing, by the processor, the image of the document to identify metadata describing attributes of the documentidentifying, by the processor, a difference among the text outputs of the plurality of OCR engines;

    resolving, by the processor, the difference among the text outputs of the plurality of OCR engines, by determining a probability of character recognition accuracy for each of the plurality of OCR engines based on the metadata describing the attributes of the document and selecting a character outputted by one of the OCR engines that has a highest probability of character recognition accuracy to be included in an output character set; and

    generating, by the processor, the output character set to represent the text in the document.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×