×

Image quality assessment and improvement for performing optical character recognition

  • US 10,108,883 B2
  • Filed: 10/28/2016
  • Issued: 10/23/2018
  • Est. Priority Date: 10/28/2016
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method for identifying information in an electronic document, comprising:

  • obtaining a reference image of the electronic document;

    distorting the reference image by adjusting parameter values for a plurality of sets of parameters associated with a quality of the reference image to generate a plurality of distorted images;

    for each distorted image;

    analyzing the distorted image to attempt to detect a first set of parameters from the plurality of sets of parameters and corresponding parameter values used to generate the distorted image;

    determining an accuracy of detection of the first set of parameters and the corresponding parameter values used to generate the distorted image, the determining including;

    comparing each detected parameter determined as a result of the analyzing the distorted image with the first set of parameters used for generating the distorted image, anddetermining the accuracy of the detection based on the comparison; and

    training a model based at least on the plurality of distorted images and respective accuracies of the detection to generate a trained model;

    obtaining a second image of the electronic document;

    determining, based on the trained model, a second set of parameters to be adjusted in the second image and a value corresponding to each parameter in the second set by which the parameter is to be adjusted;

    determining, based on the trained model, at least one technique for adjusting each parameter in the second set of parameters in the second image to prepare the second image for optical character recognition (OCR);

    preparing the second image for the OCR by adjusting each determined parameter in the second set of parameters by a corresponding determined value based on a corresponding determined technique for the determined parameter to generate a prepared second image; and

    performing OCR on the prepared second image.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×