×

Method for processing optical character recognition (OCR) output data, wherein the output data comprises double printed character images

  • US 8,320,677 B2
  • Filed: 11/24/2008
  • Issued: 11/27/2012
  • Est. Priority Date: 11/30/2007
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for resolving contradicting output data from an Optical Character Recognition (OCR) system, wherein the output data comprises at least one suspected double printed character image, the method comprises:

  • a) searching through the output data identifying images of characters having an image quality above a predefined level, and using these character images as a set of single character template images for characters,b) providing a bounding box around the suspected double printed character image, and then doing a gliding single character image correlation between each respective single template image, one by one, and the suspected double printed character image, and recording the correlation values and corresponding displacement values of the respective character image bodies for each step of movement performed in the gliding single character correlation process,c) selecting single template images having a correlation values above a predefined threshold level to create a list of candidates of combined single character template images aligned relative to each other according to the their corresponding displacement values relative to the bounding box,d) correlating each respective candidate of combined single character template images with the suspected double printed character image, and selects the combined single character template image having the highest correlation value as an identification of each respective character image in the suspected double printed character image.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×