System and method for increasing the accuracy of optical character recognition (OCR)

  • US 9,152,883 B2
  • Filed: 11/02/2009
  • Issued: 10/06/2015
  • Est. Priority Date: 11/02/2009
  • Status: Active Grant
First Claim

1. A method for increasing the accuracy of optical character recognition (OCR) for at least one item, comprising:

  • obtaining OCR results of OCR scanning from at least one OCR module;

  • creating at least one OCR seed using at least a portion of the OCR results, the at least one OCR seed comprising a plurality of imagelets corresponding to each character identified in the at least a portion of the OCR results, wherein the at least one OCR seed is cleaned by selecting imagelets similar to one another for each character identified in the at least a portion of the OCR results;

  • creating at least one OCR learn set using at least a portion of the OCR seed;

  • comparing the at least one OCR learn set to each imagelet to create at least one mismatch distribution of the at least one OCR learn set compared to each imagelet, the at least one mismatch distribution comprising at least one confidence rating including a confidence score for the imagelet compared to at least one possible character; and

  • applying the OCR learn set and the at least one mismatch distribution to the at least one item to obtain additional OCR results such that only possible characters having a confidence score higher than a threshold are considered when applying the at least one mismatch distribution to obtain the additional OCR results.
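
The steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. The claim does not specify how imagelets are represented, how similarity is measured, or how a confidence score is computed, so everything below is an assumption made for illustration: imagelets are flat tuples of binary pixels, similarity is a count of differing pixels, the learn set holds one per-pixel majority template per character, and the confidence score is one minus the fraction of mismatching pixels. All function names and the `max_mismatch` parameter are hypothetical.

```python
from collections import defaultdict


def mismatch(a, b):
    """Number of pixels that differ between two equally sized binary imagelets."""
    return sum(x != y for x, y in zip(a, b))


def majority_template(imagelets):
    """Per-pixel majority vote over a list of imagelets (assumed averaging scheme)."""
    n = len(imagelets)
    return tuple(int(2 * sum(column) > n) for column in zip(*imagelets))


def build_seed(ocr_results):
    """OCR seed: group imagelets by the character the OCR module assigned to them."""
    seed = defaultdict(list)
    for char, imagelet in ocr_results:
        seed[char].append(imagelet)
    return dict(seed)


def clean_seed(seed, max_mismatch):
    """Keep, per character, only imagelets close to that character's majority template."""
    cleaned = {}
    for char, imagelets in seed.items():
        template = majority_template(imagelets)
        kept = [im for im in imagelets if mismatch(im, template) <= max_mismatch]
        if kept:
            cleaned[char] = kept
    return cleaned


def build_learn_set(cleaned_seed):
    """OCR learn set: one template per character, built from the cleaned seed."""
    return {char: majority_template(ims) for char, ims in cleaned_seed.items()}


def confidence_ratings(imagelet, learn_set):
    """Confidence score of one imagelet against every possible character."""
    return {
        char: 1 - mismatch(imagelet, template) / len(template)
        for char, template in learn_set.items()
    }


def recognize(imagelet, learn_set, threshold):
    """Consider only characters scoring above the threshold; return the best, or None."""
    scores = confidence_ratings(imagelet, learn_set)
    candidates = {c: s for c, s in scores.items() if s > threshold}
    return max(candidates, key=candidates.get) if candidates else None


# Hypothetical 3x3 imagelets, flattened row by row.
I_GLYPH = (0, 1, 0, 0, 1, 0, 0, 1, 0)
L_GLYPH = (1, 0, 0, 1, 0, 0, 1, 1, 1)

# Simulated first-pass OCR results; the fourth entry is an "L" misread as "I".
ocr_results = [
    ("I", I_GLYPH),
    ("I", I_GLYPH),
    ("I", (0, 1, 0, 0, 1, 0, 0, 1, 1)),
    ("I", L_GLYPH),
    ("L", L_GLYPH),
    ("L", L_GLYPH),
]

learn_set = build_learn_set(clean_seed(build_seed(ocr_results), max_mismatch=2))
noisy_l = (1, 0, 0, 1, 0, 0, 1, 1, 0)
result = recognize(noisy_l, learn_set, threshold=0.8)
```

In this sketch the misread "L" is dropped from the "I" seed during cleaning because it differs from the majority template in more pixels than `max_mismatch` allows, and an imagelet that scores at or below the threshold for every character is left unrecognized rather than forced onto the nearest template.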

  • 11 Assignments