×

Lower modifier detection and extraction from devanagari text images to improve OCR performance

  • US 9,064,191 B2
  • Filed: 03/08/2013
  • Issued: 06/23/2015
  • Est. Priority Date: 01/26/2012
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method to extract lower modifiers from a word image, before performing optical character recognition (OCR), based on a plurality of tests comprising a first test, a second test and a third test, the method comprising:

  • obtaining the word image, wherein the word image defines a height of the word image and a width of the word image;

    performing the first test to determine whether a vertical line spanning the height of the word image is present;

    performing the second test to determine whether a jump of a number of components exist in a lower portion of the word image;

    performing the third test to determine sparseness in the lower portion of the word image; and

    comparing test results from the first test, the second test and the third test to decide whether a lower modifier exists.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×