LOWER MODIFIER DETECTION AND EXTRACTION FROM DEVANAGARI TEXT IMAGES TO IMPROVE OCR PERFORMANCE

  • US 20130195360A1
  • Filed: 03/08/2013
  • Published: 08/01/2013
  • Est. Priority Date: 01/26/2012
  • Status: Active Grant
First Claim

1. A method to extract lower modifiers from a word image, before performing optical character recognition (OCR), based on a plurality of tests comprising a first test, a second test and a third test, the method comprising:

  • obtaining the word image, wherein the word image defines a height of the word image and a width of the word image;

  • performing the first test to determine whether a vertical line spanning the height of the word image is present;

  • performing the second test to determine whether a jump in a number of components exists in a lower portion of the word image;

  • performing the third test to determine sparseness in the lower portion of the word image; and

  • comparing test results from the first test, the second test and the third test to decide whether a lower modifier exists.
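
The three tests and the final comparison can be illustrated with a minimal Python sketch. The claim does not state how each test is computed or how the results are combined, so everything below is an assumption made for illustration: the word image is taken to be a binary NumPy array (1 = ink), "components" are approximated by horizontal ink runs per row, the "lower portion" is the bottom 30% of rows, the thresholds are arbitrary, and the decision is a simple two-out-of-three vote. All function and parameter names are hypothetical.

```python
import numpy as np


def lower_portion(img, lower_fraction=0.3):
    """Return the bottom rows of the image (assumed definition of 'lower portion')."""
    height = img.shape[0]
    rows = max(1, round(height * lower_fraction))
    return img[height - rows:]


def has_full_height_line(img, min_fraction=0.95):
    """First test (assumed form): does any column carry ink over almost the full height?"""
    height = img.shape[0]
    column_ink = img.sum(axis=0)
    return bool((column_ink >= min_fraction * height).any())


def runs_per_row(img):
    """Count horizontal ink runs in each row (a stand-in for 'number of components')."""
    padded = np.pad(img, ((0, 0), (1, 0)))  # add one zero column on the left
    run_starts = (padded[:, 1:] == 1) & (padded[:, :-1] == 0)
    return run_starts.sum(axis=1)


def has_component_jump(img, lower_fraction=0.3, min_jump=1):
    """Second test (assumed form): does the run count change between adjacent lower rows?"""
    counts = runs_per_row(lower_portion(img, lower_fraction))
    if counts.size < 2:
        return False
    return bool(np.abs(np.diff(counts)).max() >= min_jump)


def is_lower_sparse(img, lower_fraction=0.3, max_density=0.2):
    """Third test (assumed form): is the ink density of the lower portion low?"""
    return bool(lower_portion(img, lower_fraction).mean() <= max_density)


def has_lower_modifier(img):
    """Combine the three results; the two-of-three vote is an assumption, not the claim."""
    votes = [
        not has_full_height_line(img),  # no stroke reaches the bottom of the image
        has_component_jump(img),
        is_lower_sparse(img),
    ]
    return sum(votes) >= 2


# Synthetic example: a header line, two vertical strokes, and a small mark below them.
word = np.zeros((14, 12), dtype=np.uint8)
word[0, :] = 1        # header line across the top
word[:10, 3] = 1      # vertical strokes that stop above the mark
word[:10, 8] = 1
word[10:, 5:7] = 1    # small mark occupying the lowest rows
print(has_lower_modifier(word))
```

The vote treats a full-height vertical stroke as evidence against a lower modifier, on the reasoning that a modifier below the baseline extends the image height past the end of the character strokes. A real system would need thresholds tuned on labeled word images.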
