LOWER MODIFIER DETECTION AND EXTRACTION FROM DEVANAGARI TEXT IMAGES TO IMPROVE OCR PERFORMANCE
First Claim
1. A method to extract lower modifiers from a word image, before performing optical character recognition (OCR), based on a plurality of tests comprising a first test, a second test and a third test, the method comprising:
obtaining the word image, wherein the word image defines a height of the word image and a width of the word image;
performing the first test to determine whether a vertical line spanning the height of the word image is present;
performing the second test to determine whether a jump of a number of components exists in a lower portion of the word image;
performing the third test to determine sparseness in the lower portion of the word image; and
comparing test results from the first test, the second test and the third test to decide whether a lower modifier exists.
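The three tests and the final comparison can be illustrated with a minimal Python sketch. It is not the claimed implementation: the claim does not fix any thresholds, how "components" are counted, or how the results are compared. The sketch assumes a binary NumPy image (True = ink), treats the bottom 30% of rows as the lower portion, counts horizontal ink runs per row as a stand-in for components, reads a full-height vertical line as evidence against a lower modifier, and decides by majority vote. All function names and parameters are hypothetical.

```python
import numpy as np

def lower_portion(word, lower_fraction=0.3):
    """Return the bottom rows of a binary word image (True = ink)."""
    rows = max(1, round(word.shape[0] * lower_fraction))
    return word[-rows:]

def has_full_height_line(word, min_fraction=0.9):
    """First test (assumed form): does any column hold ink over
    at least min_fraction of the image height?"""
    ink_per_column = word.sum(axis=0)
    return bool((ink_per_column >= min_fraction * word.shape[0]).any())

def runs_per_row(region):
    """Count horizontal ink runs in each row by counting 0 -> 1 transitions."""
    padded = np.pad(region.astype(np.int8), ((0, 0), (1, 0)))
    return (np.diff(padded, axis=1) == 1).sum(axis=1)

def has_component_jump(word, min_jump=1):
    """Second test (assumed form): does the run count change by at least
    min_jump between consecutive rows of the lower portion?"""
    counts = runs_per_row(lower_portion(word))
    return bool((np.abs(np.diff(counts)) >= min_jump).any())

def is_sparse(word, max_density=0.2):
    """Third test (assumed form): is the ink density of the lower portion low?"""
    return bool(lower_portion(word).mean() <= max_density)

def has_lower_modifier(word):
    """Compare the three results; the majority vote is an assumption."""
    votes = [
        not has_full_height_line(word),  # a full-height stem argues against
        has_component_jump(word),
        is_sparse(word),
    ]
    return sum(votes) >= 2

# Synthetic word with a headline, two short stems and a small blob below them.
with_modifier = np.zeros((20, 30), dtype=bool)
with_modifier[0, :] = True
with_modifier[:14, 5] = True
with_modifier[:14, 15] = True
with_modifier[16:19, 9:12] = True

# Synthetic word whose stems run the full height, with nothing below.
without_modifier = np.zeros((20, 30), dtype=bool)
without_modifier[0, :] = True
without_modifier[:, 5] = True
without_modifier[:, 15] = True

print(has_lower_modifier(with_modifier), has_lower_modifier(without_modifier))
```

On these two synthetic images the sketch reports a lower modifier only for the first one. Real word images would need thresholds tuned to the font, resolution and binarization in use.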
Abstract
Systems, apparatus and methods for extracting lower modifiers from a word image, before performing optical character recognition (OCR), based on a plurality of tests comprising a first test, a second test and a third test are presented. The method obtains the word image and performs a plurality of tests (e.g., a first test, a second test and a third test). The first test determines whether a vertical line spanning the height of the word image exists. The second test determines whether a jump of a number of components exists in a lower portion of the word image. The third test determines sparseness in the lower portion of the word image. The plurality of tests may run sequentially and/or in parallel. Results from the plurality of tests are used to decide whether a lower modifier exists by comparing and accumulating test results from the plurality of tests.
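The statement that the tests may run sequentially and/or in parallel, with their results accumulated into one decision, can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not say how results are accumulated, so the sketch counts votes against a hypothetical `min_votes` threshold, and the three tests are replaced by stand-in callables that return fixed values.

```python
from concurrent.futures import ThreadPoolExecutor

def decide_lower_modifier(word_image, tests, min_votes=2, parallel=True):
    """Run every test on the image and accumulate votes for a lower modifier.

    `tests` is a list of callables; each returns True when its result
    supports the presence of a lower modifier (an assumed convention).
    """
    if parallel:
        # pool.map preserves the order of `tests` in its results.
        with ThreadPoolExecutor(max_workers=max(1, len(tests))) as pool:
            results = list(pool.map(lambda test: test(word_image), tests))
    else:
        results = [test(word_image) for test in tests]
    votes = sum(bool(r) for r in results)
    return votes >= min_votes, results

# Stand-in tests with fixed outcomes; real tests would inspect the image.
stand_in_tests = [lambda img: True, lambda img: False, lambda img: True]

decision, results = decide_lower_modifier(None, stand_in_tests)
sequential_decision, _ = decide_lower_modifier(None, stand_in_tests, parallel=False)
print(decision, results, sequential_decision)
```

Both execution modes produce the same decision because the accumulation step only depends on the collected results, not on the order in which the tests finish.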
22 Claims
1. A method to extract lower modifiers from a word image, before performing optical character recognition (OCR), based on a plurality of tests comprising a first test, a second test and a third test, the method comprising:
obtaining the word image, wherein the word image defines a height of the word image and a width of the word image;
performing the first test to determine whether a vertical line spanning the height of the word image is present;
performing the second test to determine whether a jump of a number of components exists in a lower portion of the word image;
performing the third test to determine sparseness in the lower portion of the word image; and
comparing test results from the first test, the second test and the third test to decide whether a lower modifier exists.
11. A mobile device for extracting lower modifiers from a word image, before performing optical character recognition (OCR), based on a plurality of tests comprising a first test, a second test and a third test, the mobile device comprising:
a camera;
a display; and
a processor coupled to the camera and the display, wherein the processor comprises instructions to:
obtain the word image, wherein the word image defines a height of the word image and a width of the word image;
perform the first test to determine whether a vertical line spanning the height of the word image is present;
perform the second test to determine whether a jump of a number of components exists in a lower portion of the word image;
perform the third test to determine sparseness in the lower portion of the word image; and
compare test results from the first test, the second test and the third test to decide whether a lower modifier exists.
15. A mobile device for extracting lower modifiers from a word image, before performing optical character recognition (OCR), based on a plurality of tests comprising a first test, a second test and a third test, the mobile device comprising:
means for obtaining the word image, wherein the word image defines a height of the word image and a width of the word image;
means for performing the first test to determine whether a vertical line spanning the height of the word image is present;
means for performing the second test to determine whether a jump of a number of components exists in a lower portion of the word image;
means for performing the third test to determine sparseness in the lower portion of the word image; and
means for comparing test results from the first test, the second test and the third test to decide whether a lower modifier exists.
19. A mobile device for extracting lower modifiers from a word image, before performing optical character recognition (OCR), based on a plurality of tests comprising a first test, a second test and a third test, the mobile device comprising a processor and a memory wherein the memory includes software instructions to:
obtain the word image, wherein the word image defines a height of the word image and a width of the word image;
perform the first test to determine whether a vertical line spanning the height of the word image is present;
perform the second test to determine whether a jump of a number of components exists in a lower portion of the word image;
perform the third test to determine sparseness in the lower portion of the word image; and
compare test results from the first test, the second test and the third test to decide whether a lower modifier exists.
22. A non-volatile computer-readable storage medium including program code stored thereon, comprising program code to extract lower modifiers from a word image, before performing optical character recognition (OCR), based on a plurality of tests comprising a first test, a second test and a third test, the program code to:
obtain the word image, wherein the word image defines a height of the word image and a width of the word image;
perform the first test to determine whether a vertical line spanning the height of the word image is present;
perform the second test to determine whether a jump of a number of components exists in a lower portion of the word image;
perform the third test to determine sparseness in the lower portion of the word image; and
compare test results from the first test, the second test and the third test to decide whether a lower modifier exists.
Specification