Methods and systems for detecting numerals in a digital image
First Claim
Patent Images
1. A method for detecting a numeral connected component in a digital image, said method comprising:
- receiving a text-line component, wherein said text-line component comprises a plurality of connected components in a digital image;
calculating an aspect ratio for each of said connected components in said plurality of connected components, thereby producing a plurality of aspect ratios;
calculating a first characteristic of said plurality of aspect ratios;
determining a component bounding box for each of said plurality of connected components, wherein each component bounding box comprises a first-side coordinate, a second-side coordinate, a third-side coordinate and a fourth-side coordinate, wherein said first-side coordinate and said second-side coordinate are associated with a first axis of said bounding box and said third-side coordinate and said fourth-side coordinate are associated with a second axis of said bounding box;
determining a first variability measure associated with said first-side coordinates;
determining a second variability measure associated with said second-side coordinates;
determining a third variability measure associated with said third-side coordinates;
determining a fourth variability measure associated with said fourth-side coordinates;
determining a first accumulation of said first variability measure and said second variability measure;
determining a second accumulation of said third variability measure and said fourth variability measure;
when said first accumulation and said second accumulation meet a first accumulation criterion;
setting a first variability characteristic equal to said first variability measure; and
setting a second variability characteristic equal to said second variability measure;
when said first accumulation and said second accumulation do not meet said first accumulation criterion;
setting said first variability characteristic equal to said third variability measure; and
setting said second variability characteristic equal to said fourth variability measure;
classifying said text-line component as a numeral component when a first criterion comprising said first characteristic meeting a second criterion and said first variability characteristic meeting a third criterion and said second variability characteristic meeting a fourth criterion is met; and
classifying said text-line component as a non-numeral component when said first criterion is not met.
1 Assignment
0 Petitions
Accused Products
Abstract
Aspects of the present invention are related to systems and methods for determining the location of numerals in an electronic document image.
75 Citations
8 Claims
-
1. A method for detecting a numeral connected component in a digital image, said method comprising:
-
receiving a text-line component, wherein said text-line component comprises a plurality of connected components in a digital image; calculating an aspect ratio for each of said connected components in said plurality of connected components, thereby producing a plurality of aspect ratios; calculating a first characteristic of said plurality of aspect ratios; determining a component bounding box for each of said plurality of connected components, wherein each component bounding box comprises a first-side coordinate, a second-side coordinate, a third-side coordinate and a fourth-side coordinate, wherein said first-side coordinate and said second-side coordinate are associated with a first axis of said bounding box and said third-side coordinate and said fourth-side coordinate are associated with a second axis of said bounding box; determining a first variability measure associated with said first-side coordinates; determining a second variability measure associated with said second-side coordinates; determining a third variability measure associated with said third-side coordinates; determining a fourth variability measure associated with said fourth-side coordinates; determining a first accumulation of said first variability measure and said second variability measure; determining a second accumulation of said third variability measure and said fourth variability measure; when said first accumulation and said second accumulation meet a first accumulation criterion; setting a first variability characteristic equal to said first variability measure; and setting a second variability characteristic equal to said second variability measure; when said first accumulation and said second accumulation do not meet said first accumulation criterion; setting said first variability characteristic equal to said third variability measure; and setting said second variability characteristic equal to said fourth variability measure; classifying said text-line component as a numeral component when a first criterion comprising said first characteristic meeting a second criterion and said first variability characteristic meeting a third criterion and said second variability characteristic meeting a fourth criterion is met; and classifying said text-line component as a non-numeral component when said first criterion is not met. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
Specification