Text Detection Using Multi-Layer Connected Components With Histograms
First Claim
1. An apparatus comprising:
- at least one processor; and
at least one memory including computer program code,in which the at least one memory and the computer program code are configured, with the at least one processor, to cause the apparatus at least to;
convert a digital image to a multiple level image;
form multiple scale sets from connected components of the multiple level image, in which different ones of the scale sets define different size spatial bins;
for each of the multiple scale sets;
generate a count of connected components extracted from the respective scale set for each spatial bin; and
link adjacent spatial bins which represent connected components;
merge the connected components from the different scale sets; and
perform text line detection on the merged connected components.
2 Assignments
0 Petitions
Accused Products
Abstract
A digital image is converted to a multiple level image, and multiple scale sets are formed from connected components of the multiple level image such that different ones of the scale sets define different size spatial bins. For each of the multiple scale sets there is generated a count of connected components extracted from the respective scale set for each spatial bin; and adjacent spatial bins which represent connected components are linked. Then the connected components from the different scale sets are merged and text line detection is performed on the merged connected components. In one embodiment each of the scale sets is a histogram, and prior to linking all bins with less than a predetermined count are filtered out; and each histogram is extended such that counts of adjacent horizontal and vertical bins are added (single region bins are filtered out) and the linking is on the extended histograms.
21 Citations
20 Claims
-
1. An apparatus comprising:
-
at least one processor; and at least one memory including computer program code, in which the at least one memory and the computer program code are configured, with the at least one processor, to cause the apparatus at least to; convert a digital image to a multiple level image; form multiple scale sets from connected components of the multiple level image, in which different ones of the scale sets define different size spatial bins; for each of the multiple scale sets; generate a count of connected components extracted from the respective scale set for each spatial bin; and link adjacent spatial bins which represent connected components; merge the connected components from the different scale sets; and perform text line detection on the merged connected components. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method comprising:
-
converting a digital image to a multiple level image; form multiple scale sets from connected components of the multiple level image by at least one processor, in which different ones of the scale sets define different size spatial bins; for each of the multiple scale sets; generating a count of connected components extracted from the respective scale set for each spatial bin; and linking adjacent spatial bins which represent connected components; merging the connected components from the different scale sets; and performing text line detection on the merged connected components. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A non-transitory computer readable memory storing a program of instructions comprising:
-
code for converting a digital image to a multiple level image; code for forming multiple scale sets from connected components of the multiple level image, in which different ones of the scale sets define different size spatial bins; for each of the multiple scale sets; code for generating a count of connected components extracted from the respective scale set for each spatial bin; and code for linking adjacent spatial bins which represent connected components; code for merging the connected components from the different scale sets; and code for performing text line detection on the merged connected components. - View Dependent Claims (18, 19, 20)
-
Specification