GENERALIZED TEXT LOCALIZATION IN IMAGES
First Claim
1. A method of locating text in digital images, comprising:
- scaling a digital image into images of multiple resolutions;
classifying whether pixels in the multiple resolutions are part of a text region;
integrating scales to create a scale integration saliency map;
using the saliency map to create initial text bounding boxes through expanding the boxes from rectangles of pixels including at least one pixel to include groups of at least one pixel adjacent to the rectangles, wherein the groups have a particular relationship to a first threshold; and
consolidating the initial text bounding boxes.
1 Assignment
0 Petitions
Accused Products
Abstract
In some embodiments, the invention includes a method for locating text in digital images. The method includes scaling a digital image into images of multiple resolutions and classifying whether pixels in the multiple resolutions are part of a text region. The method also includes integrating scales to create a scale integration saliency map and using the saliency map to create initial text bounding boxes through expanding the boxes from rectangles of pixels including at least one pixel to include groups of at least one pixel adjacent to the rectangles, wherein the groups have a particular relationship to a first threshold. The initial text bounding boxes are consolidated. In other embodiments, a method includes classifying whether pixels are part of a text region, creating initial text bounding boxes, and consolidating the initial text bounding boxes, wherein the consolidating includes creating horizontal projection profiles having adaptive thresholds and vertical projection profiles having adaptive thresholds.
72 Citations
32 Claims
-
1. A method of locating text in digital images, comprising:
-
scaling a digital image into images of multiple resolutions;
classifying whether pixels in the multiple resolutions are part of a text region;
integrating scales to create a scale integration saliency map;
using the saliency map to create initial text bounding boxes through expanding the boxes from rectangles of pixels including at least one pixel to include groups of at least one pixel adjacent to the rectangles, wherein the groups have a particular relationship to a first threshold; and
consolidating the initial text bounding boxes. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 24, 25, 26, 27, 29, 30, 31, 32)
-
-
12. An apparatus comprising:
a machine readable medium having instructions thereon which when executed cause a processor to perform a method including;
scaling a digital image into images of multiple resolutions;
classifying whether pixels in the multiple resolutions are part of a text region;
integrating scales to create a scale integration saliency map;
using the saliency map to create initial text bounding boxes through expanding the boxes from rectangles of pixels including at least one pixel to include groups of at least one pixel adjacent to the rectangles, wherein the groups have a particular relationship to a first threshold; and
consolidating the initial text bounding boxes.
-
23. A method, comprising:
-
classifying whether pixels are part of a text region;
creating initial text bounding boxes; and
consolidating the initial text bounding boxes, wherein the consolidating includes creating horizontal projection profiles having adaptive thresholds and vertical projection profiles having adaptive thresholds.
-
-
28. An apparatus comprising:
a machine readable medium having instructions thereon which when executed cause a processor to perform a method including;
classifying whether pixels are part of a text region;
creating initial text bounding boxes; and
consolidating the initial text bounding boxes, wherein the consolidating includes creating horizontal projection profiles having adaptive thresholds and vertical projection profiles having adaptive thresholds.
Specification