Finding text in natural scenes
First Claim
1. A computer-implemented method for automatically detecting text in electronic images of natural scenes, comprising:
- receiving an electronic image for analysis;
performing an edge-detection algorithm on the electronic image;
identifying closed contours in the electronic image as a function of detected edges;
establishing links between closed components;
identifying candidate text lines as a function of the identified closed contours;
classifying candidate text lines as being text regions or non-text regions; and
outputting, via a graphical user interface (GUI), the text regions in the electronic image to a user;
wherein identifying candidate text lines further comprises;
selecting a link for consideration;
fitting a line that connects respective centers of first and second closed contours connected by the link;
for each of the first and second closed contours, identifying all associated links other than the selected link, wherein a third closed contour attached to one of the associated links is selected;
re-fitting the fitted line by including newly added third closed contour, wherein the refitted line connects the centers of the first, second, and third closed contours; and
iterating the preceding steps until all closed contours having a center with a distance less than the predetermined threshold Tf have been added to the candidate text line.
5 Assignments
0 Petitions
Accused Products
Abstract
As set forth herein, systems and methods facilitate providing an efficient edge-detection and closed-contour based approach for finding text in natural scenes such as photographic images, digital, and/or electronic images, and the like. Edge information (e.g., edges of structures or objects in the images) is obtained via an edge detection technique. Edges from text characters form closed contours even in the presence of reasonable levels of noise. Closed contour linking and candidate text line formation are two additional features of the described approach. A candidate text line classifier is applied to further screen out false-positive text identifications. Candidate text regions for placement of text in the natural scene of the electronic image are highlighted and presented to a user.
-
Citations
19 Claims
-
1. A computer-implemented method for automatically detecting text in electronic images of natural scenes, comprising:
-
receiving an electronic image for analysis; performing an edge-detection algorithm on the electronic image; identifying closed contours in the electronic image as a function of detected edges; establishing links between closed components; identifying candidate text lines as a function of the identified closed contours; classifying candidate text lines as being text regions or non-text regions; and outputting, via a graphical user interface (GUI), the text regions in the electronic image to a user; wherein identifying candidate text lines further comprises; selecting a link for consideration; fitting a line that connects respective centers of first and second closed contours connected by the link; for each of the first and second closed contours, identifying all associated links other than the selected link, wherein a third closed contour attached to one of the associated links is selected; re-fitting the fitted line by including newly added third closed contour, wherein the refitted line connects the centers of the first, second, and third closed contours; and iterating the preceding steps until all closed contours having a center with a distance less than the predetermined threshold Tf have been added to the candidate text line. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computerized system that facilitates automatically detecting text in electronic images of natural scenes, comprising:
-
a memory that stores computer-executable instructions; and a processor configured to execute the instructions, the instructions comprising; receiving an electronic image for analysis; performing an edge-detection algorithm on the electronic image; identifying closed contours in the electronic image as a function of detected edges; establishing links between closed components; identifying candidate text lines as a function of the identified closed contours; classifying candidate text lines as being text regions or non-text regions; and a graphical user interface (GUI) via which the text regions in the electronic image are displayed to a user; wherein the instructions for identifying candidate text lines further comprise instructions for; selecting a link for consideration; fitting a line that connects respective centers of first and second closed contours connected by the link; for each of the first and second closed contours, identifying all associated links other than the selected link, wherein a third closed contour attached to one of the associated links is selected; re-fitting the fitted line by including newly added third closed contour, wherein the refitted line connects the centers of the first, second, and third closed contours; and iterating the preceding steps until all closed contours having a center with a distance less than the predetermined threshold Tf have been added to the candidate text line. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19)
-
Specification