TEXT REGION DETECTION SYSTEM AND METHOD
First Claim
1. A method for detecting a text region in an image, the method comprising:
- detecting a candidate text region from an input image;
generating a set of oriented gradient images of the candidate text region;
capturing one or more detection window images of the candidate text region;
calculating a sum of oriented gradients for a region in one of the oriented gradient images;
classifying whether each detection window image contains text by comparing the associated sum of oriented gradients and a threshold; and
determining whether the candidate text region is a true text region based on the classifications of the detection window images.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for detecting a text region in an image is disclosed. The method includes detecting a candidate text region from an input image. A set of oriented gradient images is generated from the candidate text region, and one or more detection window images of the candidate text region are captured. A sum of oriented gradients is then calculated for a region in one of the oriented gradient images. It is classified whether each detection window image contains text by comparing the associated sum of oriented gradients and a threshold. Based on the classifications of the detection window images, it is determined whether the candidate text region is a true text region.
28 Citations
78 Claims
-
1. A method for detecting a text region in an image, the method comprising:
-
detecting a candidate text region from an input image; generating a set of oriented gradient images of the candidate text region; capturing one or more detection window images of the candidate text region; calculating a sum of oriented gradients for a region in one of the oriented gradient images; classifying whether each detection window image contains text by comparing the associated sum of oriented gradients and a threshold; and determining whether the candidate text region is a true text region based on the classifications of the detection window images. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method for detecting a text region in an image, the method comprising:
-
i) detecting a candidate text region from an input image; ii) generating a set of oriented gradient images of the candidate text region; iii) capturing one or more detection window images from the candidate text region; iv) calculating a sum of oriented gradients from a region in an oriented gradient image obtained from each detection window image; v) classifying whether each detection window image contains text by comparing the sum of oriented gradients and a threshold; vi) determining whether each detection window image contains text based on each classification; and vii) if each detection window image is determined to contain text, repeating iv) to vi) to determine whether the candidate text region is a true text region. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A method for generating one or more classifiers for classifying an image into a text region, the method comprising:
-
providing a plurality of oriented gradient images generated from learning sample images, one or more oriented gradient images being generated from each learning sample image including a tag indicating whether the learning sample image is a text image or non-text image; generating a plurality of candidate weak classifiers, each candidate weak classifier including parameters defining a region and a direction of an oriented gradient image and being configured to classify each learning sample image based on a sum of oriented gradients calculated for the defined region and direction; verifying whether each candidate weak classifier correctly classifies each learning sample image based on the tag; determining an error rate for each candidate weak classifier; and selecting one or more weak classifiers from the candidate weak classifiers based on the error rate of each candidate weak classifier. - View Dependent Claims (24, 25, 26, 27)
-
-
28. An apparatus for detecting a text region in an image, the apparatus comprising:
-
a candidate text region detector configured to receive an input image and detect a candidate text region from the input image; a gradient image generator configured to receive the candidate text region and generate a set of oriented gradient images of the candidate text region; a detection window locator configured to capture one or more detection window images of the candidate text region; a classifier configured to classify whether each detection window image contains text based on a plurality of sums of oriented gradients, each of the sums of oriented gradients being calculated for a region in one of the oriented gradient images; and a first determination unit configured to determine whether the candidate text region is a true text region based on the classifications of the detection window images. - View Dependent Claims (29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40)
-
-
41. An apparatus for detecting a text region in an image, the apparatus comprising:
-
a candidate text region detector configured to receive an input image and detect a candidate text region from the input image; a gradient image generator configured to receive the candidate text region and generate a set of oriented gradient images of the candidate text region; a detection window locator configured to capture one or more detection window images of the candidate text region; a plurality of classifiers arranged in series, each classifier being configured to classify whether each detection window image contains text based on a plurality of sums of oriented gradients when the classification from the previous classifier in the series indicates that the detection window image contains text, each of the sums of oriented gradients being calculated for a region in one of the oriented gradient images; and a first determination unit configured to determine whether the candidate text region is a true text region based on the classifications of the detection window images. - View Dependent Claims (42, 43, 44, 45, 46, 47)
-
-
48. A non-transitory computer-readable storage medium comprising instructions for detecting a text region in an image, the instructions causing a processor to perform the operations of:
-
detecting a candidate text region from an input image; generating a set of oriented gradient images of the candidate text region; capturing one or more detection window images of the candidate text region; calculating a sum of oriented gradients for a region in one of the oriented gradient images; classifying whether each detection window image contains text by comparing the associated sum of oriented gradients and a threshold; and determining whether the candidate text region is a true text region based on the classifications of the detection window images. - View Dependent Claims (49, 50, 51, 52, 53, 54, 55, 56, 57, 58)
-
-
59. A non-transitory computer-readable storage medium comprising instructions for detecting a text region in an image, the instructions causing a processor to perform the operations of:
-
i) detecting a candidate text region from an input image; ii) generating a set of oriented gradient images of the candidate text region; iii) capturing one or more detection window images from the candidate text region; iv) calculating a sum of oriented gradients from a region in an oriented gradient image obtained from each detection window image; v) classifying whether each detection window image contains text by comparing the sum of oriented gradients and a threshold; vi) determining whether each detection window image contains text based on each classification; and vii) if each detection window image is determined to contain text, repeating iv) to vi) to determine whether the candidate text region is a true text region.
-
-
60. A non-transitory computer-readable storage medium comprising instructions for generating one or more classifiers for classifying an image into a text region, the instructions causing a processor to perform the operations of:
-
providing a plurality of oriented gradient images generated from learning sample images, one or more oriented gradient images being generated from each learning sample image including a tag indicating whether the learning sample image is a text image or non-text image; generating a plurality of candidate weak classifiers, each candidate weak classifier including parameters defining a region and a direction of an oriented gradient image and being configured to classify each learning sample image based on a sum of oriented gradients calculated for the defined region and direction; verifying whether each candidate weak classifier correctly classifies each learning sample image based on the tag; determining an error rate for each candidate weak classifier; and selecting one or more weak classifiers from the candidate weak classifiers based on the error rate of each candidate weak classifier. - View Dependent Claims (61, 62, 63, 64)
-
-
65. An apparatus for detecting a text region in an image, the apparatus comprising:
-
means for receiving an input image and detecting a candidate text region from the input image; means for receiving the candidate text region and generating a set of oriented gradient images of the candidate text region; means for capturing one or more detection window images of the candidate text region; means for classifying whether each detection window image contains text based on a plurality of sums of oriented gradients, each of the sums of oriented gradients being calculated for a region in one of the oriented gradient images; and means for determining whether the candidate text region is a true text region based on the classifications of the detection window images. - View Dependent Claims (66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77)
-
-
78. An apparatus for detecting a text region in an image, the apparatus comprising:
-
means for receiving an input image and detecting a candidate text region from the input image; means for receiving the candidate text region and generating a set of oriented gradient images of the candidate text region; means for capturing one or more detection window images of the candidate text region; a plurality of means for classifying whether each detection window image contains text based on a plurality of sums of oriented gradients when the classification from the previous classifier in the series indicates that the detection window image contains text, each of the sums of oriented gradients being calculated for a region in one of the oriented gradient images, wherein the plurality of means for classifying is arranged in series; and means for determining whether the candidate text region is a true text region based on the classifications of the detection window images.
-
Specification