Identifying A Maximally Stable Extremal Region (MSER) In An Image By Skipping Comparison Of Pixels In The Region
First Claim
1. A method to identify regions, the method comprising:
- receiving an image of a scene of real world;
creating a plurality of sets of positions automatically, by at least performing comparisons using multiple pluralities of pixels hereinafter compared pixels that are located in the image at corresponding positions comprised in the plurality of sets of positions;
wherein a first set in the plurality of sets of positions is created without using in any comparison, a plurality of pixels hereinafter skipped pixels that are located in the image at additional positions comprised in the first set;
wherein a first region identified by the first set is contiguous in the image, the first region comprising the compared pixels and the skipped pixels identified respectively by the corresponding positions and the additional positions;
wherein a second region is contiguous in the image, the second region being identified by positions in a second set, in the plurality of sets of positions created by the creating;
checking automatically, whether a test is satisfied by a first attribute of the first region and a second attribute of the second region;
preparing automatically, a merged set comprising the positions in the first set and the positions in the second set, based on at least an outcome of said test; and
storing automatically, in one or more memories, the merged set;
wherein the receiving, the creating, the checking, the preparing and the storing are performed by one or more processors coupled to the one or more memories.
1 Assignment
0 Petitions
Accused Products
Abstract
A difference in intensities of a pair of pixels in an image is repeatedly compared to a threshold, with the pair of pixels being separated by at least one pixel (“skipped pixel”). When the threshold is found to be exceeded, a selected position of a selected pixel in the pair, and at least one additional position adjacent to the selected position are added to a set of positions. The comparing and adding are performed multiple times to generate multiple such sets, each set identifying a region in the image, e.g. an MSER. Sets of positions, identifying regions whose attributes satisfy a test, are merged to obtain a merged set. Intensities of pixels identified in the merged set are used to generate binary values for the region, followed by classification of the region as text/non-text. Regions classified as text are supplied to an optical character recognition (OCR) system.
-
Citations
26 Claims
-
1. A method to identify regions, the method comprising:
-
receiving an image of a scene of real world; creating a plurality of sets of positions automatically, by at least performing comparisons using multiple pluralities of pixels hereinafter compared pixels that are located in the image at corresponding positions comprised in the plurality of sets of positions; wherein a first set in the plurality of sets of positions is created without using in any comparison, a plurality of pixels hereinafter skipped pixels that are located in the image at additional positions comprised in the first set; wherein a first region identified by the first set is contiguous in the image, the first region comprising the compared pixels and the skipped pixels identified respectively by the corresponding positions and the additional positions; wherein a second region is contiguous in the image, the second region being identified by positions in a second set, in the plurality of sets of positions created by the creating; checking automatically, whether a test is satisfied by a first attribute of the first region and a second attribute of the second region; preparing automatically, a merged set comprising the positions in the first set and the positions in the second set, based on at least an outcome of said test; and storing automatically, in one or more memories, the merged set; wherein the receiving, the creating, the checking, the preparing and the storing are performed by one or more processors coupled to the one or more memories. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A handheld device to generate regions of binary values for use in text recognition, the handheld device comprising:
-
one or more memories comprising an image of a scene of real world; one or more processors operatively coupled to the one or more memories to execute therefrom a plurality of instructions, the plurality of instructions comprising; first instructions to create a plurality of sets of positions automatically, by at least performing comparisons using multiple pluralities of pixels hereinafter compared pixels that are located in the image at corresponding positions comprised in the plurality of sets of positions; wherein a first set in the plurality of sets of positions is created without using in any comparison, a plurality of pixels hereinafter skipped pixels that are located in the image at additional positions comprised in the first set; wherein a first region identified by the first set is contiguous in the image, the first region comprising the compared pixels and the skipped pixels identified respectively by the corresponding positions and the additional positions; wherein a second region is contiguous in the image, the second region being identified by positions in a second set, in the plurality of sets of positions created by the creating; second instructions to check, whether a test is satisfied by a first attribute of the first region and a second attribute of the second region; third instructions to prepare a merged set comprising the positions in the first set and the positions in the second set, based on at least an outcome of said test; and fourth instructions to store in at least one memory, the merged set. - View Dependent Claims (11, 12, 13, 14, 15, 16)
-
-
17. One or more storage media comprising a sequence of instructions, which, when executed cause a processor to perform operations, the sequence of instructions comprising:
-
first instructions to create a plurality of sets of positions automatically, by at least performing comparisons using multiple pluralities of pixels hereinafter compared pixels that are located in an image at corresponding positions comprised in the plurality of sets of positions; wherein a first set in the plurality of sets of positions is created without using in any comparison, a plurality of pixels hereinafter skipped pixels that are located in the image at additional positions comprised in the first set; wherein a first region identified by the first set is contiguous in the image, the first region comprising the compared pixels and the skipped pixels identified respectively by the corresponding positions and the additional positions; wherein a second region is contiguous in the image, the second region being identified by positions in a second set, in the plurality of sets of positions created by the creating; second instructions to check, whether a test is satisfied by a first attribute of the first region and a second attribute of the second region; third instructions to prepare a merged set comprising the positions in the first set and the positions in the second set, based on at least an outcome of said test; and fourth instructions to store in at least one memory, the merged set. - View Dependent Claims (18, 19, 20, 21)
-
-
22. An apparatus for generating binary values for use in text recognition, the apparatus comprising:
-
first means for creating a plurality of sets of positions automatically, by at least performing comparisons using multiple pluralities of pixels hereinafter compared pixels that are located in an image at corresponding positions comprised in the plurality of sets of positions; wherein a first set in the plurality of sets of positions is created without using in any comparison, a plurality of pixels hereinafter skipped pixels that are located in the image at additional positions comprised in the first set; wherein a first region identified by the first set is contiguous in the image, the first region comprising the compared pixels and the skipped pixels identified respectively by the corresponding positions and the additional positions; wherein a second region is contiguous in the image, the second region being identified by positions in a second set, in the plurality of sets of positions created by the creating; second means for checking, whether a test is satisfied by a first attribute of the first region and a second attribute of the second region; third means for preparing a merged set comprising the positions in the first set and the positions in the second set, based on at least an outcome of said test; and fourth means for storing in at least one memory, the merged set. - View Dependent Claims (23, 24, 25, 26)
-
Specification