Identifying regions of text to merge in a natural image or video frame
First Claim
1. A method of identifying regions to merge, in an image of a scene of real world captured by a camera in a handheld device, the method of identifying regions comprising:
- checking whether a first block, which contains a first region of pixels that are contiguous with one another and comprising a local extrema of intensity in the image, satisfies a predetermined test, for presence along a line, of pixels with intensities binarizable to a common value;
marking the first block as pixel-line-present, in a memory, when a result of the checking indicates the predetermined test is satisfied;
identifying a second block that is located in the image adjacent to the first block, wherein at least the first block is marked as pixel-line-present;
merging a first set of positions indicative of the first region of pixels in the first block with a second set of positions indicative of a second region of pixels in the second block to obtain a merged set of positions in a merged block, when a predetermined rule is satisfied by one or more geometric attributes of the first block and the second block;
wherein the first region of pixels and the second region of pixels do not contact one another in the merged block;
wherein the merging is performed prior to classification of any pixel in the first region of pixels and in the second region of pixels as text or non-text; and
re-doing the checking, on the merged block, to determine whether the merged block satisfies the predetermined test;
wherein one or more of the checking, the marking, the identifying, the merging, and the re-doing are performed by at least one processor coupled to the memory.
1 Assignment
0 Petitions
Accused Products
Abstract
In several aspects of described embodiments, an electronic device and method use a camera to capture an image or a frame of video of an environment outside the electronic device followed by identification of blocks of regions in the image. Each block that contains a region is checked, as to whether a test for presence of a line of pixels is met. When the test is met for a block, that block is identified as pixel-line-present. Pixel-line-present blocks are used to identify blocks that are adjacent. One or more adjacent block(s) may be merged with a pixel-line-present block when one or more rules are found to be satisfied, resulting in a merged block. The merged block is then subject to the above-described test, to verify presence of a line of pixels therein, and when the test is satisfied the merged block is processed normally, e.g. classified as text or non-text.
-
Citations
29 Claims
-
1. A method of identifying regions to merge, in an image of a scene of real world captured by a camera in a handheld device, the method of identifying regions comprising:
-
checking whether a first block, which contains a first region of pixels that are contiguous with one another and comprising a local extrema of intensity in the image, satisfies a predetermined test, for presence along a line, of pixels with intensities binarizable to a common value; marking the first block as pixel-line-present, in a memory, when a result of the checking indicates the predetermined test is satisfied; identifying a second block that is located in the image adjacent to the first block, wherein at least the first block is marked as pixel-line-present; merging a first set of positions indicative of the first region of pixels in the first block with a second set of positions indicative of a second region of pixels in the second block to obtain a merged set of positions in a merged block, when a predetermined rule is satisfied by one or more geometric attributes of the first block and the second block; wherein the first region of pixels and the second region of pixels do not contact one another in the merged block; wherein the merging is performed prior to classification of any pixel in the first region of pixels and in the second region of pixels as text or non-text; and re-doing the checking, on the merged block, to determine whether the merged block satisfies the predetermined test; wherein one or more of the checking, the marking, the identifying, the merging, and the re-doing are performed by at least one processor coupled to the memory. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A mobile device comprising:
-
a camera; a memory operatively connected to the camera to receive at least an image therefrom; a processor operatively connected to the memory to execute a plurality of computer instructions stored in the memory, to supply information related to a merged block, the merged block being obtained by merging a first block with a second block; wherein the processor is configured to check whether the first block, which contains a first region of pixels that are contiguous with one another and comprising a local extrema of intensity in the image, satisfies a predetermined test, for presence along a line, of pixels with intensities binarizable to a common value; wherein the processor is configured to mark the first block as pixel-line-present, in the memory, when a result of the check indicates the predetermined test is satisfied; wherein the processor is configured to identify the second block for being located in the image adjacent to the first block, when at least the first block is marked as pixel-line-present; wherein the processor is configured to merge a first set of positions indicative of the first region of pixels in the first block with a second set of positions indicative of a second region of pixels in the second block to obtain a merged set of positions in the merged block, when a predetermined rule is satisfied by one or more geometric attributes of the first block and the second block; wherein the first region of pixels and the second region of pixels do not contact one another in the merged block; wherein the processor is configured to obtain the merged block prior to classification of any pixel in the first region of pixels and in the second region of pixels as text or non-text; and wherein the processor is configured to re-do the check with the merged block, to determine whether the merged block satisfies the predetermined test. - View Dependent Claims (17, 18, 19, 20, 21)
-
-
22. One or more non-transitory computer readable storage media comprising computer instructions, which when executed in a handheld device, cause one or more processors in the handheld device to perform operations, the computer instructions comprising:
-
first instructions to check whether a first block, which contains a first region of pixels that are contiguous with one another and comprising a local extrema of intensity in an image captured by a camera, satisfies a predetermined test, for presence along a line, of pixels with intensities binarizable to a common value; second instructions to mark the first block as pixel-line-present, in a memory, when a result of the checking indicates the predetermined test is satisfied; third instructions to identify a second block that is located in the image adjacent to the first block, wherein at least the first block is marked as pixel-line-present; fourth instructions to merge a first set of positions indicative of the first region of pixels in the first block with a second set of positions indicative of a second region of pixels in the second block to obtain a merged set of positions in a merged block, when a predetermined rule is satisfied by one or more geometric attributes of the first block and the second block; wherein the first region of pixels and the second region of pixels do not contact one another in the merged block; wherein the merging is performed prior to classification of any pixel in the first region of pixels and in the second region of pixels as text or non-text; and fifth instructions to re-do the check with the merged block, to determine whether the merged block satisfies the predetermined test; wherein one or more of the first instructions, the second instructions, the third instructions, the fourth instructions and the fifth instructions are to be executed by at least one processor coupled to the memory. - View Dependent Claims (23, 24, 25, 26, 27)
-
-
28. An apparatus for identifying regions of text, the apparatus comprising:
-
a memory storing an image of an environment outside the apparatus; means for checking whether a first block, which contains a first region of pixels that are contiguous with one another and comprising a local extrema of intensity in the image, satisfies a predetermined test, for presence along a line, of pixels with intensities binarizable to a common value; means for marking the first block as pixel-line-present, in the memory, when a result of the checking indicates the predetermined test is satisfied; means for identifying a second block that is located in the image adjacent to the first block, wherein at least the first block is marked as pixel-line-present; means for merging a first set of positions indicative of the first region of pixels in the first block with a second set of positions indicative of a second region of pixels in the second block to obtain a merged set of positions in a merged block, when a predetermined rule is satisfied by one or more geometric attributes of the first block and the second block; wherein the first region of pixels and the second region of pixels do not contact one another in the merged block; wherein the merging is performed prior to classification of any pixel in the first region of pixels and in the second region of pixels as text or non-text; and means for re-doing the checking with the merged block, to determine whether the merged block satisfies the predetermined test; wherein one or more of the means for checking, the means for marking, the means for identifying, the means for merging, and the means for re-doing comprise at least one processor coupled to the memory. - View Dependent Claims (29)
-
Specification