Positionally encoded document image analysis and labeling
First Claim
1. A method of labeling a document image containing positionally encoded maze patterns for computationally efficient decoding, the method comprising:
- obtaining the document image;
analyzing the document image to determine a number of position encoding bits that can be extracted from the document image, the analyzing including;
dividing the document image into blocks having substantially a same size as maze pattern cells;
determining whether the blocks are occluded by document content;
counting, for each pixel in the document image, a number of completely visible blocks in a neighboring window with the pixel being as a center of the window; and
labeling the pixel based on the number;
performing a thresholding algorithm on the document image to determine if the document image is of a type selected from at least;
a first type containing sufficient amount of visible positionally encoded maze patterns for a computationally efficient algorithm to decode the document image, anda second type containing document content that occludes at least a portion of the positionally encoded maze patterns, wherein the occlusion results in insufficient amount of maze patterns being visible for the computationally efficient algorithm to decode the document image;
labeling the document image with the type based on the number of position encoding bits; and
performing a search algorithm on the document image when an insufficient amount of maze patterns are visible.
2 Assignments
0 Petitions
Accused Products
Abstract
Disclosed embodiments of the invention relate to analyzing document images, which contain positionally encoded information, such as a maze-pattern watermark, and labeling the images based on a degree to which the document'"'"'s content, such as text, occludes the position-encoding information. Depending on the degree of such occlusion, it may not be possible to extract enough position-encoding bits from a camera-captured image of the document to determine the camera-captured image'"'"'s location within the document. An analysis-and-labeling module receives, as input, image data output by an image-generation-and-capturing module and off-line training data; performs analysis-and-labeling processing; and outputs image-label information. The results of document-analysis-and-labeling processing may be used for efficiently determining a location of a camera-captured image within a positionally encoded document.
-
Citations
39 Claims
-
1. A method of labeling a document image containing positionally encoded maze patterns for computationally efficient decoding, the method comprising:
-
obtaining the document image; analyzing the document image to determine a number of position encoding bits that can be extracted from the document image, the analyzing including; dividing the document image into blocks having substantially a same size as maze pattern cells; determining whether the blocks are occluded by document content; counting, for each pixel in the document image, a number of completely visible blocks in a neighboring window with the pixel being as a center of the window; and labeling the pixel based on the number; performing a thresholding algorithm on the document image to determine if the document image is of a type selected from at least; a first type containing sufficient amount of visible positionally encoded maze patterns for a computationally efficient algorithm to decode the document image, and a second type containing document content that occludes at least a portion of the positionally encoded maze patterns, wherein the occlusion results in insufficient amount of maze patterns being visible for the computationally efficient algorithm to decode the document image; labeling the document image with the type based on the number of position encoding bits; and performing a search algorithm on the document image when an insufficient amount of maze patterns are visible. - View Dependent Claims (2, 3, 4)
-
-
5. A method of labeling a camera-captured image containing positionally encoded maze patterns for computationally efficient decoding, the method comprising:
-
obtaining the camera-captured image; analyzing the camera-captured image to determine a number of position encoding bits that can be extracted from the camera-captured image, the analyzing including; dividing the camera-captured image into blocks having substantially the same size as maze pattern cells; determining whether the blocks are occluded by document content; counting, for each pixel in the camera-captured image, a number of completely visible blocks in a neighboring window with the pixel being as a center of the window; and labeling the pixel based on the number; performing a thresholding algorithm on the camera-captured image to determine if the camera captured image is of a type selected from at least; a first type containing positionally encoded maze patterns for a computationally efficient algorithm to decode the camera-captured image, and a second type containing document content that occludes at least a portion of the positionally encoded maze patterns, wherein the occlusion results in insufficient amount of maze patterns being visible for the computationally efficient algorithm to decode the camera-captured image; labeling the camera-captured image with the type based on the number of position encoding bits; and performing a search algorithm on the camera-captured image when an insufficient amount of maze patterns are visible. - View Dependent Claims (6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A system, implemented at least in part by a computing device, that labels a document image containing positionally encoded maze patterns for computationally efficient decoding, the system comprising:
-
an image-generation-and-capturing module including an image capturing pen that obtains the document image; and an analysis-and-labeling module that analyzes the document image to determine a number of position encoding bits that can be extracted from the document image, the analyzing including; dividing the document image into blocks having substantially the same size as maze pattern cells; determining whether the blocks are occluded by document content; counting, for each pixel in the document image, a number of completely visible blocks in a neighboring window with the pixel being as a center of the window; and labeling the pixel based on the number; wherein the analysis-and-labeling module further performs a thresholding algorithm and labels the document image as being of a type selected from at least; a first type containing sufficient amount of visible positionally encoded maze patterns for a computationally efficient algorithm to decode the document image, and a second type containing document content that occludes at least a portion of the positionally encoded maze patterns, wherein the occlusion results in insufficient amount of maze patterns being visible for the computationally efficient algorithm to decode the document image, wherein the analysis-and-labeling module further performs a search algorithm on the document image when an insufficient amount of maze patterns are visible. - View Dependent Claims (15, 16, 17)
-
-
18. A system, implemented at least in part by a computing device, that labels a camera-captured image containing positionally encoded maze patterns for computationally efficient decoding, the system comprising:
-
an image-generation-and-capturing module including an image capturing pen that obtains the camera-captured image; and an analysis-and-labeling module that that analyzes the camera-captured image to determine a number of position encoding bits that can be extracted from the camera-captured image, the analyzing including; dividing the camera-captured image into blocks having substantially the same size as maze pattern cells; determining whether the blocks are occluded by document content; counting, for each pixel in the camera-captured image, a number of completely visible blocks in a neighboring window with the pixel being as a center of the window; and labeling the pixel based on the number; wherein the analysis-and-labeling module further performs a thresholding algorithm on the camera-captured image and labels the camera-captured image as being of a type selected from at least; a first type containing sufficient amount of visible positionally encoded maze patterns for a computationally efficient algorithm to decode the camera-captured image, and a second type containing document content that occludes at least a portion of the positionally encoded maze patterns, wherein the occlusion results in insufficient amount of maze patterns being visible for the computationally efficient algorithm to decode the camera-captured image, wherein the analysis-and-labeling module further performs a search algorithm on the camera-captured image when an insufficient amount of maze patterns are visible. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26)
-
-
27. A computer-readable medium containing computer-readable instructions for labeling a document image containing positionally encoded maze patterns for computationally efficient decoding, by performing steps comprising:
-
obtaining the document image; analyzing the document image to determine a number of position encoding bits that can be extracted from the document image, the analyzing including; dividing the document image into blocks having substantially the same size as maze pattern cells; determining whether the blocks are occluded by document content; counting, for each pixel in the document imager a number of completely visible blocks in a neighboring window with the pixel being as a center of the window; and labeling the pixel based on the number; performing a thresholding algorithm on the document image to determine if the image is of a type selected from at least; a first type containing sufficient amount of visible positionally encoded maze patterns for a computationally efficient algorithm to decode the document image, and a second type containing document content that occludes at least a portion of the positionally encoded maze patterns, wherein the occlusion results in insufficient amount of maze patterns being visible for the computationally efficient algorithm to decode the document image; labeling the document image with the type based on the number of position encoding bits; and performing a search algorithm on the document image when an insufficient amount of maze patterns are visible. - View Dependent Claims (28, 29, 30)
-
-
31. A computer-readable medium containing computer-readable instructions for labeling a camera-captured image containing positionally encoded maze patterns for computationally efficient decoding, by performing steps comprising:
-
obtaining the camera-captured image; analyzing the camera-captured image to determine a number of position encoding bits that can be extracted from the camera-captured image, the analyzing including; dividing the camera-captured image into blocks having substantially the same size as maze pattern cells; determining whether the blocks are occluded by document content; counting, for each pixel in the camera-captured image, a number of completely visible blocks in a neighboring window with the pixel being as a center of the window; and labeling the pixel based on the number; performing a thresholding algorithm on the camera-captured image to determine if the camera-captured image is of a type selected from at least; a first type containing sufficient amount of visible positionally encoded maze patterns for a computationally efficient algorithm to decode the camera-captured image and a second type containing document content that occludes at least a portion of the positionally encoded maze patterns, wherein the occlusion results in insufficient amount of maze patterns being visible for the computationally efficient algorithm to decode the camera-captured image; labeling the camera-captured image with the type based on the number of position encoding bits; and performing a search algorithm on the camera-captured image when an insufficient amount of maze patterns are visible. - View Dependent Claims (32, 33, 34, 35, 36, 37, 38, 39)
-
Specification