Method of analyzing digital document images
First Claim
1. A method for analyzing an input image x with K1×
- K2 pixels x(r,s) where (r,s) denotes the pixel location with r=1, 2, . . . , K1 indicating the image row, and s=1, 2, . . . , K2 indicating the image column, the input image being one of a digitized image stored in a memory or a scanned image from a scanner, comprising;
using a processor to form a feature image z from the input image x by;
dividing the input image x into a plurality of blocks of pixels each block having a block size of g1×
g2 pixels, where g1 denotes a number of image rows in a block and g2 denotes a number of image columns in a block,associating each block of pixels in the input image x with a single pixel in the feature image z, with the feature image z consisting of K1/g1×
K2/g2 pixels, andoutputting the feature image z for further analysis or storage in a memory,wherein the feature image z is a two-channel image with feature pixels z(m,n)=[f(m,n), b(m,n)] , for m=1, 2, . . . , K1/g1 and n=1, 2, . . . , K2 /g2 , wherein f(m,n) and b(m,n) denote, respectively, the foreground and background components of a feature pixel z(m,n),wherein the foreground and background components of a feature pixel z(m,n) are respectively defined as follows;
f(m,n)=min{x(r,s);
(m−
1)g1<
r≦
mg1,(n−
1)g2<
s<
ng2}
b(m,n)=max{x(r,s);
(m−
1)g1<
r≦
mg1,(n−
1)g2<
s≦
ng2}where min and max are the minimum and maximum operators, andwherein the processor detects pages according to;
2 Assignments
0 Petitions
Accused Products
Abstract
Analyzing an input image, the input image being one of a digitized image stored in a memory or a scanned image from a scanner. Forming a feature image from the input image by dividing the input image into a plurality of blocks of pixels, thus associating each block of pixels in the input image with a single pixel in the feature image, and outputting the feature image for further analysis or storage in a memory. Example embodiments extract and analyze features from a document image to detect particular characteristics associated with the page area, the distortion area, and the book spine area. Extracted features can be further analyzed to detect document characteristics at the paragraph, line, word, and character levels.
-
Citations
11 Claims
-
1. A method for analyzing an input image x with K1×
- K2 pixels x(r,s) where (r,s) denotes the pixel location with r=1, 2, . . . , K1 indicating the image row, and s=1, 2, . . . , K2 indicating the image column, the input image being one of a digitized image stored in a memory or a scanned image from a scanner, comprising;
using a processor to form a feature image z from the input image x by; dividing the input image x into a plurality of blocks of pixels each block having a block size of g1×
g2 pixels, where g1 denotes a number of image rows in a block and g2 denotes a number of image columns in a block,associating each block of pixels in the input image x with a single pixel in the feature image z, with the feature image z consisting of K1/g1×
K2/g2 pixels, andoutputting the feature image z for further analysis or storage in a memory, wherein the feature image z is a two-channel image with feature pixels z(m,n)=[f(m,n), b(m,n)] , for m=1, 2, . . . , K1/g1 and n=1, 2, . . . , K2 /g2 , wherein f(m,n) and b(m,n) denote, respectively, the foreground and background components of a feature pixel z(m,n), wherein the foreground and background components of a feature pixel z(m,n) are respectively defined as follows;
f(m,n)=min{x(r,s);
(m−
1)g1<
r≦
mg1,(n−
1)g2<
s<
ng2}
b(m,n)=max{x(r,s);
(m−
1)g1<
r≦
mg1,(n−
1)g2<
s≦
ng2}where min and max are the minimum and maximum operators, and wherein the processor detects pages according to; - View Dependent Claims (2, 3, 4, 5, 6, 7)
- K2 pixels x(r,s) where (r,s) denotes the pixel location with r=1, 2, . . . , K1 indicating the image row, and s=1, 2, . . . , K2 indicating the image column, the input image being one of a digitized image stored in a memory or a scanned image from a scanner, comprising;
-
8. A device for analyzing an input image x with K1 ×
- K2 pixels x(r,s) where (r,s) denotes the pixel location with r=1, 2, . . . , K1 indicating the image row, and s=1, 2, . . . , K2 indicating the image column, comprising;
an image capture unit that captures input image x; a memory that stores input image x; and a processor that forms a feature image z from the input image x by; dividing the input image x into a plurality of blocks of pixels each block having a block size of g1×
g2 pixels, where g1 denotes a number of image rows in a block and g2 denotes a number of image columns in a block,associating each block of pixels in the input image x with a single pixel in the feature image z, with the feature image z consisting of K1/g1×
K/g2 pixels, andoutputting the feature image z for further analysis or storage in a memory, wherein the feature image z is a two-channel image with feature pixels z(m,n)=[f(m,n),b(m,n)], for m=1, 2, . . . , K1/g2 and n=1, 2, . . . , K2/g2, wherein f(m,n) and b(m,n) denote, respectively, the foreground and background components of a feature pixel z(m,n), wherein the foreground and background components of a feature pixel z(m,n) are respectively defined as follows;
f(m,n)=min{x(r,s);
(m−
1)g1<
r≦
mg1, (n−
1)g2<
s≦
ng2}
b(m,n)=max{x(r,s);
(m−
1)g1<
r≦
mg1,(n−
1)g2<
s≦
ng2}where min and max are the minimum and maximum operators, and wherein the processor detects pages according to; - View Dependent Claims (9)
- K2 pixels x(r,s) where (r,s) denotes the pixel location with r=1, 2, . . . , K1 indicating the image row, and s=1, 2, . . . , K2 indicating the image column, comprising;
-
10. One or more tangible computer-readable media having computer-readable instructions thereon, which, when executed by a processor analyze an input image x with K1 ×
- K2 pixels x(r,s) where (r,s) denotes the pixel location with r=1,2, . . . , K1 indicating the image row, and s=1, 2, . . . , K2 indicating the image column, the input image being one of a digitized image stored in a memory or a scanned image from a scanner, wherein;
the processor forms a feature image z from the input image x by; dividing the input image x into a plurality of blocks of pixels each block having a block size of g1×
g2 pixels, where g1 denotes a number of image rows in a block and g2 denotes a number of image columns in a block,associating each block of pixels in the input image x with a single pixel in the feature image z, with the feature image z consisting of K1/g1×
K2 /g2 pixels, andoutputting the feature image z for further analysis or storage in a memory, wherein the feature image z is a two-channel image with feature pixels z(m,n)=[f(m,n),b(m,n)], for m=1, 2, . . . , K1/g2 and n=1, 2, . . . , K2/g2, wherein f(m,n) and b(m,n) denote, respectively, the foreground and background components of a feature pixel z(m,n), wherein the foreground and background components of a feature pixel z(m,n) are respectively defined as follows;
f(m,n)=min{x(r,s);
(m−
1)g1<
r≦
mg1, (n−
1)g2<
s≦
ng2}
b(m,n)=max{x(r,s);
(m−
1)g1<
r≦
mg1, (n−
1)g2<
s≦
ng2}where min and max are the minimum and maximum operators, and wherein the processor detects pages according to; - View Dependent Claims (11)
- K2 pixels x(r,s) where (r,s) denotes the pixel location with r=1,2, . . . , K1 indicating the image row, and s=1, 2, . . . , K2 indicating the image column, the input image being one of a digitized image stored in a memory or a scanned image from a scanner, wherein;
Specification