Method and apparatus for imaging, image processing and data compression merge/purge techniques for document image databases
First Claim
1. A method for processing at least two images and storing the images in a database, comprising the steps of:
- (a) scanning the image to create a first digital image thereof;
(b) comparing said first digital image against a codebook of stored digital images;
(c) matching said first digital image with one of said stored digital images;
(d) producing an index code identifying said one of said stored digital images as having matched said first digital image;
(e) subtracting said one of said stored digital images from said first digital image to produce a second digital image;
(f) storing said second digital image together with its respective index code as a record in the database;
(g) repeating steps (a) through (g) at least once for another image;
(h) clustering said stored images with their respective index codes based upon the use of at least one key, wherein said at least, one key comprises a first key and second key;
(i) computing said first key for each record in the database by extracting at least a portion of a first field;
(j) merge sorting the records in the database using said first key;
(k) comparing to each other a predetermined number of sequential reports sorted according to said first key to determine if one or more of the records match;
(l) storing identifiers for any matching records;
(m) computing said second key for each record in the database by extracting at least a portion of a second field;
(n) merge sorting the records in the database using said second key;
(o) comparing to each other a predetermined number of sequential records sorted according to said second key to determine if one or more of the records match;
(p) storing identifiers for any matching records;
(q) creating union of said stored identifiers; and
(r) subjecting said union to transitive closure.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for processing an image, consisting of a foreground and a background, to produce a highly compressed and accurate representation of the image, including the steps of scanning the image to create a digital image of the image, comparing the digital image against a codebook of stored digital images; matching the digital image with one of the stored digital images of the codebook; producing an index code identifying the background of the stored digital image as having matched the digital image; subtracting the stored digital image from the digital image to produce a second digital image representing the foreground of the stored digital image; and storing the second digital image with the index code. Techniques are also provided to enable merge/purge of the database(s) thereby created.
559 Citations
18 Claims
-
1. A method for processing at least two images and storing the images in a database, comprising the steps of:
-
(a) scanning the image to create a first digital image thereof; (b) comparing said first digital image against a codebook of stored digital images; (c) matching said first digital image with one of said stored digital images; (d) producing an index code identifying said one of said stored digital images as having matched said first digital image; (e) subtracting said one of said stored digital images from said first digital image to produce a second digital image; (f) storing said second digital image together with its respective index code as a record in the database; (g) repeating steps (a) through (g) at least once for another image; (h) clustering said stored images with their respective index codes based upon the use of at least one key, wherein said at least, one key comprises a first key and second key; (i) computing said first key for each record in the database by extracting at least a portion of a first field; (j) merge sorting the records in the database using said first key; (k) comparing to each other a predetermined number of sequential reports sorted according to said first key to determine if one or more of the records match; (l) storing identifiers for any matching records; (m) computing said second key for each record in the database by extracting at least a portion of a second field; (n) merge sorting the records in the database using said second key; (o) comparing to each other a predetermined number of sequential records sorted according to said second key to determine if one or more of the records match; (p) storing identifiers for any matching records; (q) creating union of said stored identifiers; and (r) subjecting said union to transitive closure. - View Dependent Claims (2, 3, 4, 5, 6, 18)
-
-
7. A method for processing at least two images, storing the images in a database, and creating a codebook representing portions of at least one of said images, comprising the steps of:
-
(a) scanning each image to create a first digital image thereof; (b) storing each scanned image as a record in a database; (c) computing a first key for each record in the database by extracting at least a portion of a scanned image to create a field image; (d) merge sorting the records in the database using the first key; (e) comparing to each other a predetermined number of sequential records sorted according to the first key to determine if another record has a field image that matches the field image; (f) creating an identifier specific for each such matching field image that identifies the particular record in which the match was found; (g) subtracting the matched field image from the identified record to produce a residual image; (h) storing the residual image with its specific identifier; (i) storing the subtracted, matched field image in a codebook database; (j) computing a second key for each record in the database by extracting at least a portion of a scanned image to create a second field image; (k) merge sorting the records in the database using the second key; (l) comparing to each other a predetermined number of sequential records sorted according to the second key to determine if a record has a field image that matches the second field image; (m) storing an identifier specific for each such matching second field image that identifies the particular record in which the match was found; (n) subtracting the matched second field image from the identified record; and (o) storing the matched second field image in a codebook database. - View Dependent Claims (8, 11, 12, 13, 14, 15, 16, 17)
-
-
9. A method for processing at least two images, storing the image in database, and creating a codebook representing portions of at least one of said images, comprising the steps of:
-
(a) scanning each image to create a first digital image thereof; (b) storing each scanned image as a record in a database; (c) computing a first key for each record in the database by extracting at least a portion of a scanned image to create a field image; (d) merge sorting the records in the database using the first key; (e) comparing to each other a predetermined number of sequential records sorted according to the first key to determine if another record has a field image that matches the field image; (f) creating an identifier specific for each such matching field image that identifies the particular record in which the match was found; (g) subtracting the matched field image from the identified record to produce a residual image; (h) storing the residual image with its specific identifier; (i) storing the subtracted, matched field image in a codebook datebase; (j) computing a second key for each residual image in the database by extracting at least a portion thereof to create a second field image; (k) merge sorting the residual images in the database using the second key; (l) comparing to each other a predetermined number of sequential residual images sorted according to the second key to determine if a residual image has a field image that matches the second field image; (m) creating an identifier specific for each such matching second field image that identifies the particular residual image in which the match was found; (n) subtracting the matched second field image from the respective residual image; (o) storing the residual image with its specific identifier; and (p) storing the subtracted, matched second field image in a codebook database. - View Dependent Claims (10)
-
Specification