Image processing system with on-the-fly JPEG compression
First Claim
1. A personal imaging computer system which scans in documents and determines the identity of printed characters on the scanned-in documents, said system comprising:
a scanner for scanning in lines of the document so as to form lines of gray-scale document information;
an on-the-fly compression processor which operates in coordination with the scanner to compress, using lossy compression, the lines of gray-scale document information so as to form a compressed document image which includes compressed printed character images;
decompression means for decompressing the compressed document image and the compressed printed character images in the compressed document image so as to form a decompressed gray-scale document image which includes gray-scale printed character images containing artifacts due to lossy compression by said compression processor and decompression by said decompression means;
optical-character-recognition-processing means for gray-scale OCR identification of the artifacted gray-scale printed character images in the decompressed gray-scale document image so as to obtain computerized character codes which correspond to the printed characters; and
storing means for storing the compressed document image in association with a text file containing the character codes determined by said optical-character-recognition-processing means.
Abstract
A personal imaging computer system includes an on-the-fly compression processor which operates in coordination with a document scanner to compress document image information as the document is being scanned. Once the document has been scanned and compressed, the personal imaging computer system operates to determine the identity of characters on the scanned-in document by decompressing the compressed image, processing the decompressed image so as to identify characters in the image, and by storing a text file which contains the identity of the characters so-determined. Preferably, the text file is stored in association with the compressed image so that the image may be retrieved in response to a text-based search of plural such text files.
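The flow described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: gray-level quantization followed by `zlib` stands in for JPEG as the lossy compressor, the OCR step is an injected callable (`ocr`) because the abstract does not specify a recognition algorithm, and the names `OnTheFlyCompressor`, `scan_and_index`, `search`, and `QUANT_SHIFT` are hypothetical.

```python
import zlib

# Assumption: 8-bit gray samples; dropping the low 4 bits is a stand-in
# for the lossy stage of a real JPEG encoder.
QUANT_SHIFT = 4


class OnTheFlyCompressor:
    """Compresses each scan line as it arrives, so the raw page is never buffered."""

    def __init__(self):
        self._deflater = zlib.compressobj()
        self._chunks = []

    def add_line(self, line: bytes) -> None:
        # Lossy step: discard the low-order bits of every gray-scale sample.
        quantized = bytes((px >> QUANT_SHIFT) << QUANT_SHIFT for px in line)
        self._chunks.append(self._deflater.compress(quantized))

    def finish(self) -> bytes:
        # Flush whatever the streaming compressor is still holding.
        self._chunks.append(self._deflater.flush())
        return b"".join(self._chunks)


def decompress(blob: bytes, width: int) -> list[bytes]:
    """Rebuilds the gray-scale lines, including the quantization artifacts."""
    raw = zlib.decompress(blob)
    return [raw[i:i + width] for i in range(0, len(raw), width)]


def scan_and_index(doc_id, scan_lines, width, ocr, store):
    """Compresses while scanning, then decompresses, runs OCR, and stores the result."""
    compressor = OnTheFlyCompressor()
    for line in scan_lines:  # lines are consumed as the scanner delivers them
        compressor.add_line(line)
    compressed = compressor.finish()

    text = ocr(decompress(compressed, width))  # OCR sees the artifacted image
    # The compressed image is stored in association with its text file.
    store[doc_id] = {"image": compressed, "text": text}
    return store[doc_id]


def search(store, query):
    """Text-based search: returns IDs of documents whose OCR text contains the query."""
    needle = query.lower()
    return [doc_id for doc_id, rec in store.items() if needle in rec["text"].lower()]


if __name__ == "__main__":
    store = {}
    fake_lines = [bytes([0, 17, 200, 255]), bytes([15, 16, 31, 128])]
    # Placeholder OCR: a real system would recognize characters in the image.
    scan_and_index("doc-1", iter(fake_lines), 4, lambda image: "Hello scanner", store)
    print(search(store, "scanner"))
```

The point of the sketch is the ordering: compression happens inside the scan loop, while decompression and recognition run only after the whole document has been compressed, and the stored record keeps the compressed image rather than the decompressed one.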
28 Claims
1. A personal imaging computer system which scans in documents and determines the identity of printed characters on the scanned-in documents, said system comprising:
a scanner for scanning in lines of the document so as to form lines of gray-scale document information;
an on-the-fly compression processor which operates in coordination with the scanner to compress, using lossy compression, the lines of gray-scale document information so as to form a compressed document image which includes compressed printed character images;
decompression means for decompressing the compressed document image and the compressed printed character images in the compressed document image so as to form a decompressed gray-scale document image which includes gray-scale printed character images containing artifacts due to lossy compression by said compression processor and decompression by said decompression means;
optical-character-recognition-processing means for gray-scale OCR identification of the artifacted gray-scale printed character images in the decompressed gray-scale document image so as to obtain computerized character codes which correspond to the printed characters; and
storing means for storing the compressed document image in association with a text file containing the character codes determined by said optical-character-recognition-processing means.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 17
9. A personal imaging computer system which scans in documents and determines the identity of printed characters on the scanned-in documents, said system comprising:
a scanner for scanning in lines of the document so as to form lines of gray-scale document information;
an on-the-fly compression processor which operates in coordination with the scanner to compress, using lossy compression, the lines of gray-scale document information so as to form a compressed document image which includes compressed printed character images;
a memory for storing the compressed document image and for storing process steps for processing the compressed document image; and
a processor for executing the stored process steps;
wherein the process steps stored in said memory include process steps to (a) decompress the compressed document image and the compressed printed character images in the compressed document image so as to form a decompressed gray-scale image which includes gray-scale printed character images containing artifacts due to lossy compression by said compression processor and subsequent decompression, (b) optical-character recognition process the gray-scale printed character images in the decompressed gray-scale image so as to obtain computerized character codes which correspond to the printed characters, and (c) store in a memory a text file containing the identity of the character codes so obtained.
Dependent claims: 10, 11, 12, 13, 14, 15, 16, 18
19. A personal imaging computer system which scans in documents and determines the identity of printed characters on the scanned-in documents, said system comprising:
a scanner interface which collects gray-scale document information from a scanner which scans in the document;
a compression processor which performs lossy compression on the gray-scale document information so as to form a compressed document image which includes compressed printed character images;
a decompresser which decompresses the compressed document image and the compressed printed character images in the compressed document image so as to form a decompressed gray-scale document image which includes gray-scale printed character images containing artifacts due to lossy compression by said compression processor and decompression by said decompresser;
an optical-character-recognition processor which processes the artifacted gray-scale printed character images in the decompressed gray-scale document image so as to obtain computerized character codes which correspond to the printed characters; and
a memory which stores the compressed document image in association with a text file containing the character codes obtained by said recognition processor.
Dependent claims: 20, 21, 22, 23, 24, 25, 26, 27, 28
Specification