Non-edit multiple image font processing of records
First Claim
1. A computer-implemented method of retrievably storing contents of a plurality of documents having images imprinted thereon comprisingoptically scanning the documents to generate electrical signals forming a digital representation of the images on the documents,storing the signals forming the digital image representation of each document,establishing a font table in memory including signals forming images of characters in a plurality of different fonts, the signals for images of each character of each font having a unique, identifiable location in a memory area,selectively recognizing and converting groups of characters from signals forming the digital representations of the images into signals representing computer readable code,storing signals forming images of characters which are not recognizable and convertible as ambiguous characters in unique, identifiable locations in the font table,searching for a document by the steps ofselecting a search word,constructing signals forming an image of the selected search word by copying signals representing individual characters from the font table in at least one font,comparing the signals forming the constructed search word image with signals forming the image representations of scanned and stored documents until a match is found,selecting images of the ambiguous characters for use in search word images, andrepeating the step of comparing including comparing signals representing images of ambiguous characters with stored signals representing images of the document contents until a match is found.
1 Assignment
0 Petitions
Accused Products
Abstract
A sequence of documents is delivered to an optical scanner in which each document is scanned to form a digital image representation of the content of the document. The image representation is automatically examined by data processing apparatus to select search words which meet predetermined criteria and by which the document can be subsequently located. The search words are stored in a non-volatile memory and the entire document content is stored in mass storage in image form. A font table is established and images of entered search words are constructed from the table. Unrecognized or imperfectly formed ambiguous characters are stored with the font table and are used in the construction of search words to eliminate text editing or are stored with converted characters of search words. The approach can also be used to eliminate editing of full text-converted data bases during input.
61 Citations
76 Claims
-
1. A computer-implemented method of retrievably storing contents of a plurality of documents having images imprinted thereon comprising
optically scanning the documents to generate electrical signals forming a digital representation of the images on the documents, storing the signals forming the digital image representation of each document, establishing a font table in memory including signals forming images of characters in a plurality of different fonts, the signals for images of each character of each font having a unique, identifiable location in a memory area, selectively recognizing and converting groups of characters from signals forming the digital representations of the images into signals representing computer readable code, storing signals forming images of characters which are not recognizable and convertible as ambiguous characters in unique, identifiable locations in the font table, searching for a document by the steps of selecting a search word, constructing signals forming an image of the selected search word by copying signals representing individual characters from the font table in at least one font, comparing the signals forming the constructed search word image with signals forming the image representations of scanned and stored documents until a match is found, selecting images of the ambiguous characters for use in search word images, and repeating the step of comparing including comparing signals representing images of ambiguous characters with stored signals representing images of the document contents until a match is found.
-
41. A computer-implemented method of retrievably storing contents of a plurality of documents having images imprinted thereon comprising
optically scanning the documents to generate electrical signals forming a digital representation of the images on the documents, storing the signals forming the digital image representation of each document, establishing a font table in memory including signals forming images of characters in a plurality of different fonts, the signals for images of each character of each font having a unique, identifiable location, selecting groups of characters from signals forming the digital representations of the images for use as search words, storing signals representative of images of characters which are imperfectly formed on the document as ambiguous characters in unique, identifiable locations in the font table, searching for a document by the steps of selecting a search word, constructing signals forming an image of the selected search word by copying signals representing individual characters from the font table in at least one font, comparing the signals forming the constructed search word image with signals forming the image representations of scanned documents until a match is found, selecting signals forming images of the ambiguous characters for use in search word images, and repeating the step of comparing including comparing signals representing images of ambiguous characters with stored signals representing images of the document contents until a match is found.
Specification