Non-edit multiple image font processing of records
First Claim
1. A method of inputting and preparing data from source documents for storage and subsequent retrieval comprising the steps of scanning each source document and forming signals representative of digitized patterns derived from images of characters and graphics thereon, storing the signals representative of the digitized patterns, selecting segments of the stored signals for further processing, converting signals representative of digitized patterns of characters into a machine code, storing as ambiguous characters the digitized patterns of each character not successfully converted into machine code including storing as ambiguous words each group of characters which includes converted characters and at least one ambiguous character, and storing the digitized patterns of the selected segments correlated with the machine code for subsequent use.
1 Assignment
0 Petitions
Abstract
A sequence of documents is delivered to an optical scanner in which each document is scanned to form a digital image representation of the content of the document. The image representation is automatically examined by data processing apparatus to select search words which meet predetermined criteria and by which the document can be subsequently located. The search words are stored in a non-volatile memory and the entire document content is stored in mass storage in image form. A font table is established and images of entered search words are constructed from the table. Unrecognized or imperfectly formed ambiguous characters are stored with the font table and are used in the construction of search words to eliminate text editing or are stored with converted characters of search words. The approach can also be used to eliminate editing of full text-converted data bases during input.
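The font-table idea in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the `FontTable` class, its `add_ambiguous` and `render` methods, and the tiny three-row bitmaps are all hypothetical names and data chosen for illustration. The sketch assumes a table that maps each recognized character to a glyph image, keeps unrecognized (ambiguous) character images alongside it, and builds a search-word image by placing glyphs side by side.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple, Union

# Hypothetical glyph format: rows of a tiny bitmap, '#' = ink, '.' = blank.
Glyph = Tuple[str, ...]


@dataclass
class FontTable:
    # Recognized characters: machine code -> glyph image.
    glyphs: Dict[str, Glyph] = field(default_factory=dict)
    # Images of characters that could not be converted to machine code.
    ambiguous: List[Glyph] = field(default_factory=list)

    def add_ambiguous(self, glyph: Glyph) -> int:
        """Store an unrecognized pattern and return its slot index."""
        self.ambiguous.append(glyph)
        return len(self.ambiguous) - 1

    def render(self, word: List[Union[str, int]]) -> Glyph:
        """Build a word image. Each item is either a character (looked up
        in the font table) or an int slot index of an ambiguous glyph."""
        parts = [
            self.ambiguous[item] if isinstance(item, int) else self.glyphs[item]
            for item in word
        ]
        height = len(parts[0])  # assumes all glyphs share one height
        return tuple(" ".join(p[row] for p in parts) for row in range(height))


table = FontTable(glyphs={"I": ("#", "#", "#"), "T": ("###", ".#.", ".#.")})
slot = table.add_ambiguous(("#.#", ".#.", "#.#"))  # a character the OCR step rejected
image = table.render(["T", slot, "I"])
for row in image:
    print(row)
```

Because the ambiguous glyph is reproduced as an image rather than guessed at, the rendered search word can be displayed without anyone having edited the text, which is the effect the abstract describes.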
45 Citations
20 Claims
1. A method of inputting and preparing data from source documents for storage and subsequent retrieval comprising the steps of
scanning each source document and forming signals representative of digitized patterns derived from images of characters and graphics thereon,
storing the signals representative of the digitized patterns,
selecting segments of the stored signals for further processing,
converting signals representative of digitized patterns of characters into a machine code,
storing as ambiguous characters the digitized patterns of each character not successfully converted into machine code including storing as ambiguous words each group of characters which includes converted characters and at least one ambiguous character, and
storing the digitized patterns of the selected segments correlated with the machine code for subsequent use.
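The converting and ambiguous-storage steps of claim 1 can be sketched roughly as follows. This is an illustrative assumption, not the method as the specification implements it: the exact-match `KNOWN` lookup stands in for a real character recognizer, and the names `convert`, `process_segment`, and the sample patterns are hypothetical. A word whose characters all convert is stored as machine code; a word containing at least one unconverted character is stored as an ambiguous word that keeps the digitized pattern in place of the missing code.

```python
from typing import Dict, List, Optional, Tuple, Union

# Hypothetical digitized pattern: rows of a tiny bitmap.
Pattern = Tuple[str, ...]

PAT_C: Pattern = ("##", "#.", "##")
PAT_A: Pattern = (".#", "##", "#.")
PAT_T: Pattern = ("##", ".#", ".#")
SMUDGE: Pattern = ("#.", ".#", "#.")  # matches no known character

# Stand-in recognizer table (assumption): exact pattern -> machine code.
KNOWN: Dict[Pattern, str] = {PAT_C: "C", PAT_A: "A", PAT_T: "T"}


def convert(pattern: Pattern) -> Optional[str]:
    """Return the machine code for a pattern, or None if conversion fails."""
    return KNOWN.get(pattern)


def process_segment(
    words: List[List[Pattern]],
) -> Tuple[List[str], List[List[Union[str, Pattern]]]]:
    """Split a segment's words into fully converted words and ambiguous words."""
    converted: List[str] = []
    ambiguous: List[List[Union[str, Pattern]]] = []
    for patterns in words:
        items: List[Union[str, Pattern]] = []
        for p in patterns:
            code = convert(p)
            # Keep the raw pattern when no machine code could be assigned.
            items.append(code if code is not None else p)
        if all(isinstance(i, str) for i in items):
            converted.append("".join(items))  # every character converted
        else:
            ambiguous.append(items)  # converted codes plus stored patterns
    return converted, ambiguous


converted, ambiguous = process_segment(
    [[PAT_C, PAT_A, PAT_T], [PAT_C, SMUDGE, PAT_T]]
)
print(converted)
print(ambiguous)
```

The second word is kept as a mixed list of machine codes and one raw pattern, so nothing is discarded and no operator correction is needed at input time.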
Specification