Computer method for processing records with images and multiple fonts
Abstract
A sequence of documents is delivered to an optical scanner in which each document is scanned to form a digital image representation of the content of the document. The image representation is automatically examined by data processing apparatus to select search words which meet predetermined criteria and by which the document can be subsequently located. The search words are stored in a non-volatile memory and the entire document content is stored in mass storage in image form. A font table is established and images of entered search words are constructed from the table. Unrecognized or imperfectly formed ambiguous characters are stored with the font table and are used in the construction of search words to ease or eliminate text editing or are stored with converted characters of search words. The approach can also be used to ease or eliminate editing of full text-converted data bases during input.
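As a rough illustration of constructing the image of an entered search word from a font table, the sketch below joins per-character glyph bitmaps side by side and then looks for the result in a stored line image. The bitmap representation (tuples of '0'/'1' strings), the exact-match comparison, and the names `build_word_image` and `find_word` are assumptions made for this example; the abstract does not specify any of them.

```python
# Hypothetical representation: a glyph or image is a tuple of equal-length
# strings, one string per pixel row, using '0' and '1' for pixels.
Image = tuple[str, ...]


def build_word_image(word: str, font_table: dict[str, Image]) -> Image:
    """Join the glyph rows of each character into a single word image."""
    if not word:
        raise ValueError("search word must not be empty")
    glyphs = [font_table[ch] for ch in word]  # KeyError if a character is missing
    height = len(glyphs[0])
    return tuple("".join(g[row] for g in glyphs) for row in range(height))


def find_word(word_image: Image, line_image: Image) -> list[int]:
    """Return the column offsets at which word_image occurs in line_image."""
    width = len(word_image[0])
    line_width = len(line_image[0])
    hits = []
    for col in range(line_width - width + 1):
        # Exact pixel comparison, row by row (an assumption of this sketch).
        if all(line_row[col:col + width] == word_row
               for line_row, word_row in zip(line_image, word_image)):
            hits.append(col)
    return hits


if __name__ == "__main__":
    font_table = {"a": ("10", "01"), "b": ("11", "00")}
    query = build_word_image("ab", font_table)
    stored_line = build_word_image("bab", font_table)
    print(find_word(query, stored_line))
```

A practical system would likely tolerate noise rather than require exact pixel equality; the exact match here only keeps the sketch short.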
10 Claims
1. A computer-implemented method of preparing for storage and retrieval data from source documents comprising the steps of
establishing in computer memory stored patterns of electrical signals forming lexicons of images of characters in at least one font,
comparing signals representative of images of characters from the source documents with stored signals representative of images of characters in the lexicons,
identifying signals representative of images of characters for which no match is found as ambiguous characters; and
storing the signals representative of images of ambiguous characters for use in retrieval of documents in which the ambiguous characters appeared.
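The steps of claim 1 can be sketched, under stated assumptions, as a lookup against a lexicon followed by storage of whatever fails to match. The glyph representation, the exact-match lookup, the "?" placeholder, and the names `AmbiguousStore` and `process_document` are all hypothetical and are not taken from the claim.

```python
from dataclasses import dataclass, field

# Hypothetical representation: a glyph image is a tuple of pixel-row strings.
Glyph = tuple[str, ...]


@dataclass
class AmbiguousStore:
    """Keeps unmatched glyph images together with the documents they came from."""
    entries: dict[Glyph, set[str]] = field(default_factory=dict)

    def add(self, glyph: Glyph, doc_id: str) -> None:
        self.entries.setdefault(glyph, set()).add(doc_id)

    def documents_with(self, glyph: Glyph) -> set[str]:
        """Retrieve the documents in which this ambiguous glyph appeared."""
        return self.entries.get(glyph, set())


def process_document(doc_id: str, glyphs: list[Glyph],
                     lexicon: dict[Glyph, str], store: AmbiguousStore) -> str:
    """Compare each glyph with the lexicon; store the ones with no match."""
    recognized = []
    for glyph in glyphs:
        char = lexicon.get(glyph)  # exact match only (an assumption)
        if char is None:
            store.add(glyph, doc_id)   # ambiguous: keep the image itself
            recognized.append("?")     # placeholder in the converted text
        else:
            recognized.append(char)
    return "".join(recognized)


if __name__ == "__main__":
    A = ("010", "111", "101")
    B = ("110", "111", "110")
    SMUDGE = ("111", "000", "111")
    store = AmbiguousStore()
    print(process_document("doc-1", [A, SMUDGE, B], {A: "A", B: "B"}, store))
    print(store.documents_with(SMUDGE))
```

Because the unmatched image is kept rather than discarded, a later query can present the same image and recover the documents that contained it.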
6. A computer-implemented method of calculating using signals representative of data from source documents comprising the steps of
establishing in computer memory stored patterns of electrical signals forming lexicons of images of characters in at least one font, the characters including letters and numerals,
comparing signals representative of images of characters from the source documents with stored signals representative of images of characters represented by signals in the lexicons,
identifying signals representative of images of characters which represent numerals,
assigning a value to signals representing each numeral found correlated with the signals representing the image of the numeral, and
using the signals representing the images of the numerals to perform calculations.
9. A computer-implemented method of editing data from source documents comprising the steps of
establishing in computer memory stored patterns of signals forming lexicons of images of characters in at least one font, the characters including letters and numerals,
comparing signals representative of images of characters from the source documents with stored signals representative of images of characters in the lexicons,
identifying signals representing images of characters which represent numerals, and
displaying only the characters representing numerals for human review and storing signals representative of other characters without human review.
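The selective review described in claim 9 might be sketched as a partition of the scanned glyphs: numerals go to a queue for a human to check, and everything else is stored directly. The data shapes and the name `split_for_review` are hypothetical, and how the review queue is actually displayed is left out.

```python
# Hypothetical representation: a glyph image is a tuple of pixel-row strings.
Glyph = tuple[str, ...]

DIGITS = frozenset("0123456789")


def split_for_review(
    glyphs: list[Glyph], lexicon: dict[Glyph, str]
) -> tuple[list[tuple[int, str]], list[tuple[int, Glyph]]]:
    """Separate numerals (for human review) from all other glyphs (stored as-is).

    Returns (review, stored): review holds (position, digit) pairs to display,
    stored holds (position, glyph) pairs that bypass review.
    """
    review: list[tuple[int, str]] = []
    stored: list[tuple[int, Glyph]] = []
    for position, glyph in enumerate(glyphs):
        char = lexicon.get(glyph)
        if char is not None and char in DIGITS:
            review.append((position, char))
        else:
            stored.append((position, glyph))
    return review, stored


if __name__ == "__main__":
    A, ONE, UNKNOWN = ("a",), ("1",), ("x",)   # stand-in glyph images
    review, stored = split_for_review([A, ONE, UNKNOWN], {A: "A", ONE: "1"})
    print("for review:", review)
    print("stored without review:", stored)
```

Keeping the position with each item lets reviewed numerals be merged back into the stored record in their original order.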
Specification