Optical character isolation system, apparatus and method
First Claim
1. Method of producing output data representative of isolated information markings within a class of such markings oriented on a document, the metod comprising the steps of:
- vertically sensing the information markings in a selected sequence to produce word data manifestations of the sensed information markings on a word by word basis where said information markings are in the form of a word and where said class of markings are in the form of text,storing the word data manifestations in substantially similar orientation to the orientation of the sensed information markings on the document,selectively addressing stored word data manifestations oriented with respect to a selected reference location,producing encoded output in recognition of selectively addressed stored word data manifestations as specific ones of the information markings within the class of such markings,deleting from storage the selectively addressed word data manifestations which are recognized, and(a) selectively addressing any remaining stored word data manifestations which are selectively oriented in storage with respect to the locations of the deleted word data manifestations, and(b) successively producing encoded outputs in recognition of any successively addressed word data manifestations, and(c) successively deleting from storage any successively addressed word data manifestations which are recognized, until such recognitions and deletions from storage of word data manifestations extend to a selected orientation limit,andselectively accumulating the encoded outputs to provide isolated output data representative of the information word markings and the orientations thereof on the document.
2 Assignments
0 Petitions
Accused Products
Abstract
Low cost, high-speed, optical character isolation and page reconstruction system, method and apparatus are presented which overcome problems caused by copied pages, noise, underlines, skewed and bowed text, forms features, logos and signatures. As characters or noise are isolated and recognized, their corresponding bit patterns in memory are deleted. Recognized characters are isolated within entire words at a time to form page image records which are then linked to form lines of words. The text on the original page is then reconstructed from lines of words to yield output signals suitable for input to a host word processor.
-
Citations
19 Claims
-
1. Method of producing output data representative of isolated information markings within a class of such markings oriented on a document, the metod comprising the steps of:
-
vertically sensing the information markings in a selected sequence to produce word data manifestations of the sensed information markings on a word by word basis where said information markings are in the form of a word and where said class of markings are in the form of text, storing the word data manifestations in substantially similar orientation to the orientation of the sensed information markings on the document, selectively addressing stored word data manifestations oriented with respect to a selected reference location, producing encoded output in recognition of selectively addressed stored word data manifestations as specific ones of the information markings within the class of such markings, deleting from storage the selectively addressed word data manifestations which are recognized, and (a) selectively addressing any remaining stored word data manifestations which are selectively oriented in storage with respect to the locations of the deleted word data manifestations, and (b) successively producing encoded outputs in recognition of any successively addressed word data manifestations, and (c) successively deleting from storage any successively addressed word data manifestations which are recognized, until such recognitions and deletions from storage of word data manifestations extend to a selected orientation limit, and selectively accumulating the encoded outputs to provide isolated output data representative of the information word markings and the orientations thereof on the document. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. Isolation apparatus for producing output signals indicative of information markings within a class of such markings oriented on a document, the apparatus comprising:
-
transducer means disposed with respect to a document for vertically producing word data signals representative of information markings at selected locations on the document on a word by word basis where said information markings are in the form of words and where said class of markings are in the form of text, storge means coupled to the transducer means for storing word data signals therein in substantially similar orientation to the orientation of the corresponding information markings on the document, circuit means coupled to the storage means for addressing word data signals stored therein at selected locations, recognition circuit means responsive to the addressed data signals in the storage means for producing code signals indicative of the addressed word data signals corresponding substantially to an information word marking within the class of such markings, means coupled to the storage means for removing from storage therein the stored word data signals corresponding to the information marking for which code signals were produced, said circuit means being operative thereafter for addressing any remaining word data signals stored in said storage means at selected locations relative to the locations of previously removed word data signals, said recognition circuit means producing isolation code signals indicative of such remaining addressed word data signals corresponding substantially to information marking within the class of such markings, said means operating to remove from storage the word data signals for which code signals were produced until the removals of word data signals in storage extend to a selected orientation limit, and compiling means coupled to receive said isolation code signals for assembling same in selected sequence to produce isolated output signals representative of the information markings and the orientations thereof on the document.
-
-
19. Document-handling system for producing isolation output signals representative of information markings oriented on the document, the system comprising:
-
sensing means mounted with respect to a document for vertically producing word data signals in response to detection of selected portions of information markings on the document on a word by word basis where said information markings are in the form of words and where said class of markings are in the form of text, transport means coupled to the document for selectively displacing the document incrementally along a direction relative to the sensing means, storage means coupled to the sensing means for storing word data signals at addressable locations therein, circuit means coupled to the storage means for selectively addressing word data signals therein to produce isolation output signals representative of substantially corresponding information markings, and to remove from said storage means the selectively addressed word data signals for which isolated output signals are produced, and control means coupled to said transport means and said storage means and said circuit means for successively displacing the document incrementally at a rate sufficient to maintain the storage means substantially filled with word data signals as the same are selectively addressed and removed therefrom.
-
Specification