×

Method and apparatus for processing alphanumeric and graphic information to create a data base

  • US 5,319,745 A
  • Filed: 09/16/1992
  • Issued: 06/07/1994
  • Est. Priority Date: 09/16/1991
  • Status: Expired due to Fees
First Claim
Patent Images

1. Method for processing page form documents, said documents comprising discrete information portions, each of said portions comprising text and graphic fields, to create a data base stored in a computer system of digital representations of said page form documents which can be searched and edited, comprising the following steps:

  • A. creating digitally formatted documents in bitmap form comprising digital representations of said page form documents;

    B. a first processing phase comprising the steps of;

    (1) identifying characteristic elements of each page of said page form documents in order to verify correct pagination of said digitally formatted documents,(2) determining by calculation what angle of rotation must be applied to properly orient each text field of each digitally formatted document for subsequent Optical Character Recognition conversion of said text fields,(3) creating a bitmap mask of said characteristic elements of each page of said page form documents, while allowing for said angle of rotation,(4) identifying said characteristic elements on each digitally formatted document in order to compare and verify said characteristics with said bitmap mask,(5) window-formatting said digitally formatted documents to separate the text and graphics fields each of said portions into blocks of digital information which can be separately accessed,(6) segmenting said blocks to distinguish text and graphics fields so that said fields may be separately stored,(7) correcting and aligning only said text fields by taking into account said angle of rotation to create aligned text fields,(8) reconstructing said digitally formatted documents from said aligned text fields and graphics fields so that each portion of said digitally formatted documents may be separately stored,(9) storing said digitally formatted documents, each portion of said digitally formatted documents, said blocks of digital information which can be separately accessed, and said text and graphic fields, in files which can be edited, and(10) manually correcting errors of digitization, pagination, indexing, segmenting and alignment, andC. a second processing phase comprising Optical Character Recognition conversion of characters contained within said aligned text fields and storing said characters in a file which can be searched.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×