×

Knowledge-based document analysis system

  • US 5,937,084 A
  • Filed: 05/22/1996
  • Issued: 08/10/1999
  • Est. Priority Date: 05/22/1996
  • Status: Expired due to Term
First Claim
Patent Images

1. A system for analyzing a target document including at least one informational element, the system comprising:

  • (a) means for receiving a digitized image of the target document;

    (b) means for extracting low level features from the digitized image;

    (c) means for classifying the document based upon the extracted low level features from the digitized image to identify a most probable document type from a plurality of possible document classes that the target document most closely matches, wherein the means for classifying performs the steps of;

    (i) extracting a sample immediate feature set from at least one sample document for each document class, wherein each sample immediate feature set includes at least one feature of a sample document;

    (ii) Generating a sample indirect feature set for each sample document;

    (iii) generating a target document immediate feature set and a target document indirect feature set, the target document immediate feature set comprising information describing a location and a type indicator for basic image features of the target document, and the target document indirect feature set comprising information summarizing attributes of the immediate features in the target document immediate feature set;

    (iv) comparing the target document indirect feature set with each of the sample indirect feature sets; and

    (v) classifying the target document responsive to the comparison of step (iv) to determine the most probable document type for the target document; and

    (d) means for analyzing the target document in order to extract informational data associated with the at least one informational element based upon the most probable document type identified by the classifying means.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×