×

Method and apparatus for creating, indexing and viewing abstracted documents

  • US 6,002,798 A
  • Filed: 01/19/1993
  • Issued: 12/14/1999
  • Est. Priority Date: 01/19/1993
  • Status: Expired due to Term
First Claim
Patent Images

1. A method for creating a retrieval index by which an image of an arbitrarily formatted document may be retrieved, the method comprising the steps of:

  • processing the image of the arbitrarily formatted document to identify text regions on the document and non-text regions on the document, said processing step including, for each text region on the document, the step of automatically determining a region type using rule-based decisions automatically applied to the image of the text region without regard to a position of the text region in the document and without regard to a predetermined format for the document, the region type being one of plural different predefined region types encompassed by the rules;

    converting the image of the document in text regions into text;

    indexing the converted text so as to permit retrieval by reference to the converted text;

    indexing the automatically determined region types so as to permit retrieval by reference to one of the determined region types; and

    storing the image of the document such that the stored document image may be retrieved by reference to whether text in a text query appears in the indexed text and by reference to whether the text in the text query appears in one of the indexed region types.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×