Document storage and retrieval system for storing and retrieving document image and full text data

  • US 5,628,003 A
  • Filed: 08/24/1993
  • Issued: 05/06/1997
  • Est. Priority Date: 08/23/1985
  • Status: Expired due to Term
First Claim

1. A document storage and retrieval system for storing and retrieving textual documents, comprising:

    image file means for storing textual documents which are digital image data, said textual documents including bibliographic items providing bibliographic information of said textual documents and body text data providing data of text found in bodies of said textual documents;

    document recognition means, coupled to said image file means, for recognizing said textual documents, said document recognition means includes;

    (a) means for extracting pattern elements forming character patterns from said digital image data,

    (b) a document knowledge file for storing regulations of a layout of said bibliographic items in said textual documents as document knowledge,

    (c) character segmentation means for extracting character patterns by analyzing said pattern elements with reference to said document knowledge in said document knowledge file, and

    (d) recognition means for recognizing said extracted character patterns, said recognition means outputs a recognition result including said bibliographic items and said body text data with a layout structure name corresponding to the recognition result;

    data base file means, coupled to said document recognition means, for storing said bibliographic items and information as bibliographic information of said outputted recognition result with said layout structure name;

    text file means, coupled to said document recognition means, for storing at least said body text data as document contents of recognized textual documents;

    input means for inputting a request of a search keyword;

    retrieval means, coupled to said image file means, said data base file means, said text file means and said input means, for retrieving digital image data of at least one textual document which includes said search keyword based on said stored bibliographic information and said stored body text data; and

    output means, coupled to said retrieval means, for outputting said retrieved digital image data of at least one textual document.
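
The storage and retrieval elements recited above can be illustrated with a minimal sketch. It is not the patented implementation: the class names, field names, in-memory dictionaries, and case-insensitive substring matching are all assumptions made for illustration, and the recognition step (pattern extraction, segmentation, character recognition) is assumed to have already produced its result. The sketch treats a document as matching when the keyword appears in either the stored bibliographic items or the stored body text, which is one possible reading of the retrieval element.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class RecognizedDocument:
    """Hypothetical recognition result for one scanned document."""
    doc_id: str
    image_data: bytes              # digital image data of the document
    bibliographic: dict[str, str]  # layout structure name -> recognized item
    body_text: str                 # recognized body text


@dataclass
class DocumentStore:
    """Illustrative stand-ins for the image file, data base file and text file."""
    image_file: dict[str, bytes] = field(default_factory=dict)
    database_file: dict[str, dict[str, str]] = field(default_factory=dict)
    text_file: dict[str, str] = field(default_factory=dict)

    def store(self, doc: RecognizedDocument) -> None:
        # Each part of the recognition result goes to its own store,
        # keyed by the same document identifier.
        self.image_file[doc.doc_id] = doc.image_data
        self.database_file[doc.doc_id] = dict(doc.bibliographic)
        self.text_file[doc.doc_id] = doc.body_text

    def retrieve(self, keyword: str) -> dict[str, bytes]:
        # Assumption: a simple case-insensitive substring match.
        needle = keyword.casefold()
        hits: dict[str, bytes] = {}
        for doc_id, image in self.image_file.items():
            items = self.database_file.get(doc_id, {})
            in_biblio = any(needle in value.casefold() for value in items.values())
            in_body = needle in self.text_file.get(doc_id, "").casefold()
            if in_biblio or in_body:
                # The result returned to the user is the image data,
                # not the recognized text.
                hits[doc_id] = image
        return hits


store = DocumentStore()
store.store(RecognizedDocument(
    doc_id="doc-1",
    image_data=b"<image bytes for doc-1>",
    bibliographic={"title": "Character Segmentation Methods", "author": "A. Example"},
    body_text="This paper surveys layout analysis for scanned pages.",
))
store.store(RecognizedDocument(
    doc_id="doc-2",
    image_data=b"<image bytes for doc-2>",
    bibliographic={"title": "Database Indexing", "author": "B. Example"},
    body_text="Indexes speed up keyword lookup.",
))

by_title = store.retrieve("segmentation")  # matches a bibliographic item
by_body = store.retrieve("keyword")        # matches body text only
```

Keeping the three stores separate but keyed by a shared identifier mirrors the claim's structure: the search runs over recognized text, while the output is the original image.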
