×

Document storage and retrieval system

  • US 4,985,863 A
  • Filed: 07/30/1990
  • Issued: 01/15/1991
  • Est. Priority Date: 08/23/1985
  • Status: Expired due to Fees
First Claim
Patent Images

1. A document storage and retrieval system comprising:

  • storage means for storing plural documents of data to be retrieved, said data being stored in the form of character strings, said character strings including multiple candidates of character codes for a character which is not identified during character recognition of characters indicated in said plural documents, said multiple candidates of character codes being located between predetermined special character codes;

    input means for inputting a partial character string retrieval request to initiate a text search for retrieval of a desired document having a desired partial character string out of said plural documents;

    first generation means connected to said input means for generating a hetero-notation and a synonym in response to said partial character string retrieval request by using a hetero-notation convention and a thesaurus and for generating an aggregation of character strings on the basis of said hetero-notation and synonym generation;

    second generation means connected to said first generation means for generating a finite state automaton in the form of a state transition matrix based on said aggregation of character strings generated by said first generation means in accordance with a predetermined rule, and for generating an extended finite state automaton defining predetermined states to transform a character string aggregation when said predetermined special character codes, which indicate the location of multiple candidates of character codes in said character strings, appear during retrieval of said plural documents;

    means responsive to said second generation means for reading characters one-by-one out of said storage means and for verifying whether or not a desired partial character string exists according to said finite state automaton or said extended finite state automaton; and

    means responsive to said reading and verifying means for outputting the documents in which said partial character string exists.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×