×

Dictionary and index creating system and document retrieval system

  • US 6,169,999 B1
  • Filed: 04/14/1998
  • Issued: 01/02/2001
  • Est. Priority Date: 05/30/1997
  • Status: Expired due to Term
First Claim
Patent Images

1. A dictionary and index creating system designed to create a regular expression dictionary and a word index on the basis of a retrieval document undergoing retrieval and a word dictionary including words w (where w is one of n words w1, w2, w3, . . . , wn, n being a number greater than 1), said system comprising:

  • a retrieval document storage unit for storing said retrieval document composed of a lineup of a finite number of characters included in a predetermined character set;

    a word dictionary storage unit for storing said word dictionary in which are registered a finite number of words each being a lineup of one or more characters included in said character set;

    means for reading out each word w of said words w1, . . . , wn from said word dictionary in said word dictionary storage unit and further for making out m sets of regular expressions a, b, where m represents a variable to establish one or more regular expression sets and depends on each word w of said words w1, . . . , wn and where each of a, b is indicative of a set of equal length character strings having lengths which are equal to each other, except null sets on said character set, with said regular expressions a, b being determined according to a rule depending on each said word w of said words w1, . . . , wn, itself, or on an attribute of each said word w;

    a regular expression dictionary storage unit for joining said regular expressions a, b to before and after each word w of said words w1, . . . , wn, respectively, to make out m sets of regular expressions awb corresponding to each word w and further for collecting all of said regular expressions awb made out to produce said regular expression dictionary, different from said word dictionary, according to a predetermined rule depending on each word w of said words w1, . . . , wn and for storing said regular expression dictionary;

    means for retrieving a character string matching with a regular expression in said regular expression dictionary from said retrieval document storage unit and further for creating an index element comprising a set of said regular expressions and a matching character positional range in said retrieval document; and

    a word index storage unit for storing a word index made out by a collection of said index elements decided as being non-deducible from other index elements.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×