×

Computer system and method for data base indexing and information retrieval

  • US 4,817,036 A
  • Filed: 03/15/1985
  • Issued: 03/28/1989
  • Est. Priority Date: 03/15/1985
  • Status: Expired due to Fees
First Claim
Patent Images

1. In a computer system comprising a CPU, an input/output terminal connected to said CPU, a main CPU memory and a secondary storage means containing a data base, a method of indexing individual records of said data base, and rapidly searching and retrieving selected records corresponding to one or more keywords input to said CPU, said method comprising the steps of:

  • said CPU forming a vector for each said keyword, each said vector comprising one or more array elements which together comprise a numerically sorted list of all record numbers where the keyword for that vector is found;

    said CPU transforming each said vector so as to form a data base index comprising a bit string for each said vector, said step of transforming each said vector comprising the steps of;

    (a) transforming said numerically sorted list of record numbers into a binary matrix wherein each row of said matrix corresponds to a binary representation of one of said vector array elements, and wherein each column of said matrix corresponds to a level of said hierarchal tree;

    (b) determining the first column of said matrix where both ones and zeros are present;

    (c) grouping said ones and zeros to identify the number of bits in each such group;

    (d) determining whether the first and last bit in each said group are both ones, are zero and one or both zeros, and outputting a "01," "11" or "10," respectively, so as to form one bit pair of said bit string;

    (e) splitting the next column of said matrix into groups of bits based on the number of bits in each group determined in step (c);

    (f) repeating steps (c) and (d) for each said group of said next column; and

    (g) repeating steps (b) through (f) until each column of said matrix has been done.said CPU storing said data base index in said secondary storage means;

    inputting at said input/output terminal at least one keyword;

    said CPU searching said data base index and retrieving the bit string for said keyword input at said terminal;

    said CPU transforming said retrieving bit string back into the vector for said input keyword; and

    said CPU identifying at said input/output terminal the records of said data base identified by said list of record numbers associated with the vector for said input keyword.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×