Information retrieval based on rank-ordered cumulative query scores calculated from weights of all keywords in an inverted index file for minimizing access to a main database

  • US 5,263,159 A
  • Filed: 09/18/1990
  • Issued: 11/16/1993
  • Est. Priority Date: 09/20/1989
  • Status: Expired due to Term
First Claim

1. An information retrieval method for use in a computer having a processor, an external storage, a main memory into which data from the external storage can be copied, a sequential file stored in the external storage containing a plurality of retrieval objects, each retrieval object having a unique retrieval object identifier, a transposed file stored in the external storage in which access keys are associated with a set of retrieval object identifiers, and a query inputting device for specifying a query expressed as a combination of access keys given weighting coefficients and a number N of retrieval objects to be output, the information retrieval method comprising the steps of:

  • selecting, using the processor, an access key in the query having the highest weighting coefficient which has not been previously selected, responsive to the inputting of a query;

  • copying data into the main memory from the transposed file which contains the retrieval object identifiers which are associated with the selected access key;

  • calculating, using the processor, for each retrieval object identifier a cumulative query score by adding the weighting coefficient for the selected access key to a previously calculated cumulative query score, if any, for each retrieval object which contains the selected access key;

  • determining, using the processor, for each retrieval object a maximum anticipated score by adding the weighting coefficient for all access keys not previously selected to a previously calculated cumulative query score, if any, for each retrieval object identifier;

  • ranking, using the processor, the cumulative query scores for retrieval object identifiers from highest to lowest to create a rank list;

  • repeating the selecting, copying, calculating, determining and ranking steps until all access keys in the query have been selected or the cumulative query score at the N-th element in the rank list exceeds the maximum anticipated score for any retrieval object with a cumulative query score at the (N+1)-th element in the rank list or below; and

  • reading N retrieval objects corresponding to the N retrieval object identifiers having the highest cumulative query scores from the sequential file into the main memory.
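
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. The function name `top_n_retrieval`, the dictionaries standing in for the transposed (inverted) file and the sequential file, and the sample data are all hypothetical. The sketch assumes non-negative integer weights, and it conservatively treats objects that have not yet appeared in any fetched posting list as having a cumulative score of 0, which is an assumption the claim does not spell out.

```python
def top_n_retrieval(query, n, inverted_index, main_database):
    """Sketch of the claimed loop (hypothetical names, in-memory stand-ins).

    query          : {access_key: weight}, weights assumed non-negative
    inverted_index : {access_key: [object_id, ...]}  (the "transposed file")
    main_database  : {object_id: object}             (the "sequential file")
    Returns (top_ids, objects, keys_read).
    """
    scores = {}                      # cumulative query score per object id
    remaining = sum(query.values())  # total weight of keys not yet selected
    keys_read = []                   # which posting lists were actually fetched
    ranked = []

    # Select access keys from the highest weighting coefficient downward.
    for key, weight in sorted(query.items(), key=lambda kv: kv[1], reverse=True):
        keys_read.append(key)
        remaining -= weight

        # "Copy" the posting list and add the weight to each listed object.
        for obj_id in inverted_index.get(key, ()):
            scores[obj_id] = scores.get(obj_id, 0) + weight

        # Rank cumulative scores from highest to lowest.
        ranked = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)

        # Early termination: the N-th score must exceed the best score any
        # lower-ranked object could still reach (its score plus `remaining`).
        # Assumption: unseen objects count as score 0.
        if len(ranked) >= n:
            nth_score = ranked[n - 1][1]
            next_score = ranked[n][1] if len(ranked) > n else 0
            if nth_score > next_score + remaining:
                break

    # Only now touch the main database, and only for the N winners.
    top_ids = [obj_id for obj_id, _ in ranked[:n]]
    return top_ids, [main_database[i] for i in top_ids], keys_read


# Hypothetical sample data.
inverted_index = {
    "retrieval": [1, 2, 3],
    "index": [1, 2],
    "ranking": [3, 4],
    "file": [2, 4],
}
main_database = {1: "doc one", 2: "doc two", 3: "doc three", 4: "doc four"}
query = {"retrieval": 5, "index": 4, "ranking": 1, "file": 1}

top_ids, objects, keys_read = top_n_retrieval(query, 2, inverted_index, main_database)
print(top_ids, keys_read)
```

In this sample the loop stops after two of the four access keys: objects 1 and 2 each hold a cumulative score of 9, while object 3 can reach at most 5 + 2 = 7 and the unseen object 4 at most 2. The posting lists for "ranking" and "file" are never fetched, and only two objects are read from the main database. The stopping test fixes which N objects are returned; their relative order could still change if the remaining keys were processed.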
