×

System and method for optimized source selection in an information retrieval system

  • US 5,960,422 A
  • Filed: 11/26/1997
  • Issued: 09/28/1999
  • Est. Priority Date: 11/26/1997
  • Status: Expired due to Fees
First Claim
Patent Images

1. In a distributed information system including databases as sources of documents for query searching, a method of optimizing the selection of sources for satisfying a query, comprising the steps of:

  • a) forming a training set of documents by randomly selecting significant portions of the documents from each of the sources;

    b) forming a test set of documents by using the set of documents excluded in the training set;

    c) defining each document in the training and test set in terms of features/attributes and a name as samples representing individual sources;

    d) processing the samples using an algorithm to recognize patterns in the documents which distinguish one source from another source;

    e) generating a set of rules from the patterns as a model using the algorithm; and

    f) applying to the model a query in terms of desired features/attributes to predict the optimum sources satisfying the query.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×