×

Method for detecting and extracting text data using database schemas

  • US 5,717,913 A
  • Filed: 01/03/1995
  • Issued: 02/10/1998
  • Est. Priority Date: 01/03/1995
  • Status: Expired due to Term
First Claim
Patent Images

1. An Information Filtering (IF) system for retrieving relevant text from a database collection of documents comprising the steps of:

  • (a) defining an information interest as a natural language statement;

    (b) creating a synonym list from each substantive word in the natural language statement;

    (c) creating a domain list from the natural language statement;

    (d) combining the synonym lists and the domain lists into a filter window;

    (e) selecting a minimum threshold value for the filter window;

    (f) scanning a first document having a first total length of a database collection with the filter window in order to calculate both a first value and a second value, wherein the first value is the number of matches between words in the synonym lists and corresponding words in the first document, and the second value is the number of matches between words in the domain lists and corresponding words in the first document;

    (g) adding the first value to the second value to form a sum value, and dividing the sum value by the total length value of the first document to form a relevancy value for the first document; and

    (h) repeating steps (a) through (g) for subsequent documents from the database collection if the relevancy value of each subsequent document is less than the minimum threshold value.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×