×

Method and system for searching for relevant documents from a text database collection, using statistical ranking, relevancy feedback and small pieces of text

  • US 5,642,502 A
  • Filed: 12/06/1994
  • Issued: 06/24/1997
  • Est. Priority Date: 12/06/1994
  • Status: Expired due to Term
First Claim
Patent Images

1. A method for retrieving relevant text data from a text database collection in a computer without annotating, parsing or pruning the text database collection, comprising the steps of:

  • (a) searching a text database collection in a computer using a first search query of natural language to retrieve a first group of selected small pieces of text, where each of the selected small pieces of text corresponds to a document;

    (b) weighting each word of the selected small pieces of text with semantics to form document weighted values for each of the selected small pieces of text in the first group;

    (c) weighting each word in the first search query with semantics to form query weighted values;

    (d) combining the query weighted values and the document weighted values to form similarity values for each of the selected small pieces of text;

    (e) ranking the similarity values for each of the selected small pieces of text to form a first ranked list;

    (f) applying feedback information based on a manual determination of the relevancy of each of the selected small pieces of text in the first ranked list to automatically create a second search query;

    (g) repeating steps (a) to (e) to form a second ranked list, wherein the second ranked list includes a second group of selected small pieces of text, wherein the second group is missing at least one of the selected small pieces of text in the first group.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×