Method and apparatus for establishing topic word classes based on an entropy cost function to retrieve documents represented by the topic words

  • US 6,128,613 A
  • Filed: 04/29/1998
  • Issued: 10/03/2000
  • Est. Priority Date: 06/26/1997
  • Status: Expired due to Term
First Claim

1. In a computer, a method for establishing topic words to represent a document wherein the topic words are suitable for inclusion in a computer database index structure, the method comprising the steps of:

    accepting at least a portion of the document from a data input device, wherein the portion of the document includes words;

    determining a plurality of document keywords from the portion of the document;

    classifying each of the document keywords into one of a plurality of preestablished keyword classes; and

    selecting words as the topic words, each said selected word from a different one of the preestablished keyword classes, to minimize an entropy-based cost function on proposed topic words.
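
The four steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. The claim does not define the cost function, the keyword classes, or how keywords are determined, so everything concrete here is an assumption made for illustration: the `KEYWORD_CLASSES` table, the function names, and a cost equal to the summed self-information, -log2 p(w), of the proposed topic words, where p(w) is the word's relative frequency among the document keywords.

```python
import itertools
import math
import re
from collections import Counter

# Hypothetical preestablished keyword classes (illustrative, not from the patent).
KEYWORD_CLASSES = {
    "entropy": "math", "probability": "math",
    "index": "retrieval", "database": "retrieval", "document": "retrieval",
}

def determine_keywords(text):
    """Step 2: keep the words of the text that belong to a known keyword class."""
    words = re.findall(r"[a-z]+", text.lower())
    return [w for w in words if w in KEYWORD_CLASSES]

def classify(keywords):
    """Step 3: group the distinct keywords by their preestablished class."""
    by_class = {}
    for word in set(keywords):
        by_class.setdefault(KEYWORD_CLASSES[word], []).append(word)
    return by_class

def cost(candidate, counts, total):
    """Assumed cost: summed self-information of the proposed topic words."""
    return sum(-math.log2(counts[w] / total) for w in candidate)

def select_topic_words(text):
    """Step 4: pick one word per class so that the assumed cost is minimal."""
    keywords = determine_keywords(text)
    by_class = classify(keywords)
    counts = Counter(keywords)
    total = len(keywords)
    class_names = sorted(by_class)
    # Enumerate every proposal that takes exactly one word from each class.
    candidates = itertools.product(*(sorted(by_class[c]) for c in class_names))
    best = min(candidates, key=lambda cand: cost(cand, counts, total))
    return dict(zip(class_names, best))

text = ("The database index stores document entropy; the document index "
        "uses entropy and probability. document")
print(select_topic_words(text))
```

Because the assumed cost is additive, the exhaustive search reduces to choosing the most frequent keyword in each class. The enumeration is kept so that a cost function that couples the chosen words could be substituted without changing the rest of the sketch.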
