×

Method for generating descriptors for the classification of texts

  • US 6,038,527 A
  • Filed: 03/14/1997
  • Issued: 03/14/2000
  • Est. Priority Date: 07/19/1995
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method of generating descriptors for natural language texts, using a plurality of training texts having a plurality of words, comprising the steps of:

  • extracting words from a text during a training phase on the basis of the training texts;

    predetermining a minimum structure of said descriptors;

    breaking down words in the text into shorter word segments, wherein each shorter word segment within a longer word segment must meet said minimum structure for said breaking down to be permitted; and

    matching said word segments that remain in the text against each other to generate a list of descriptors.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×