×

Method and apparatus for text classification

  • US 5,371,807 A
  • Filed: 03/20/1992
  • Issued: 12/06/1994
  • Est. Priority Date: 03/20/1992
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for classifying natural language text input into a computer system, the system includes memory having a domain specific knowledge base having a plurality of categories stored therein, the method comprising the steps of:

  • (a) accepting as input natural language input text;

    (b) parsing the natural language input text into a first list of recognized keywords;

    (c) using the first list to deduce further facts from the natural language input text;

    (d) compiling the deduced facts into a second list;

    (e) calculating a numeric similarity score for each one of the plurality of categories in the knowledge base to indicate how similar one of the plurality of categories is to the natural language input text;

    (f) applying a dynamic threshold to determine which ones of the plurality of categories are most similar to the recognized keywords of the first list, comprising the sub-steps of;

    (I) calculating a value for the dynamic threshold based upon a similarity score of a most similar category and a predefined threshold offset, and(II) classifying the categories based upon their respective similarity scores by discarding categories whose similarity scores are below the threshold value;

    (g) compiling the ones of the plurality of categories determined to be most similar in step (f) into a third list; and

    (i) passing the first list, the second list and the third list to an external application.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×