×

Classification of data records by comparison of records to a training database using probability weights

  • US 5,251,131 A
  • Filed: 07/31/1991
  • Issued: 10/05/1993
  • Est. Priority Date: 07/31/1991
  • Status: Expired due to Term
First Claim
Patent Images

1. A system for classifying natural language data, comprising:

  • means for storing a new record including a plurality of predictor data fields containing the natural language data expressed in natural language values,means for storing a plurality of training records,each training record includinga plurality of predictor data fields, each predictor data field containing a feature, wherein each feature is a natural language term, anda target data field containing a target value representing a classification of a training record, andprobability weight means for storing, for each feature, a probability weight value representing a probability that a new record will have the target value contained in the target data field if a feature contained in a corresponding predictor data field occurs in the new record,query means for extracting features from the new record and querying the training records with each feature extracted from the new record,the query means being responsive to a match between a feature extracted from the new record and a feature stored in said training record for providing the probability weight corresponding to the feature, andmetric means for receiving the probability weights from the query means and accumulating for each training record a comparison score representing the probability that said training record matches the new record, andproviding an output indicating said target field value of said training record as said target value of the new record.

View all claims
  • 11 Assignments
Timeline View
Assignment View
    ×
    ×