×

Computer-based system for classifying documents into a hierarchy and linking the classifications to the hierarchy

  • US 5,794,236 A
  • Filed: 05/29/1996
  • Issued: 08/11/1998
  • Est. Priority Date: 05/29/1996
  • Status: Expired due to Term
First Claim
Patent Images

1. A computer system for classifying electronic text according to multiple classifications arranged in a hierarchy, comprising:

  • a memory for storing and retrieving electronic text;

    identification means for identifying embedded citations contained in the electronic text;

    means for stripping embedded citations identified by said identification means and storing them in memory;

    matching means for comparing stripped citations to stored citations associated with at least one classification in the hierarchy, and for identifying stripped citations which match at least one stored citation;

    scoring means for assigning scores to the matching citations identified by said matching means, based on heuristic rules;

    calculating means for calculating a classification score for each classification associated with the stored citations which match the matching citations identified by said matching means, based on the scores assigned to the matching citations and the heuristic rules;

    comparison means for comparing each classification score with a threshold value;

    classification means for classifying the electronic text within the hierarchy based on the comparison of the classification score with the threshold value; and

    association means for associating the electronic text with stored classification identifying strings to produce a classified electronic text.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×