Systems, methods, and software for classifying text from judicial opinions and other documents
Abstract
To reduce cost and improve accuracy, the inventors devised systems, methods, and software to aid classification of text, such as headnotes and other documents, to target classes in a target classification system. For example, one system computes composite scores based on: similarity of input text to text assigned to each of the target classes; similarity of non-target classes assigned to the input text and target classes; probability of a target class given a set of one or more non-target classes assigned to the input text; and/or probability of the input text given text assigned to the target classes. The exemplary system then evaluates the composite scores using class-specific decision criteria, such as thresholds, ultimately assigning or recommending assignment of the input text to one or more of the target classes. The exemplary system is particularly suitable for classification systems having thousands of classes.
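The abstract names "similarity of input text to text assigned to each of the target classes" as one ingredient of the composite score but does not say here how that similarity is measured. As a minimal sketch, assuming a bag-of-words cosine similarity (an assumption for illustration, not necessarily the inventors' measure), such a score might be computed as follows. The names `term_counts`, `cosine_similarity`, and `text_similarity_scores` are hypothetical.

```python
import math
import re
from collections import Counter


def term_counts(text: str) -> Counter:
    """Lowercase the text and count alphabetic tokens (a deliberately crude tokenizer)."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine of the angle between two term-count vectors; 0.0 if either is empty."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def text_similarity_scores(input_text: str, class_texts: dict[str, list[str]]) -> dict[str, float]:
    """Score the input text against the pooled text already assigned to each target class.

    Assumption: each class is represented by simply concatenating its assigned texts.
    """
    query = term_counts(input_text)
    return {
        label: cosine_similarity(query, term_counts(" ".join(texts)))
        for label, texts in class_texts.items()
    }
```

A score of this kind would be only one of the inputs the abstract lists; the others (class-to-class similarity and the two probabilities) would be computed separately and combined afterwards.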
17 Claims
1. An automated method of classifying input text to a target classification system having two or more target classes, the method comprising:
for each target class:
providing at least first and second class-specific weights and a class-specific decision threshold;
using at least first and second classification methods to determine respective first and second scores based on the input text and the target class;
determining a composite score based on the first score scaled by the first class-specific weight for the target class and the second score scaled by the second class-specific weight for the target class; and
classifying or recommending classification of the input text to the target class based on the composite score and the class-specific decision threshold.
8. A machine-readable medium comprising instructions for performing an automated method of classifying input text to a target classification system having two or more target classes, the medium including instructions that for each target class can cause:
providing at least first and second class-specific weights and a class-specific decision threshold;
using at least first and second classification methods to determine respective first and second scores based on the input text and the target class;
determining a composite score based on the first score scaled by the first class-specific weight for the target class and the second score scaled by the second class-specific weight for the target class; and
classifying or recommending classification of the input text to the target class based on the composite score and the class-specific decision threshold.
9. An automated method of classifying input text to a target classification system having two or more target classes, the method comprising:
for each target class:
determining first and second scores based on the input text and the target class and respective first and second classification methods;
determining a composite score based on the first score scaled by a first class-specific weight for the target class and the second score scaled by a second class-specific weight for the target class; and
determining whether to identify the input text for classification to the target class based on the composite score and a class-specific decision threshold for the target class.
16. A machine-readable medium comprising instructions for performing an automated method of classifying input text to a target classification system having two or more target classes, the medium including instructions for:
determining for each target class first and second scores based on the input text and the target class and respective first and second classification methods;
determining for each target class a composite score based on the first score scaled by a first class-specific weight for the target class and the second score scaled by a second class-specific weight for the target class; and
determining for each target class whether to identify the input text for classification to the target class based on the composite score and a class-specific decision threshold for the target class.
17. A system for classifying input text to a target classification system having two or more target classes, the system comprising:
means for determining for each of the target classes at least first and second scores based on the input text and the target class and respective first and second classification methods;
means for determining for each of the target classes a corresponding composite score based on the first score scaled by a first class-specific weight for the target class and the second score scaled by a second class-specific weight for the target class; and
means for determining for each of the target classes whether to classify or recommend classification of the input text to the target class based on the corresponding composite score and a class-specific decision threshold for the target class.
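The independent claims share one per-class procedure: compute two scores, scale each by a class-specific weight, form a composite score, and compare it with a class-specific decision threshold. The following is a minimal sketch of that procedure, not the claimed implementation. The claims say only that the composite is "based on" the scaled scores, so the weighted sum and the `>=` comparison used here are assumptions, as are the names `ClassParams`, `composite_score`, and `recommend_classes`.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ClassParams:
    """Class-specific parameters: one weight per classification method, plus a threshold."""
    weight_1: float
    weight_2: float
    threshold: float


def composite_score(score_1: float, score_2: float, params: ClassParams) -> float:
    """Combine two method scores, each scaled by its class-specific weight.

    Assumption: the scaled scores are summed; the claims do not fix the combining rule.
    """
    return params.weight_1 * score_1 + params.weight_2 * score_2


def recommend_classes(
    scores: dict[str, tuple[float, float]],
    params: dict[str, ClassParams],
) -> list[str]:
    """Return the target classes whose composite score meets that class's own threshold."""
    recommended = []
    for label, (score_1, score_2) in scores.items():
        class_params = params[label]
        if composite_score(score_1, score_2, class_params) >= class_params.threshold:
            recommended.append(label)
    return recommended


# Hypothetical target classes and values, chosen only to exercise the logic.
example_params = {
    "class_a": ClassParams(weight_1=0.5, weight_2=0.5, threshold=0.5),
    "class_b": ClassParams(weight_1=1.0, weight_2=0.0, threshold=0.5),
}
example_scores = {
    "class_a": (0.75, 0.5),   # composite 0.625, meets the 0.5 threshold
    "class_b": (0.25, 1.0),   # composite 0.25, falls below the 0.5 threshold
}
example_result = recommend_classes(example_scores, example_params)
```

Because the weights and the threshold are stored per class, a method that is reliable for one class can dominate that class's composite score while being discounted for another, which is the point of making both parameters class-specific.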
Specification