Determining document classification probabilistically through classification rule analysis
First Claim
Patent Images
1. A method executed on a computing device for determining document classification probabilistically through classification rule analysis, the method comprising:
- identifying patterns and evidences within content of representative documents;
constructing a classification rule based on an entity determined according to an analysis of the patterns and an affinity determined according to an analysis of the evidences;
processing the content with the classification rule to;
determine an entity count and an entity confidence level for the entity;
determine an affinity presence and an affinity confidence level for the affinity;
aggregate the entity count, the entity confidence level, the affinity presence, andthe affinity confidence level to returned results;
comparing the returned results to expected results to evaluate the classification rule against acceptance requirements; and
if the classification rule meets the acceptance requirements, identifying confidence levels for the patterns and the evidences;
elseediting the classification rule.
2 Assignments
0 Petitions
Accused Products
Abstract
A classification application identifies patterns and evidences within representative documents. The application constructs a classification rule according to an entity and an affinity determined from the patterns and evidences. The application processes the representative documents with the classification rule to evaluate whether the rules meet acceptance requirements. Subsequent to a successful evaluation, the application identifies confidence levels for patterns and evidences within other documents.
-
Citations
18 Claims
-
1. A method executed on a computing device for determining document classification probabilistically through classification rule analysis, the method comprising:
-
identifying patterns and evidences within content of representative documents; constructing a classification rule based on an entity determined according to an analysis of the patterns and an affinity determined according to an analysis of the evidences; processing the content with the classification rule to; determine an entity count and an entity confidence level for the entity; determine an affinity presence and an affinity confidence level for the affinity; aggregate the entity count, the entity confidence level, the affinity presence, and the affinity confidence level to returned results; comparing the returned results to expected results to evaluate the classification rule against acceptance requirements; and if the classification rule meets the acceptance requirements, identifying confidence levels for the patterns and the evidences;
elseediting the classification rule. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computing device for determining document classification probabilistically through classification rule analysis, the computing device comprising:
-
a memory configured to store instructions; and a processor coupled to the memory, the processor executing an application in conjunction with the instructions stored in the memory, wherein the application is configured to; identify patterns and evidences within content of representative documents; construct a classification rule based on an entity determined according to an analysis of the patterns and an affinity determined according to an analysis of the evidences; process the content with the classification rule to; determine an entity count and an entity confidence level for the entity; determine an affinity presence and an affinity confidence level for the affinity; aggregate the entity count, the entity confidence level, the affinity presence, and the affinity confidence level to returned results; compare the returned results to expected results to evaluate the classification rule against acceptance requirements; and if the classification rule meets the acceptance requirements, identify confidence levels for the patterns and the evidences;
elseedit the classification rule. - View Dependent Claims (11, 12, 13, 14)
-
-
15. A method executed on a computing device to determine document classification probabilistically through classification rule analysis, the method comprising:
-
identifying patterns and evidences within content of representative documents; constructing a classification rule based on an entity determined according to an analysis of the patterns and an affinity determined according to an analysis of the evidences; processing the content with the classification rule to collect returned results by; determining an entity count and an entity confidence level for the entity; determining an affinity presence and an affinity confidence level for the affinity; aggregating the entity count, the entity confidence level, the affinity presence, and the affinity confidence level to returned results; comparing the returned results to expected results to evaluate the classification rule against acceptance requirements by; accepting the entity subsequent to determining an entity environment value equal or greater than a predetermined entity precision value and an entity recall value equal or greater than a predetermined entity recall value; accepting the affinity subsequent to determining an affinity environment value equal or greater than a predetermined affinity precision value and an affinity recall value equal or greater than a predetermined affinity recall value; if the classification rule meets the acceptance requirements, identifying confidence levels for the patterns and the evidences;
elseediting the classification rule. - View Dependent Claims (16, 17, 18)
-
Specification