Example-based item classification
First Claim
1. A method of operating a computerized device according to an electronic mail application program to selectively classify items of a set of electronic mail messages and display classified electronic mail messages to a user, comprising:
- receiving a user selection of an example subset of the set of electronic mail messages;
performing an analysis on the example subset to find one or more shared text-based features that are shared across all items of the example subset, and based on the analysis generating a candidate classification rule identifying the shared text-based features, the analysis including multiple steps performed in sequence for respective distinct fields of the electronic mail messages, wherein in a first pass the analysis stops at a first step at which text content is found that is shared across all items of the example subset, and wherein the analysis for a next iteration continues past the first step to a second step at which text content is found that is shared across all items of the example subset;
applying the candidate classification rule to the set of electronic mail messages to identify a resultant subset of the electronic mail messages satisfying the candidate classification rule, the resultant subset being a superset of the example subset;
displaying the resultant subset to the user and receiving user input indicating, based on user review of the resultant subset, whether the candidate classification rule is accepted;
if the user input indicates that the candidate classification rule is accepted, then finalizing the candidate classification rule into a final classification rule, and otherwise repeating the above steps one or more times for respective adjusted example subsets of the set of electronic mail messages until a respective adjusted candidate classification rule is accepted and finalized into the final classification rule, the adjusted example subset for a given repetition being formed by the user identifying a false match and/or a false non-match with respect to the resultant subset identified in the preceding repetition; and
automatically applying the final classification rule in subsequent operation of the electronic mail application program to identify electronic mail messages satisfying the final classification rule and to display the identified electronic mail messages to the user in a manner reflecting their identification as satisfying the final classification rule.
14 Assignments
0 Petitions
Accused Products
Abstract
Item classification rules are created based on examples selected by a user, such as by selecting a subset of emails, and the rule is used across a larger set of items to obtain automatic classification of similar items according to the rule. Based on an analysis, a candidate classification rule is generated identifying text-based features shared among the items of the subset. The user can review the candidate rule as well as a resultant subset of items generated by the rule, and either accept the candidate rule or make an adjustment to the examples and then perform one or more iterations of the analysis to refine the rule. Adjustments can be made by removing items incorrectly included in a resultant subset and/or adding items incorrectly excluded from a resultant subset, and using the adjusted subset in a next iteration.
5 Citations
20 Claims
-
1. A method of operating a computerized device according to an electronic mail application program to selectively classify items of a set of electronic mail messages and display classified electronic mail messages to a user, comprising:
-
receiving a user selection of an example subset of the set of electronic mail messages; performing an analysis on the example subset to find one or more shared text-based features that are shared across all items of the example subset, and based on the analysis generating a candidate classification rule identifying the shared text-based features, the analysis including multiple steps performed in sequence for respective distinct fields of the electronic mail messages, wherein in a first pass the analysis stops at a first step at which text content is found that is shared across all items of the example subset, and wherein the analysis for a next iteration continues past the first step to a second step at which text content is found that is shared across all items of the example subset; applying the candidate classification rule to the set of electronic mail messages to identify a resultant subset of the electronic mail messages satisfying the candidate classification rule, the resultant subset being a superset of the example subset; displaying the resultant subset to the user and receiving user input indicating, based on user review of the resultant subset, whether the candidate classification rule is accepted; if the user input indicates that the candidate classification rule is accepted, then finalizing the candidate classification rule into a final classification rule, and otherwise repeating the above steps one or more times for respective adjusted example subsets of the set of electronic mail messages until a respective adjusted candidate classification rule is accepted and finalized into the final classification rule, the adjusted example subset for a given repetition being formed by the user identifying a false match and/or a false non-match with respect to the resultant subset identified in the preceding repetition; and automatically applying the final classification rule in subsequent operation of the electronic mail application program to identify electronic mail messages satisfying the final classification rule and to display the identified electronic mail messages to the user in a manner reflecting their identification as satisfying the final classification rule. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 19, 20)
-
-
11. A non-transitory computer-readable medium storing computer program instructions of an electronic mail application program, the instructions being executable by a computer to cause the computer to selectively classify items of a set of electronic mail messages and display classified electronic mail messages to a user, the method including:
-
receiving a user selection of an example subset of the set of electronic mail messages; performing an analysis on the example subset to find one or more shared text-based features that are shared across all items of the example subset, and based on the analysis generating a candidate classification rule identifying the shared text-based features, the analysis including multiple steps performed in sequence for respective distinct fields of the electronic mail messages, wherein in a first pass the analysis stops at a first step at which text content is found that is shared across all items of the example subset, and wherein the analysis for a next iteration continues past the first step to a second step at which text content is found that is shared across all items of the example subset; applying the candidate classification rule to the set of electronic mail messages to identify a resultant subset of the electronic mail messages satisfying the candidate classification rule, the resultant subset being a superset of the example subset; displaying the resultant subset to the user and receiving user input indicating, based on user review of the resultant subset, whether the candidate classification rule is accepted; if the user input indicates that the candidate classification rule is accepted, then finalizing the candidate classification rule into a final classification rule, and otherwise repeating the above steps one or more times for respective adjusted example subsets of the set of electronic mail messages until a respective adjusted candidate classification rule is accepted and finalized into the final classification rule, the adjusted example subset for a given repetition being formed by the user identifying a false match and/or a false non-match with respect to the resultant subset identified in the preceding repetition; and automatically applying the final classification rule in subsequent operation of the electronic mail application program to identify electronic mail messages satisfying the final classification rule and to display the identified electronic mail messages to the user in a manner reflecting their identification as satisfying the final classification rule. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18)
-
Specification