Methodology for creating and maintaining a scheme for categorizing electronic communications
First Claim
1. A computer implemented method for categorizing incoming electronic communications using a supervised machine learning component, and for factoring an organization'"'"'s business domain into the technology domain to enable an acceptable automated response and routing scheme, said method comprising the steps of:
- (a) analyzing the business domain;
(b) determining an approach to machine learning using a program or an algorithm for inducing a categorizer using supervised learning, the categorizer being generated from training data comprising a set of examples of the type of electronic communications to be classified;
(c) collecting existing data of representative examples of electronic communications and inventories of personnel skills, business processes, workflows, and business missions;
(d) Analyzing the collected data for determining one or more attributes of said electronic communications selected from the group consisting of complexity, vagueness, and uniqueness to be expected in the type of communications to be categorized, as well as the relative numbers of electronic communications having a particular attribute of said one or more attributes, and for determining a technical structure of the communications relevant to categorization, and factoring the inventories of personnel skills, business processes, workflows, and business missions collected to determine what must be done with each electronic communication, and by whom;
(e) defining a categorization scheme;
(f) labeling examples of electronic communications with categories from the categorization scheme for use both as training data to be used in the supervised learning step and as test data;
(g) converting, using a computer, the labeled examples into a form suitable for subsequent processing, both for purposes of machine learning and technical validation;
(h) performing using said computer, machine based supervised learning technology to induce said categorizer for the categorization scheme; and
(i) validating the categorization scheme with respect to technical performance and business requirements.
1 Assignment
0 Petitions
Accused Products
Abstract
Supervised learning is used to develop a computer program that can automatically route or respond to electronic communications assuming the existence of an appropriate formal scheme for categorizing incoming electronic communications. A method is described by which such a categorization scheme for electronic communications can be constructed. The method is based on an analysis of factors having an impact on the categorization scheme from both the business domain and the technology domain. The problem solved by this method is a new one that is only now emerging as automated methods of routing communications based on supervised learning are becoming feasible. Among other uses, this method may be practically employed as a disciplined way of carrying out consulting engagements that call for setting up and maintaining categorization schemes for electronic communications.
48 Citations
9 Claims
-
1. A computer implemented method for categorizing incoming electronic communications using a supervised machine learning component, and for factoring an organization'"'"'s business domain into the technology domain to enable an acceptable automated response and routing scheme, said method comprising the steps of:
-
(a) analyzing the business domain;
(b) determining an approach to machine learning using a program or an algorithm for inducing a categorizer using supervised learning, the categorizer being generated from training data comprising a set of examples of the type of electronic communications to be classified;
(c) collecting existing data of representative examples of electronic communications and inventories of personnel skills, business processes, workflows, and business missions;
(d) Analyzing the collected data for determining one or more attributes of said electronic communications selected from the group consisting of complexity, vagueness, and uniqueness to be expected in the type of communications to be categorized, as well as the relative numbers of electronic communications having a particular attribute of said one or more attributes, and for determining a technical structure of the communications relevant to categorization, and factoring the inventories of personnel skills, business processes, workflows, and business missions collected to determine what must be done with each electronic communication, and by whom;
(e) defining a categorization scheme;
(f) labeling examples of electronic communications with categories from the categorization scheme for use both as training data to be used in the supervised learning step and as test data;
(g) converting, using a computer, the labeled examples into a form suitable for subsequent processing, both for purposes of machine learning and technical validation;
(h) performing using said computer, machine based supervised learning technology to induce said categorizer for the categorization scheme; and
(i) validating the categorization scheme with respect to technical performance and business requirements. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer-implemented method for categorizing incoming electronic communications using a supervised machine learning component, and for factoring an organization'"'"'s business domain the technology domain to enable an acceptable automated response and routing scheme, said method comprising the steps of:
-
selecting a machine learning component for the technology domain;
preparing a set of training data comprising representations of previously categorized electronic communications, wherein the data in an electronic communication is textual and each electronic communication has features, where a feature is related to textual data;
analyzing the organization'"'"'s business domain with respect to desired routing and handling of contemplated message categories of electronic communications, the analysis resulting in identification of tasks to be performed and actions to be taken in response to a received electronic communication of a contemplated message category, the analysis also resulting in identification of features relevant to categorization of electronic communications;
determining skill levels of personnel corresponding to required tasks and actions identified in the step of analyzing the organization'"'"'s business domain;
extracting, using a computer, a new representation of each electronic communication in the training set depending on a frequency of occurrence in the electronic communication of features identified as relevant to the business domain;
inducing a pattern characterization when an electronic communication belongs to a category, wherein the patterns are presented as rules or another format correspond the selected machine learning component; and
developing, using said computer, an initial categorization scheme based on areas of the business domain receiving a greater quantity of electronic communications or electronic communications of a relatively higher priority. - View Dependent Claims (9)
-
Specification