Filter for modeling system and method for handling and routing of text-based asynchronous communications
DCFirst Claim
1. A system for filtering and modeling electronic text messages comprising:
- a message reception means for receiving an electronic text message into the system, said text message having a header and a body, said body containing a natural language text message from an author;
a feature extraction means for performing natural language analysis of the text message from the message reception means, said feature extraction means producing one or more output signals relating to any of keyword frequencies, word co-occurrence statistics, a dimensionally-reduced representation of the keyword frequencies, phoneme frequencies, structural pattern statistics for any of sentences, paragraphs and pages, estimated education level of the author, and customer type;
a clustering means receiving said output signals from said feature extraction means, said clustering means producing a set of assigned properties based upon the content of the body of the electronic message, said assigned properties including at least one of attitude, one or more issues presented, one or more requests, an author type, and an author'"'"'s education level; and
a learning process which receives said assigned properties and performs relevance ranking and query by example, and which is capable of learning changes to said assigned properties submitted via a user interface such that rules and thresholds used in said feature extraction means and/or clustering means are updated automatically in real time without operator intervention.
3 Assignments
Litigations
0 Petitions
Accused Products
Abstract
A three phase process and system is disclosed for automatically and adaptively filtering and classifying electronic text-based messages, such as e-mail, e-commerce transactions, CGI forms, and optically scanned and textualized written and facsimile messages. In the first phase of processing, the message is subjected to one or more feature extraction methodologies. The output signals from the first phase are then clustered in the second phase of processing using one or more clustering methodologies. The second phase yields a suggested five characteristics of the message: attitude, issue or problem, request, customer type, and author education level. In the third phase, a human operator interface presents the original message along with the proposed properties and allows an operator to correct or tune the properties, and corrections and tuning being fed back into the network of a feature extraction and clustering methodologies. Finally, the architecture of the system is such that feature extraction and clustering methodologies may be added, updated, or removed in a module fashion to allow the system to be customized to various applications and to allow the system to be modernized as new algorithms become available.
159 Citations
49 Claims
-
1. A system for filtering and modeling electronic text messages comprising:
-
a message reception means for receiving an electronic text message into the system, said text message having a header and a body, said body containing a natural language text message from an author;
a feature extraction means for performing natural language analysis of the text message from the message reception means, said feature extraction means producing one or more output signals relating to any of keyword frequencies, word co-occurrence statistics, a dimensionally-reduced representation of the keyword frequencies, phoneme frequencies, structural pattern statistics for any of sentences, paragraphs and pages, estimated education level of the author, and customer type;
a clustering means receiving said output signals from said feature extraction means, said clustering means producing a set of assigned properties based upon the content of the body of the electronic message, said assigned properties including at least one of attitude, one or more issues presented, one or more requests, an author type, and an author'"'"'s education level; and
a learning process which receives said assigned properties and performs relevance ranking and query by example, and which is capable of learning changes to said assigned properties submitted via a user interface such that rules and thresholds used in said feature extraction means and/or clustering means are updated automatically in real time without operator intervention. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35)
a tagged message reception means for receiving said tagged messages from said clustering means;
a network update means which is capable of modifying parameters, thresholds, and coefficients within said feature extraction means and within said clustering means; and
a user interface means for presenting the received electronic text message and said message tag set, receiving operator input modifying said message tag set, and providing network updates to the system via said network update means.
-
-
36. A process for filtering and modeling electronic text messages of asynchronous communications systems comprising the steps of:
-
receiving an electronic text-based message via a reception media, said text message having a header and a body, said body containing a natural language text message from an author;
performing feature extraction by performing natural language analysis of the text message to produce one or more output signals relating to any of keyword frequencies, word co-occurrence statistics, a dimensionally-reduced representation of the keyword frequencies, phoneme frequencies, structural pattern statistics for any of sentences, paragraphs, and pages, estimated education level of the author, and customer type;
performing clustering according to said feature extraction output signals to produce a set of assigned properties based upon the content of the body of the electronic message, said assigned properties including an attitude, one or more issues presented, on or more requests, an author type, and an author'"'"'s education level; and
performing a learning process by receiving said assigned properties, executing relevance ranking and query by example, and learning changes to said assigned properties submitted via a user interface such that rules and thresholds used in said feature extraction means and/or clustering means are updated automatically in real time without operator intervention. - View Dependent Claims (37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48)
presentation of the electronic text message and the message tags to a user via a user interface;
receiving corrections to said message tags via said user interface from said user; and
automatically modifying logic within said determination of inherent factor within said text message.
-
-
49. A computer-readable medium containing a data structure for storing property tags for electronic text-based messages comprising.
an identifier link to a received electronic text-based message, an entry for an author'"'"'s apparent attitude; -
an entry for an issue raised by the message, an entry for a request made in the message;
an entry for a demographic profile indication for the author; and
an entry for an estimated education level of the author.
-
Specification