Data mining electronic communications
Abstract
User insights derived from communication data may be provided. Communications associated with a user may be collected and analyzed to derive insights about the user. The insights may then be provided to the user, for example by modifying an application functionality, creating a message processing rule, providing new information to the user, or updating an application display or user interface.
19 Claims
1. A method for providing electronic communication data mining, the method comprising:
collecting a plurality of communications associated with a user, wherein the plurality of communications comprise a plurality of data types comprising at least two of the following: an e-mail, an instant message (IM), a short message service (SMS) message, a voicemail, a search query, a public posting, a document authored by the user, a directory of a plurality of users, a contact list, a calendar appointment, and a task entry;
delivering the plurality of collected communications to at least one data analyzer module, the at least one data analyzer module comprising a plurality of weighted workflows, each of the plurality of weighted workflows comprising a different level of processing for generating insights and updating insights based on at least one of batch data, collected data and generated data, the plurality of weighted workflows comprising at least a medium weight workflow for updating existing insights based on the at least one of the batch data and a lightweight workflow for at least one of generating and updating the insights in real-time, the plurality of workflows being executed on a plurality of machines in a high availability cluster, the plurality of machines including at least one active node utilized by the lightweight workflow and one passive node utilized by the medium weight workflow, the medium weight workflow utilizing a resource intensive batch analysis that is executed using spare cycles of the passive node;
deriving, utilizing one or more of the plurality of weighted workflows, an insight about the user from the collected plurality of communications, the insight about the user being based on at least one of the following: a processor type used to derive the insight and a communication type for at least one of the collected communications from which the insight is derived; and
modifying at least one application functionality based on the derived insight.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12.
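As an illustration only, and not part of the claims, the split between the lightweight and medium weight workflows recited in claim 1 might be sketched in Python as follows. The names `Node` and `DataAnalyzer`, the method names, and the per-sender message count used as the "insight" are all assumptions introduced for this sketch; the claim does not specify any of them.

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import Callable, Dict, List

Communication = Dict[str, str]  # hypothetical shape, e.g. {"type": "email", "sender": "ann"}


@dataclass
class Node:
    """One machine in the cluster; role is 'active' or 'passive' (hypothetical model)."""
    name: str
    role: str
    executed: List[str] = field(default_factory=list)

    def run(self, workflow_name: str, job: Callable[[], None]) -> None:
        self.executed.append(workflow_name)  # record which workflow ran on this node
        job()


class DataAnalyzer:
    """Routes work to a lightweight (real-time) or medium weight (batch) workflow."""

    def __init__(self, active: Node, passive: Node) -> None:
        self.active = active
        self.passive = passive
        self.insights: Dict[str, int] = {}  # assumed insight: messages seen per sender

    def on_communication(self, comm: Communication) -> None:
        """Lightweight workflow: generate or update an insight in real time on the active node."""
        def job() -> None:
            sender = comm["sender"]
            self.insights[sender] = self.insights.get(sender, 0) + 1
        self.active.run("lightweight", job)

    def on_batch(self, batch: List[Communication]) -> None:
        """Medium weight workflow: update existing insights from batch data on the passive node."""
        def job() -> None:
            counts = Counter(comm["sender"] for comm in batch)
            for sender in self.insights:  # only existing insights are updated
                if sender in counts:
                    self.insights[sender] = counts[sender]
        self.passive.run("medium", job)


analyzer = DataAnalyzer(Node("node-a", "active"), Node("node-b", "passive"))
analyzer.on_communication({"type": "email", "sender": "ann"})
analyzer.on_communication({"type": "sms", "sender": "ann"})
analyzer.on_batch([{"type": "email", "sender": "ann"}] * 3)
```

In this sketch the real-time path never touches the passive node, so the batch recomputation can use that node's otherwise idle capacity; how spare cycles are actually detected or scheduled is left out.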
13. A tangible computer-readable device which stores a set of instructions which when executed performs a method for providing communication data mining, the method executed by the set of instructions comprising:
analyzing a plurality of communications associated with a user utilizing at least one data analyzer module, the at least one data analyzer module comprising a plurality of weighted workflows, each of the plurality of weighted workflows comprising a different level of processing for generating insights and updating insights based on at least one of batch data, collected data and generated data, the plurality of weighted workflows comprising at least a medium weight workflow for updating existing insights based on the at least one of the batch data and a lightweight workflow for at least one of generating and updating the insights in real-time, the plurality of workflows being executed on a plurality of machines in a high availability cluster, the plurality of machines including at least one active node utilized by the lightweight workflow and one passive node utilized by the medium weight workflow, the medium weight workflow utilizing a resource intensive batch analysis that is executed using spare cycles of the passive node, wherein the plurality of communications comprise a plurality of data types comprising at least two of the following: an e-mail, an instant message (IM), a short message service (SMS) message, a voicemail, a search query, a public posting, a document authored by the user, a directory of a plurality of users, a contact list, a calendar appointment, and a task entry;
deriving an insight based on the analyzed plurality of communications; and
providing the insight to the user when the insight is based on at least one of the following: a processor type used to derive the insight and a communication type for at least one of the collected communications from which the insight is derived, wherein providing the insight to the user comprises at least one of the following: creating a message processing rule, providing at least one new piece of information to the user, updating an application display, and adding at least one functionality to an application.
Dependent claims: 14, 15, 16, 17, 18.
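To make the "deriving an insight" and "creating a message processing rule" steps of claim 13 more concrete, here is a minimal Python sketch. It is an assumption-laden illustration, not the claimed method: the function names `derive_frequent_contacts` and `to_message_rule`, the dictionary shapes, the thresholds, and the `mark_important` action are all hypothetical.

```python
from collections import defaultdict
from typing import Dict, List

Communication = Dict[str, str]  # hypothetical shape, e.g. {"type": "email", "sender": "ann"}


def derive_frequent_contacts(
    comms: List[Communication],
    min_messages: int = 3,  # assumed cutoff for "frequent"
    min_types: int = 2,     # require evidence from at least two data types
) -> List[Dict]:
    """Derive a 'relationship' insight for senders seen often across several data types."""
    counts: Dict[str, int] = defaultdict(int)
    types: Dict[str, set] = defaultdict(set)
    for comm in comms:
        counts[comm["sender"]] += 1
        types[comm["sender"]].add(comm["type"])
    return [
        {"kind": "relationship", "contact": sender, "types": sorted(types[sender])}
        for sender in sorted(counts)
        if counts[sender] >= min_messages and len(types[sender]) >= min_types
    ]


def to_message_rule(insight: Dict) -> Dict:
    """Provide the insight to the user in the form of a message processing rule."""
    return {"if_sender": insight["contact"], "action": "mark_important"}


communications = [
    {"type": "email", "sender": "ann"},
    {"type": "im", "sender": "ann"},
    {"type": "email", "sender": "ann"},
    {"type": "email", "sender": "bob"},
    {"type": "email", "sender": "bob"},
    {"type": "email", "sender": "bob"},
]
insights = derive_frequent_contacts(communications)
rules = [to_message_rule(insight) for insight in insights]
```

Here only "ann" yields an insight, because "bob" appears in a single data type; a rule is then generated from that one insight.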
19. A system for providing user insights through data mining, the system comprising:
a memory storage; and
a processing unit coupled to the memory storage, wherein the processing unit is operative to:
collect a plurality of data elements from a plurality of sources associated with a user, wherein the plurality of sources comprises at least two of the following: a document, an e-mail, an SMS message, a voicemail, a search query, a web submission, a forum posting, a blog posting, a calendar item, a to-do list entry, an instant message, and a directory;
cleanse at least one data element of the plurality of data elements, wherein being operative to cleanse the at least one data element comprises at least one of the following: being operative to perform a spell check on the at least one data element, validate the at least one data element, normalize the at least one data element, correct the at least one data element, and remove at least one extraneous character from the at least one data element;
extract at least one piece of key information from the at least one data element, wherein the at least one piece of key information comprises at least one of the following: a sender, a recipient, a time, a subject, a contact method, a related communication, and a storage location;
analyze the at least one piece of key information utilizing at least one data analyzer module comprising a plurality of weighted workflows, each of the plurality of weighted workflows comprising a different level of processing for generating insights and updating insights based on at least one of batch data, collected data and generated data, the plurality of weighted workflows comprising at least a medium weight workflow for updating existing insights based on the at least one of the batch data and a lightweight workflow for at least one of generating and updating the insights in real-time, the plurality of workflows being executed on a plurality of machines in a high availability cluster, the plurality of machines including at least one active node utilized by the lightweight workflow and one passive node utilized by the medium weight workflow, the medium weight workflow utilizing a resource intensive batch analysis that is executed using spare cycles of the passive node, the analyzed at least one piece of key information being utilized to derive an insight associated with the user, wherein the derived insight comprises at least one of the following: a behavior, a pattern, a relationship, an interest, a preference, a demographic factor, and an expertise;
assign a confidence probability to the insight;
determine whether the assigned confidence probability of the insight meets a threshold level to provide the insight to the user, the threshold level to provide the insight being based on at least one of the following: a processor type used to derive the insight and a communication type for at least one of the collected communications from which the insight is derived;
in response to determining that the assigned confidence probability of the insight is high enough to provide the insight to the user, provide the insight to the user, wherein being operative to provide the insight to the user comprises at least one of the following: creating a message processing rule, providing at least one new piece of information to the user, updating an application display, and adding at least one functionality to an application;
store the insight in a location accessible by the user from a plurality of clients; and
receive at least one modification to the insight from the user, wherein the at least one modification comprises at least one of the following: a feedback rating, a disabling of a use of at least one of the plurality of data elements, an enabling of a use of at least one of the plurality of data elements, a disabling of the insight, an enabling of the insight, a weighting of at least one of the plurality of data elements, and a modification of the confidence probability.
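The cleanse, threshold, and user-modification steps of claim 19 can be illustrated with a short Python sketch. Everything concrete here is an assumption made for illustration: the threshold values, the processor-type labels `"realtime"` and `"batch"`, the `Insight` fields, and the modification format are hypothetical and do not come from the claim.

```python
import re
from dataclasses import dataclass
from typing import Dict, Tuple


def cleanse(text: str) -> str:
    """Remove extraneous control characters and normalize whitespace."""
    text = re.sub(r"[\x00-\x1f\x7f]", " ", text)
    return re.sub(r"\s+", " ", text).strip()


# Assumed thresholds keyed by (processor type, communication type).
THRESHOLDS: Dict[Tuple[str, str], float] = {
    ("batch", "email"): 0.6,
    ("realtime", "sms"): 0.8,
}
DEFAULT_THRESHOLD = 0.7  # assumed fallback when no specific pair is configured


@dataclass
class Insight:
    description: str
    confidence: float        # the assigned confidence probability
    processor_type: str      # which kind of processing derived the insight
    communication_type: str  # which kind of communication it was derived from
    enabled: bool = True


def threshold_for(processor_type: str, communication_type: str) -> float:
    return THRESHOLDS.get((processor_type, communication_type), DEFAULT_THRESHOLD)


def should_provide(insight: Insight) -> bool:
    """Provide the insight only if it is enabled and its confidence meets the threshold."""
    needed = threshold_for(insight.processor_type, insight.communication_type)
    return insight.enabled and insight.confidence >= needed


def apply_modification(insight: Insight, modification: Dict) -> None:
    """Apply a user modification: enable, disable, or change the confidence probability."""
    kind = modification["kind"]
    if kind == "disable_insight":
        insight.enabled = False
    elif kind == "enable_insight":
        insight.enabled = True
    elif kind == "set_confidence":
        insight.confidence = float(modification["value"])
    else:
        raise ValueError(f"unsupported modification: {kind}")


subject = cleanse("  Lunch\t\x00 on  Friday\n")
insight = Insight("prefers lunch meetings", 0.75, "realtime", "sms")
provided_before = should_provide(insight)  # 0.75 is below the assumed 0.8 threshold
apply_modification(insight, {"kind": "set_confidence", "value": 0.9})
provided_after = should_provide(insight)
```

Keying the threshold on both the processor type and the communication type lets a fast real-time derivation from a short SMS be held to a stricter bar than a slower batch derivation from e-mail, which is one plausible reading of why the claim ties the threshold to those two factors.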
Specification