Method and apparatus for extracting journey of life attributes of a user from user interactions
First Claim
Patent Images
1. A computer implemented method for performing analysis on data generated by user interactions, comprising:
- providing a processor executing instructions for receiving text information from at least one interaction between a user and an agent;
said processor text mining said interaction information to extract personal information relating to said user automatically;
said processor using a topic model to extract lines from said text information to reduce a number of dimensions required to represent the text, wherein all information of interest is highly pronounced, and wherein a resulting lower dimensional representation of the text allows significantly faster computations;
said processor extracting said lines of text as anchored text lines that are indicative of personal information that is present in the anchored text lines;
said processor identifying said anchored text lines by checking for specific keywords which are present when said user is mentioning personal information during said interaction;
said processor applying a statistical technique to said anchored text lines to discover information present in the anchored text lines;
said processor using a k nearest neighbor algorithm to discover said information present in the anchored text lines;
said processor representing said text in a topic space with a score along each axis to indicate an extent to which said text contains personal information about said user;
upon receiving a new data point, which optionally comprises an anchored text line from said text, said processor picking the k closest points to said new data point, determining a predominant class among classes in the k closest points, and assigning said predominant class to said new data point;
said processor assigning a user corresponding to said text to at least one class based on said identified personal information; and
said processor classifying said user based upon said extracted lines of text.
2 Assignments
0 Petitions
Accused Products
Abstract
Embodiments of the invention relate to managing user interactions and, more particularly, to performing analysis on data generated by user interactions. Embodiments of the invention use text mining to extract personal information of users from user interactions automatically. A topic model is used to reduce the number of dimensions required to represent the text, yet all the information of interest is highly pronounced. This enables a lower dimensional representation of the data leading to significantly faster computations.
45 Citations
11 Claims
-
1. A computer implemented method for performing analysis on data generated by user interactions, comprising:
-
providing a processor executing instructions for receiving text information from at least one interaction between a user and an agent; said processor text mining said interaction information to extract personal information relating to said user automatically; said processor using a topic model to extract lines from said text information to reduce a number of dimensions required to represent the text, wherein all information of interest is highly pronounced, and wherein a resulting lower dimensional representation of the text allows significantly faster computations; said processor extracting said lines of text as anchored text lines that are indicative of personal information that is present in the anchored text lines; said processor identifying said anchored text lines by checking for specific keywords which are present when said user is mentioning personal information during said interaction; said processor applying a statistical technique to said anchored text lines to discover information present in the anchored text lines; said processor using a k nearest neighbor algorithm to discover said information present in the anchored text lines; said processor representing said text in a topic space with a score along each axis to indicate an extent to which said text contains personal information about said user; upon receiving a new data point, which optionally comprises an anchored text line from said text, said processor picking the k closest points to said new data point, determining a predominant class among classes in the k closest points, and assigning said predominant class to said new data point; said processor assigning a user corresponding to said text to at least one class based on said identified personal information; and said processor classifying said user based upon said extracted lines of text. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. An apparatus for performing analysis on data generated by user interactions, comprising:
-
a processor executing instructions for receiving text information from at least one interaction between a user and an agent; said processor text mining said interaction information to extract personal information relating to said user automatically; said processor using a topic model to extract lines from said text information to reduce a number of dimensions required to represent the text, wherein all information of interest is highly pronounced, and wherein a resulting lower dimensional representation of the text allows significantly faster computations; said processor extracting said lines of text as anchored text lines that are indicative of personal information that is present in the anchored text lines; said processor identifying said anchored text lines by checking for specific keywords which are present when said user is mentioning personal information during said interaction; said processor applying a statistical technique said anchored text lines to discover information present in the anchored text lines; said processor using a k nearest neighbor algorithm to discover said information present in the anchored text lines; said processor representing said text in a topic space with a score along each axis to indicate an extent to which said text contains personal information about said user; upon receiving a new data point, which optionally comprises an anchored text line from said text, said processor picking the k closest points to said new data point, determining a predominant class among classes in the k closest points, and assigning said predominant class to said new data point; said processor assigning a user corresponding to said text to at least one class based on said identified personal information; and said processor classifying said user based upon said extracted lines of text.
-
Specification