Predicting interesting things and concepts in content
First Claim
1. A computer-implemented process performed by a computing device, comprising:
- receiving arbitrary content selected by a particular user and rendering that content to a display device;
automatically identifying and extracting a plurality of candidate items from the arbitrary content, the candidate items comprising any combination of one or more words, phrases, sentences, images, audio, hyperlinks, and topics in the arbitrary content;
predicting, for each of the plurality of the candidate items, a measure of interest with respect to the particular user, the measure of interest for each of the candidate items representing a statistical measure of a likelihood that the particular user will be interested in additional content related to the corresponding candidate item;
the predicted measure of interest provided by any combination of one or more of a machine-learned topic model and a machine-learned transition model,the machine-learned topic model generated by applying machine-learning to automatically evaluate content of samples of source and destination documents using a generative model that posits a distribution over joint latent transition topics for each possible transition between the source and destination topics,the machine-learned transition model generated by applying machine-learning to train on observed transitions between pairs of source content and destination content, and any combination of features extracted from those pairs and joint latent transition topics for each possible transition between the source and destination documents;
ranking the candidate items based on the predicted measure of interest for each candidate item;
selecting a predetermined number of the highest ranked candidate items;
associating additional content with the selected candidate items; and
modifying the arbitrary content by automatically arranging and displaying the associated additional content on the display device in conjunction with the arbitrary content, thereby enabling the user to interact with both the arbitrary content and the associated additional content.
2 Assignments
0 Petitions
Accused Products
Abstract
An “Engagement Predictor” provides various techniques for predicting whether things and concepts (i.e., “nuggets”) in content will be engaging or interesting to a user in arbitrary content being consumed by the user. More specifically, the Engagement Predictor provides a notion of interestingness, i.e., an interestingness score, of a nugget on a page that is grounded in observable behavior during content consumption. This interestingness score is determined by evaluating arbitrary documents using a learned transition model. Training of the transition model combines web browsing log data and latent semantic features in training data (i.e., source and destination documents) automatically derived by a Joint Topic Transition (JTT) Model. The interestingness scores are then used for highlighting one or more nuggets, inserting one or more hyperlinks relating to one or more nuggets, importing content relating to one or more nuggets, predicting user clicks, etc.
-
Citations
20 Claims
-
1. A computer-implemented process performed by a computing device, comprising:
-
receiving arbitrary content selected by a particular user and rendering that content to a display device; automatically identifying and extracting a plurality of candidate items from the arbitrary content, the candidate items comprising any combination of one or more words, phrases, sentences, images, audio, hyperlinks, and topics in the arbitrary content; predicting, for each of the plurality of the candidate items, a measure of interest with respect to the particular user, the measure of interest for each of the candidate items representing a statistical measure of a likelihood that the particular user will be interested in additional content related to the corresponding candidate item; the predicted measure of interest provided by any combination of one or more of a machine-learned topic model and a machine-learned transition model, the machine-learned topic model generated by applying machine-learning to automatically evaluate content of samples of source and destination documents using a generative model that posits a distribution over joint latent transition topics for each possible transition between the source and destination topics, the machine-learned transition model generated by applying machine-learning to train on observed transitions between pairs of source content and destination content, and any combination of features extracted from those pairs and joint latent transition topics for each possible transition between the source and destination documents; ranking the candidate items based on the predicted measure of interest for each candidate item; selecting a predetermined number of the highest ranked candidate items; associating additional content with the selected candidate items; and modifying the arbitrary content by automatically arranging and displaying the associated additional content on the display device in conjunction with the arbitrary content, thereby enabling the user to interact with both the arbitrary content and the associated additional content. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system for augmenting user selected documents, comprising:
-
a general purpose computing device; and a computer program comprising program modules executable by the computing device, wherein the computing device is directed by the program modules of the computer program to; automatically extract a plurality of candidate items from a user-selected document, the candidate items comprising any combination of one or more words, phrases, sentences, images, audio, hyperlinks, and topics in the user-selected document; predict a separate user-specific interestingness score for each of the candidate items, the interestingness score for each of the candidate items representing a statistical measure of a likelihood that the user will be interested in additional content related to the corresponding candidate item; the predicted user-specific interestingness score provided by any combination of one or more of a machine-learned topic model and a machine-learned transition model, the machine-learned topic model generated by applying machine-learning to automatically evaluate content of samples of source and destination documents using a generative model that posits a distribution over joint latent transition topics for each possible transition between the source and destination topics, the machine-learned transition model generated by applying machine-learning to train on observed transitions between pairs of source content and destination content, and any combination of features extracted from those pairs and joint latent transition topics for each possible transition between the source and destination documents; sort the candidate items based on the corresponding interestingness scores; select a predetermined number of the top sorted candidate items; for each of the selected candidate items, associate the related additional content with the corresponding selected candidate items; and modify the user-selected document by automatically arranging and displaying the associated related additional content on a display device in conjunction with the user-selected document, thereby enabling the user to interact with both the selected document and the corresponding associated related additional content. - View Dependent Claims (9, 10, 11, 12, 13)
-
-
14. A computer-readable storage device having computer executable instructions stored therein, said instructions causing a computing device to execute a method comprising:
-
rendering, on a display device, a user selected document; automatically extracting a plurality of candidate items from a user selected document, the candidate items comprising any combination of one or more words, phrases, sentences, images, audio, hyperlinks, and topics in the user selected document; predicting a measure of interest for each of a plurality of the candidate items with respect to the user, the measure of interest for each of the candidate items representing a statistical measure of a likelihood that the user will be interested in additional content related to the corresponding candidate item; the predicted measure of interest provided by any combination of one or more of a machine-learned topic model and a machine-learned transition model, the machine-learned topic model generated by applying machine-learning to automatically evaluate content of samples of source and destination documents using a generative model that posits a distribution over joint latent transition topics for each possible transition between the source and destination topics, the transition model generated by applying machine-learning to train on observed transitions between pairs of source content and destination content, and any combination of features extracted from those pairs and joint latent transition topics for each possible transition between the source and destination documents; ranking the candidate items based on the predicted measure of interest for each candidate item; responsive to the predicted measures of interest, selecting one or more of the highest ranked candidate items; associating additional content with the selected candidate items; and modifying the user-selected document by automatically arranging and displaying the associated additional related content on the display device in conjunction with the user-selected document. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
Specification