Smart selection of text spans
First Claim
1. A computer-implemented process for predicting a text span forming either a single word or a series of two or more words that a user intended to select, comprising:
- using a computer to perform the following process actions;
receiving a document comprising a string of characters;
receiving a location pointer indicating a particular location in the document;
inputting the document and the location pointer to a plurality of different candidate text span generation methods;
receiving a ranked list of one or more scored candidate text spans from each of the different candidate text span generation methods;
using a machine-learned ensemble model to re-score each of the scored candidate text spans received from each of the different candidate text span generation methods, the ensemble model being trained using a machine learning method and features from a dataset of true intended user text span selections; and
receiving a ranked list of re-scored candidate text spans from the ensemble model.
2 Assignments
0 Petitions
Accused Products
Abstract
A text span forming either a single word or a series of two or more words that a user intended to select is predicted. A document and a location pointer that indicates a particular location in the document are received and input to different candidate text span generation methods. A ranked list of one or more scored candidate text spans is received from each of the different candidate text span generation methods. A machine-learned ensemble model is used to re-score each of the scored candidate text spans that is received from each of the different candidate text span generation methods. The ensemble model is trained using a machine learning method and features from a dataset of true intended user text span selections. A ranked list of re-scored candidate text spans is received from the ensemble model.
201 Citations
20 Claims
-
1. A computer-implemented process for predicting a text span forming either a single word or a series of two or more words that a user intended to select, comprising:
-
using a computer to perform the following process actions; receiving a document comprising a string of characters; receiving a location pointer indicating a particular location in the document; inputting the document and the location pointer to a plurality of different candidate text span generation methods; receiving a ranked list of one or more scored candidate text spans from each of the different candidate text span generation methods; using a machine-learned ensemble model to re-score each of the scored candidate text spans received from each of the different candidate text span generation methods, the ensemble model being trained using a machine learning method and features from a dataset of true intended user text span selections; and receiving a ranked list of re-scored candidate text spans from the ensemble model. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A computer-implemented process for predicting a text span forming either a single word or a series of two or more words that a user intended to select, comprising:
-
using a computer to perform the following process actions; receiving a document comprising a string of characters; receiving a location pointer indicating a particular location in the document; inputting the document and the location pointer to a machine-learned hyperlink intent model; and receiving a ranked list of scored candidate text spans from the hyperlink intent model. - View Dependent Claims (16, 17, 18, 19)
-
-
20. A system for predicting a text span forming either a single word or a series of two or more words that a user intended to select, comprising:
-
a computing device comprising a display device; and a computer program having program modules executable by the computing device, the computing device being directed by the program modules of the computer program to, receive a document comprising text, receive a location pointer indicating a particular location in the document; input the document and the location pointer to a plurality of different candidate text span generation methods comprising one or more different linguistic unit detector methods and one or more different heuristic methods, receive a ranked list of one or more scored candidate text spans from each of the different candidate text span generation methods, use a machine-learned ensemble model to re-score each of the scored candidate text spans received from each of the different candidate text span generation methods, the ensemble model being trained using a machine learning method and features from a dataset of true intended user text span selections, said dataset being augmented with a testset of simulated user text span selections, receive a ranked list of re-scored candidate text spans from the ensemble model, identify the candidate text span in the ranked list of re-scored candidate text spans having the highest score, and display said identified candidate text span on the display device as a prediction of the text span that the user intended to select.
-
Specification