Custom language models
First Claim
Patent Images
1. A computer-implemented method comprising:
- receiving, at a computing device having one or more processors, a collection of documents;
clustering, at the computing device, the documents into one or more clusters;
generating, at the computing device, a cluster vector for each cluster of the one or more clusters;
generating, at the computing device, a target vector associated with a target profile, the target profile being associated with an identified user;
storing, at the computing device, the target profile;
comparing, at the computing device, the target vector with each of the cluster vectors;
selecting, at the computing device, one or more of the one or more clusters based on the comparison of the target vector with each of the cluster vectors; and
generating, at the computing device, a custom language model for the identified user using documents from the one or more selected clusters.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems, methods, and apparatuses including computer program products for generating a custom language model. In one implementation, a method is provided. The method includes receiving a collection of documents; clustering the documents into one or more clusters; generating a cluster vector for each cluster of the one or more clusters; generating a target vector associated with a target profile; comparing the target vector with each of the cluster vectors; selecting one or more of the one or more clusters based on the comparison; and generating a language model using documents from the one or more selected clusters.
20 Citations
14 Claims
-
1. A computer-implemented method comprising:
-
receiving, at a computing device having one or more processors, a collection of documents; clustering, at the computing device, the documents into one or more clusters; generating, at the computing device, a cluster vector for each cluster of the one or more clusters; generating, at the computing device, a target vector associated with a target profile, the target profile being associated with an identified user; storing, at the computing device, the target profile; comparing, at the computing device, the target vector with each of the cluster vectors; selecting, at the computing device, one or more of the one or more clusters based on the comparison of the target vector with each of the cluster vectors; and generating, at the computing device, a custom language model for the identified user using documents from the one or more selected clusters. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer-implemented method comprising:
-
receiving, at a computing device having one or more processors, a user input identifying a user; identifying, at the computing device, a user profile corresponding to the user; using, at the computing device, the identified user profile to generate a user specific language model, including; receiving a collection of documents associated with the user, clustering the documents into one or more clusters, generating a cluster vector for each cluster of the one or more clusters, generating a target vector associated with the identified user profile, comparing the target vector with each of the cluster vectors, selecting one or more of the one or more clusters based on the comparison between the target vector and each of the cluster vectors, and generating the user specific language model for the user using documents from the one or more selected clusters; and sending, by the computing device, the user specific language model to a first client associated with the user. - View Dependent Claims (11, 12)
-
-
13. A computer program product, encoded on a non-transitory program carrier, operable to cause data processing apparatus to perform operations comprising:
-
receiving a collection of documents; clustering the documents into one or more clusters; generating a cluster vector for each cluster of the one or more clusters; generating a target vector associated with a target profile, the target profile being associated with an identified user; comparing the target vector with each of the cluster vectors; selecting one or more of the one or more clusters based on the comparison between the target vector and each of the cluster vectors; and generating a custom language model for the identified user using documents from the one or more selected clusters.
-
-
14. A system, comprising:
-
a non-transitory machine-readable storage device including a program product; and one or more computers operable to execute the program product and perform operations comprising; receiving a collection of documents; clustering the documents into one or more clusters; generating a cluster vector for each cluster of the one or more clusters; generating a target vector associated with a target profile, the target profile being associated with an identified user; comparing the target vector with each of the cluster vectors; selecting one or more of the one or more clusters based on the comparison between the target vector and each of the cluster vectors; and generating a custom language model for the identified user using documents from the one or more selected clusters.
-
Specification