RECOMMENDING CONTENT USING DISCRIMINATIVELY TRAINED DOCUMENT SIMILARITY
First Claim
1. A method for training document similarity models, the method comprising:
- obtaining prior information of document relations and non-relations; and
discriminatively training an ensemble of document similarity classification models using the prior information of document relations and non-relations.
2 Assignments
0 Petitions
Accused Products
Abstract
A generalized discriminative training framework for reconciling the training and evaluation objectives for document similarity is provided. Prior information about document relations and non-relations, are used to discriminatively train an ensemble of document similarity classification models. This result is a model set that can be used to compute similarity between seen documents in the training sets and new documents. The measure of similarity forms the basis of recommending documents to a user as well as being able to obtain metadata information such as keywords and tags for new documents not having such information.
63 Citations
20 Claims
-
1. A method for training document similarity models, the method comprising:
-
obtaining prior information of document relations and non-relations; and discriminatively training an ensemble of document similarity classification models using the prior information of document relations and non-relations. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A document recommendation system comprising:
-
a plurality of candidate documents; and a module configured to receive a new document and calculate a similarity score of the new document relative to each of the candidate documents using a measure of discriminatively trained similarity associated with each candidate document. - View Dependent Claims (7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A system for obtaining metadata related to a document, the system comprising:
-
a plurality of documents, each document having metadata associated therewith; and a module configured to receive a new document and determine metadata to be associated therewith based on a similarity score of the new document relative to each of the documents of the plurality of documents using a measure of similarity based on a weighting factor associated with each document of the plurality of documents. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification