Document quality measurement
First Claim
1. A method for evaluating document quality, the method comprising:
- for a first plurality of documents, performing for each document of the first plurality of documents;
identifying, by a computer system, quality attributes associated with the each document of the first plurality of documents, the quality attributes including characterizations of usage of the each document of the first plurality of documents, relevance of text in the each document of the first plurality of documents to one or more topics, and quantity of text and media;
associating, by the computer system, a classifier from a plurality of classifiers with the each document of the first plurality of documents in accordance with relevance of content of the each document of the first plurality of documents to the classifier, by selecting the classifier from a taxonomy of concepts as being representative of the each document of the first plurality of documents by evaluating meanings of textual representations within the each document of the first plurality of documents; and
receiving, by the computer system, a ranking of the each document of the first plurality of documents;
training a plurality of class-specific models each corresponding to a classifier of the plurality of classifiers by, for each classifier of the plurality of classifiers, training, by the computer system, a class-specific model of the plurality of class-specific models corresponding to the each classifier of the plurality of classifiers according to the ranking of the first plurality of documents associated with the each classifier of the plurality of classifiers and both of the quality attributes associated with documents of the first plurality of documents associated with the each classifier of the plurality of classifiers and the content of the documents of the first plurality of documents associated with the each classifier of the plurality of classifiers; and
for a second plurality of documents, performing, by the computer system, for each document of the second plurality of documents;
identifying quality attributes associated with the each document of the second plurality of documents;
associating a classifier with the each document of the second plurality of documents in accordance with relevance of content of the each document of the second plurality of documents to the classifier;
inputting, by the computer system, the each document of the second plurality of documents to a selected class-specific model of the plurality of class-specific models corresponding to the classifier associated with the each document of the second plurality of documents;
ranking, by the computer system, each document of the second plurality of documents according to the selected class-specific model using as inputs both of the quality attributes and the content of the each document of the second plurality of documents.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods are disclosed herein for ranking the quality of documents, such as documents shared or referenced in postings by users. For a first set of documents quality attributes that are indicative of quality or lack of quality are identified. Ratings of the quality of the first set of documents are received. Classifiers are associated with each document and the ratings and quality attributes for each attribute used to train class-specific models corresponding to the classifiers. Subsequently received documents are then classified and corresponding quality attributes are evaluated using the corresponding class-specific model in order to rank the quality of the document.
24 Citations
20 Claims
-
1. A method for evaluating document quality, the method comprising:
-
for a first plurality of documents, performing for each document of the first plurality of documents; identifying, by a computer system, quality attributes associated with the each document of the first plurality of documents, the quality attributes including characterizations of usage of the each document of the first plurality of documents, relevance of text in the each document of the first plurality of documents to one or more topics, and quantity of text and media; associating, by the computer system, a classifier from a plurality of classifiers with the each document of the first plurality of documents in accordance with relevance of content of the each document of the first plurality of documents to the classifier, by selecting the classifier from a taxonomy of concepts as being representative of the each document of the first plurality of documents by evaluating meanings of textual representations within the each document of the first plurality of documents; and receiving, by the computer system, a ranking of the each document of the first plurality of documents; training a plurality of class-specific models each corresponding to a classifier of the plurality of classifiers by, for each classifier of the plurality of classifiers, training, by the computer system, a class-specific model of the plurality of class-specific models corresponding to the each classifier of the plurality of classifiers according to the ranking of the first plurality of documents associated with the each classifier of the plurality of classifiers and both of the quality attributes associated with documents of the first plurality of documents associated with the each classifier of the plurality of classifiers and the content of the documents of the first plurality of documents associated with the each classifier of the plurality of classifiers; and for a second plurality of documents, performing, by the computer system, for each document of the second plurality of documents; identifying quality attributes associated with the each document of the second plurality of documents; associating a classifier with the each document of the second plurality of documents in accordance with relevance of content of the each document of the second plurality of documents to the classifier; inputting, by the computer system, the each document of the second plurality of documents to a selected class-specific model of the plurality of class-specific models corresponding to the classifier associated with the each document of the second plurality of documents; ranking, by the computer system, each document of the second plurality of documents according to the selected class-specific model using as inputs both of the quality attributes and the content of the each document of the second plurality of documents. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A system for rule generation, the system comprising one or more processors and one or more memory devices operably coupled to the one or more processors, the one or more memory devices storing executable and operational data effective to cause the one or more processors to:
-
for a first plurality of documents, perform for each document of the first plurality of documents; identifying quality attributes associated with the each document of the first plurality of documents, the quality attributes including characterizations of usage of the each document, relevance of text in the each document to one or more topics, and quantity of text and media; associating a classifier from a plurality of classifiers with the each document of the first plurality of documents in accordance with relevance of content of the each document of the first plurality of documents to the classifier, by selecting the classifier from a taxonomy of concepts as being representative of the each document by evaluating meanings of textual representations within the each document; and receiving a ranking of the each document of the first plurality of documents; training a plurality of class-specific models each corresponding to a classifier of the plurality of classifiers by, for each classifier of the plurality of classifiers, training each class-specific model of the plurality of class-specific models corresponding to the each classifier according to both of the quality attributes associated with the each document of the first plurality of documents associated with the each classifier and the content of the each documents of the first plurality of documents associated with the each classifier; and for a second plurality of documents, perform for each document in the second plurality of documents; identifying quality attributes associated with the each document of the second plurality of documents; associating a classifier with the each document of the second plurality of documents in accordance with relevance of content of the each document of the second plurality of documents to the classifier; inputting the each document of the second plurality of documents to a selected class-specific model of the plurality of class-specific models corresponding to the classifier associated with the each document; ranking the each document of the second plurality of documents according to the selected class-specific model using as inputs one or both of the quality attributes and the content of the each document of the second plurality of documents. - View Dependent Claims (17, 18, 19, 20)
-
Specification