System and method for filtering documents
Abstract
A method and a document separation system for separating a set of related documents are described. In one aspect, the method comprises: determining, on a document selection system, quality scores for a plurality of the documents in the set of related documents; obtaining a similarity score for a plurality of pairs of documents in the set of related documents; and, on the document selection system, obtaining a first subset of related documents which solves an optimization problem, the first subset of related documents including a portion of the documents in the set of related documents, the optimization problem being a function of one or more quality scores of the documents assigned to the first subset of related documents and one or more similarity scores of pairs of documents assigned to the first subset of related documents.
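The selection step described above can be illustrated with a minimal Python sketch. The evaluation function used here (total quality minus a weighted sum of pairwise similarities) is an assumption made purely for illustration, since this text does not reproduce the claimed evaluation function. The names `evaluate`, `select_subset`, and `penalty`, the exhaustive search, and the sample scores are likewise hypothetical.

```python
from itertools import combinations


def evaluate(subset, quality, similarity, penalty=1.0):
    """Assumed evaluation function: reward quality, penalize redundancy."""
    total_quality = sum(quality[doc] for doc in subset)
    total_similarity = sum(
        similarity[frozenset(pair)] for pair in combinations(subset, 2)
    )
    return total_quality - penalty * total_similarity


def select_subset(docs, quality, similarity, penalty=1.0):
    """Try every non-empty subset and keep the one with the highest score.

    Exhaustive search is exponential in the number of documents, so this
    is only practical for small sets of related documents.
    """
    best, best_score = (), float("-inf")
    for size in range(1, len(docs) + 1):
        for subset in combinations(docs, size):
            score = evaluate(subset, quality, similarity, penalty)
            if score > best_score:
                best, best_score = subset, score
    return set(best), best_score


# Hypothetical scores: "a" and "b" are near-duplicates, "c" is distinct.
docs = ["a", "b", "c"]
quality = {"a": 0.9, "b": 0.8, "c": 0.4}
similarity = {
    frozenset(("a", "b")): 0.95,
    frozenset(("a", "c")): 0.1,
    frozenset(("b", "c")): 0.2,
}
chosen, score = select_subset(docs, quality, similarity)
```

Under these assumed scores the search keeps the higher-quality member of the near-duplicate pair together with the distinct document, which is the kind of trade-off between quality and similarity that the optimization problem is described as balancing.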
30 Claims
1. A method of separating a set of related documents, the method comprising:
- determining, on a document selection system, quality scores for a plurality of the documents in the set of related documents based on comparisons with a predetermined value;
- obtaining a similarity score for a plurality of pairs of documents in the set of related documents; and
- on the document selection system, obtaining a first subset of related documents which solves an optimization problem, the first subset of related documents being a subset of the set of related documents, the optimization problem being a function of one or more quality scores of the documents assigned to the first subset of related documents and one or more similarity scores of pairs of documents assigned to the first subset of related documents, wherein the optimization problem maximizes an evaluation function and wherein the evaluation function is: [equation missing from source]
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15
16. A document separation system comprising:
- a processor; and
- a memory coupled to the processor, the memory storing processor executable instructions which, when executed by the processor, cause the processor to:
- determine quality scores for a plurality of the documents in the set of related documents based on comparisons with a predetermined value;
- obtain a similarity score for a plurality of pairs of documents in the set of related documents; and
- obtain a first subset of related documents which solves an optimization problem, the first subset of related documents being a subset of the set of related documents, the optimization problem being a function of one or more quality scores of the documents assigned to the first subset of related documents and one or more similarity scores of pairs of documents assigned to the first subset of related documents, wherein the optimization problem maximizes an evaluation function and wherein the evaluation function is: [equation missing from source]
Dependent claims: 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30
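Both independent claims derive quality scores from comparisons with a predetermined value, without stating here which document attributes are compared. One possible reading is sketched below in Python, assuming the score is the fraction of measured attributes that meet a predetermined minimum. The function `quality_score`, the attribute names, and the threshold values are hypothetical and do not come from the claims.

```python
def quality_score(metrics, thresholds):
    """Return the fraction of attributes that meet their predetermined minimum.

    Attributes missing from `metrics` are treated as 0, so they fail the check.
    """
    met = sum(
        1 for name, minimum in thresholds.items()
        if metrics.get(name, 0) >= minimum
    )
    return met / len(thresholds)


# Hypothetical predetermined values and one document's measurements.
thresholds = {"word_count": 200, "citation_count": 3}
doc_metrics = {"word_count": 450, "citation_count": 1}
doc_quality = quality_score(doc_metrics, thresholds)
```

A score produced this way could then serve as the per-document quality input to the optimization problem recited in the claims.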
Specification