METHOD AND DEVICE OF HIERARCHICAL DOCUMENT FILTERING
First Claim
1. A method for filtering documents, comprising:
selecting multiple documents from a to-be-filtered document set of a current document filtering layer according to a preset sampling strategy, and forming a first document list by using the selected documents according to an order of the selected documents in the to-be-filtered document set, wherein documents in the to-be-filtered document set are ordered according to quality values of the documents at an upper document filtering layer;
calculating a quality value of each document in the first document list on the current document filtering layer according to a relevance calculation method for the current document filtering layer;
reordering the documents in the first document list according to the quality value of each document in the first document list on the current document filtering layer, to obtain a second document list; and
filtering the to-be-filtered document set of the current document filtering layer according to the degree of consistency between the first document list and the second document list.
Abstract
The present disclosure provides a method and a device of hierarchical document filtering. The method includes: selecting multiple documents from a to-be-filtered document set of a current document filtering layer according to a preset sampling strategy, and forming a first document list by using the selected documents according to an order of the selected documents in the to-be-filtered document set; calculating a quality value of each document in the first document list respectively according to a relevance calculation method for the current document filtering layer; reordering the documents in the first document list according to the quality value of each document in the first document list, to obtain a second document list; and filtering the to-be-filtered document set of the current document filtering layer according to the degree of consistency between the first document list and the second document list.
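The steps in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: the abstract does not fix the sampling strategy, the consistency measure, or the filtering rule, so the choices below are assumptions made only for illustration. Sampling takes every `step`-th document, consistency is the fraction of document pairs that keep their relative order, and a higher consistency is assumed to justify cutting more of the tail of the set. The names `sample_in_order`, `pairwise_consistency`, `filter_layer`, `step`, and `min_keep` are hypothetical, and document identifiers are assumed to be unique.

```python
from itertools import combinations
from typing import Callable, List, Sequence, Tuple


def sample_in_order(docs: Sequence[str], step: int) -> List[str]:
    """Assumed sampling strategy: every `step`-th document, keeping the
    upper-layer order, which yields the first document list."""
    return list(docs[::step])


def pairwise_consistency(first: Sequence[str], second: Sequence[str]) -> float:
    """Assumed consistency measure: fraction of document pairs that appear
    in the same relative order in both lists (1.0 = identical order)."""
    if len(first) < 2:
        return 1.0
    pos = {doc: i for i, doc in enumerate(second)}
    pairs = list(combinations(first, 2))  # in each pair (a, b), a precedes b in `first`
    agree = sum(1 for a, b in pairs if pos[a] < pos[b])
    return agree / len(pairs)


def filter_layer(
    docs: Sequence[str],
    score: Callable[[str], float],
    step: int = 2,
    min_keep: float = 0.5,
) -> Tuple[List[str], float]:
    """Filter one layer. `docs` is already ordered by upper-layer quality;
    `score` stands in for the current layer's relevance calculation."""
    first = sample_in_order(docs, step)
    # Second list: the sampled documents reordered by current-layer quality.
    second = sorted(first, key=score, reverse=True)
    consistency = pairwise_consistency(first, second)
    # Assumed rule: the more the two orderings agree, the more of the tail
    # is dropped; with no agreement, the whole set is kept.
    keep_fraction = 1.0 - (1.0 - min_keep) * consistency
    keep = max(1, round(len(docs) * keep_fraction))
    return list(docs[:keep]), consistency
```

For example, with eight documents `d1` to `d8` and a current-layer score that agrees with the upper-layer order, `filter_layer` keeps `d1` to `d4` and reports a consistency of 1.0; with a score that reverses the order, the consistency is 0.0 and all eight documents are kept.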
12 Claims
1. A method for filtering documents, comprising:
selecting multiple documents from a to-be-filtered document set of a current document filtering layer according to a preset sampling strategy, and forming a first document list by using the selected documents according to an order of the selected documents in the to-be-filtered document set, wherein documents in the to-be-filtered document set are ordered according to quality values of the documents at an upper document filtering layer;
calculating a quality value of each document in the first document list on the current document filtering layer according to a relevance calculation method for the current document filtering layer;
reordering the documents in the first document list according to the quality value of each document in the first document list on the current document filtering layer, to obtain a second document list; and
filtering the to-be-filtered document set of the current document filtering layer according to the degree of consistency between the first document list and the second document list.
Dependent claims: 2, 3, 4, 5, 6
7. A device for filtering documents, comprising:
a selection module, configured to select multiple documents from a to-be-filtered document set of a current document filtering layer according to a preset sampling strategy, and form a first document list by using the selected documents according to an order of the selected documents in the to-be-filtered document set, wherein documents in the to-be-filtered document set are ordered according to quality values of the documents at an upper document filtering layer;
a calculation module, configured to calculate a quality value of each document in the first document list respectively according to a relevance calculation method for the current document filtering layer;
an ordering module, configured to reorder the documents in the first document list according to the quality value of each document in the first document list, to obtain a second document list; and
a filtering module, configured to filter the to-be-filtered document set of the current document filtering layer according to the degree of consistency between the first document list and the second document list.
Dependent claims: 8, 9, 10, 11, 12
Specification