Scheme for filtering documents on network using relevant and non-relevant profiles
First Claim
1. A document filtering method using a computer for extracting documents matching with a relevant profile expressing a user'"'"'s request in terms of features of relevant documents that were correctly selected in the past by the computer from search target documents and presenting extracted documents to a user through the computer, comprising the steps of:
- producing a non-relevant profile expressing non-relevant documents in terms of features of documents that were judged in the past as having high similarities with respect to the relevant profile by the computer but not matching with the user'"'"'s request;
calculating a first similarity between the relevant profile and each search target document, and comparing the first similarity with a prescribed first threshold for the relevant profile;
calculating a second similarity between the non-relevant profile and each search target document for which the first similarity is higher than the prescribed first threshold, and comparing the second similarity with a prescribed second threshold for the non-relevant profile;
removing each search target document for which the first similarity is higher than the prescribed first threshold and the second similarity is higher than the prescribed second threshold as a non-relevant document, and selecting each search target document for which the first similarity is higher than the prescribed first threshold and the second similarity is not higher than the prescribed second threshold as a document to be presented to the user; and
updating the relevant profile and the non-relevant profile according to a feedback from the user regarding relevance of documents presented to the user.
1 Assignment
0 Petitions
Accused Products
Abstract
A document filtering scheme capable of reducing the number of erroneously selected non-relevant documents without reducing the number of correctly selected relevant documents is disclosed. a non-relevant profile expressing non-relevant documents that are judged as having high similarities with respect to a relevant profile expressing a user'"'"'s request but not matching with the user'"'"'s request is utilized in addition to the relevant profile, such that each search target document for which the similarity with respect to the relevant profile is higher than a prescribed threshold for the relevant profile and the similarity with respect to the non-relevant profile is higher than a prescribed threshold for the non-relevant profile is removed as a non-relevant document.
-
Citations
12 Claims
-
1. A document filtering method using a computer for extracting documents matching with a relevant profile expressing a user'"'"'s request in terms of features of relevant documents that were correctly selected in the past by the computer from search target documents and presenting extracted documents to a user through the computer, comprising the steps of:
-
producing a non-relevant profile expressing non-relevant documents in terms of features of documents that were judged in the past as having high similarities with respect to the relevant profile by the computer but not matching with the user'"'"'s request; calculating a first similarity between the relevant profile and each search target document, and comparing the first similarity with a prescribed first threshold for the relevant profile; calculating a second similarity between the non-relevant profile and each search target document for which the first similarity is higher than the prescribed first threshold, and comparing the second similarity with a prescribed second threshold for the non-relevant profile; removing each search target document for which the first similarity is higher than the prescribed first threshold and the second similarity is higher than the prescribed second threshold as a non-relevant document, and selecting each search target document for which the first similarity is higher than the prescribed first threshold and the second similarity is not higher than the prescribed second threshold as a document to be presented to the user; and updating the relevant profile and the non-relevant profile according to a feedback from the user regarding relevance of documents presented to the user. - View Dependent Claims (2, 3, 4)
-
-
5. A document filtering system using a computer for extracting documents matching with a relevant profile expressing a user'"'"'s request in terms of features of relevant documents that were correctly selected in the past by the computer from search target documents and presenting extracted documents to a user through the computer, comprising:
-
a non-relevant profile production unit configured to produce a non-relevant profile expressing non-relevant documents in terms of features of documents that were judged in the past as having high similarities with respect to the relevant profile by the computer but not matching with the user'"'"'s request; a relevant profile similarity calculation unit configured to calculate a first similarity between the relevant profile and each search target document, and compare the first similarity with a prescribed first threshold for the relevant profile; a non-relevant profile similarity calculation unit configured to calculate a second similarity between the non-relevant profile and each search target document for which the first similarity is higher than the prescribed first threshold, and compare the second similarity with a prescribed second threshold for the non-relevant profile; a relevance judgment unit configured to remove each search target document for which the first similarity is higher than the prescribed first threshold and the second similarity is higher than the prescribed second threshold as a non-relevant document, and select each search target document for which the first similarity is higher than the prescribed first threshold and the second similarity is not higher than the prescribed second threshold as a document to be presented to the user; and a profile updating unit configured to update the relevant profile and the non-relevant profile according to a feedback from the user regarding relevance of documents presented to the user. - View Dependent Claims (6, 7, 8)
-
-
9. A computer usable medium having computer readable program codes embodied therein for causing a computer to function as a document filtering system for extracting documents matching with a relevant profile expressing a user'"'"'s request in terms of features of relevant documents that were correctly selected in the past by the computer from search target documents and presenting extracted documents to a user through the computer, the computer readable program codes include:
-
a first computer readable program code for causing said computer to produce a non-relevant profile expressing non-relevant documents in terms of features of documents that were judged in the past as having high similarities with respect to the relevant profile by the computer but not matching with the user'"'"'s request; a second computer readable program code for causing said computer to calculate a first similarity between the relevant profile and each search target document, and compare the first similarity with a prescribed first threshold for the relevant profile; a third computer readable program code for causing said computer to calculate a second similarity between the non-relevant profile and each search target document for which the first similarity is higher than the prescribed first threshold, and compare the second similarity with a prescribed second threshold for the non-relevant profile; a fourth computer readable program code for causing said computer to remove each search target document for which the first similarity is higher than the prescribed first threshold and the second similarity is higher than the prescribed second threshold as a non-relevant document, and select each search target document for which the first similarity is higher than the prescribed first threshold and the second similarity is not higher than the prescribed second threshold as a document to be presented to the user; and a fifth computer readable program code for causing said computer to update the relevant profile and the non-relevant profile according to a feedback from the user regarding relevance of documents presented to the user. - View Dependent Claims (10, 11, 12)
-
Specification