Document search method
First Claim
1. A document search method executed by a computer for extracting from a document database document information similar to other document information which is acquired from a network, comprising the steps of:
- (a) formatting first document information acquired from the network into a format of the document database; and
(b) outputting second document information and similarity information, where the second document information exists in the document database and is similar to the formatted first document information, and the similarity information is obtained by correcting a degree of similarity between the formatted first document information and the second document information in accordance with a condition which is preset.
1 Assignment
0 Petitions
Accused Products
Abstract
A document search method for extracting document information similar in content to given document information, from a document database with high accuracy and efficiency. A first document database is searched based on a search query which is input by a user. First document information extracted by the search of the first document database is formatted into a format of a second document database. The second document database is searched by using the formatted first document information. Second document information which is similar in content to the formatted first document information is extracted. A degree of similarity between the formatted first document information and the second document information is calculated. The calculated degree of similarity is corrected in accordance with a condition of correction which is preset. The first and second document information and the corrected degree of similarity are output.
75 Citations
16 Claims
-
1. A document search method executed by a computer for extracting from a document database document information similar to other document information which is acquired from a network, comprising the steps of:
-
(a) formatting first document information acquired from the network into a format of the document database; and
(b) outputting second document information and similarity information, where the second document information exists in the document database and is similar to the formatted first document information, and the similarity information is obtained by correcting a degree of similarity between the formatted first document information and the second document information in accordance with a condition which is preset. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A document search method executed by a computer for extracting from a network document information similar to other document information which is extracted from a document database, comprising the steps of:
-
(a) searching said document database based on a search query which is input by a user, so as to extract first document information;
(b) formatting said first document information extracted in step (a) into a predetermined format; and
(c) outputting second document information and similarity information, where the second document information is extracted from said network and is similar to the formatted first document information, and the similarity information is obtained by correcting a degree of similarity between the formatted first document information and the second document information in accordance with a condition of correction which is preset. - View Dependent Claims (8, 9, 10, 11)
-
-
12. A document search method executed by a computer for extracting from first and second document databases first document information and second document information which are similar in content, comprising the steps of:
-
(a) searching said first document database based on a search query which is input by a user, so as to extract said first document information;
(b) formatting said first document information extracted in step (a) into a format of said second document database; and
(c) outputting said second document information and similarity information, where the second document information is extracted from the second document database and is similar in content to the formatted first document information, and the similarity information is obtained by correcting a degree of similarity between the formatted first document information and the second document information in accordance with a condition which is preset.
-
-
13. A document search program which makes a computer perform document search processing for extracting from first and second document databases first document information and second document information which are similar in content, said document search processing comprising the steps of:
-
(a) searching said first document database based on a search query which is input by a user, so as to extract said first document information;
(b) formatting said first document information extracted in step (a) into a format of said second document database; and
(c) outputting said second document information and information on similarity between the formatted first document information and the second document information, where the second document information is extracted from the second document database and is similar in content to the formatted first document information. - View Dependent Claims (14)
-
-
15. A document search method executed by a computer for extracting document information similar in content from first and second document databases, comprising the steps of:
-
(a) preliminarily registering first document information of which a user is to be notified, in said first document database;
(b) searching for document information newly stored in said second document database, at regular time intervals, so as to extract second document information;
(c) formatting said second document information extracted in step (b) into a format of said first document database;
(d) searching said first document database by using the formatted second document information, outputting third document information which is similar in content to said formatted second document information, and calculating a degree of similarity between the formatted second document information and the third document information;
(e) correcting said degree of similarity in accordance with a condition which is preset; and
(f) sending said second document information extracted from said second document database and the corrected degree of similarity to said user when said third document information is said first document information, and the corrected degree of similarity is equal to or greater than a predetermined value.
-
-
16. A document search apparatus for extracting first document information and second document information similar in content from first and second document databases, comprising:
-
first document search means for searching said first document database based on a search query which is input by a user, so as to extract said first document information;
document formatting means for formatting said first document information extracted from said first document database, into a format of said second document database;
second document search means for searching said second document database by using the formatted first document information, outputting said second document information which is similar in content to the formatted first document information, and calculating a degree of similarity between the formatted first document information and the second document information;
correction means for correcting said degree of similarity in accordance with a condition which is preset; and
document output means for outputting said first and second document information and the corrected degree of similarity.
-
Specification