Document retrieval system
First Claim
Patent Images
1. A document retrieval system for retrieving one or more registered documents from a document database responsive to designated retrieval conditions including one or more designated keywords, said document retrieval system comprising:
- (a) inverted file generating means supplied with the documents and keywords for generating an inverted file which indicates a relation between the keywords and each of the documents;
(b) table generating means supplied with the keywords for generating a keyword connection table containing keyword connections which describe relations between two arbitrary keywords in said inverted file, said keyword connection table containing relation information, the relation information including at least (1) a relation name and (2) a value of a relationship of the keyword connections, the value of the relationship indicating an amplitude of the relations between the two arbitrary keywords, the amplitudes of the relations between the two arbitrary keywords being dependent on whether the two arbitrary keywords are (1) directly coupled by a direct keyword connection or (2) are coupled indirectly by an indirect keyword connection through one or more keywords other than said two arbitrary keywords, the values of the relationships contained in said keyword connection table dynamically changing depending on (1) the direct keyword connection each time a document is registered in a storage means and (2) the indirect keyword connection;
(c) said storage means, said storage means for storing said inverted file, said keyword connection table, the keywords and the documents as registered documents, in a form of the document database;
(d) designating means for designating retrieval conditions including one or more designated keywords, said retrieval conditions determining one or more of said registered documents to be retrieved from said storage means;
(e) selecting means for looking up said keyword connection table by use of one or more of designated keywords and referring to said inverted file so as to select said one or more registered documents which satisfy said retrieval conditions, said selecting means including;
(1) a request processor for processing requests including a document retrieval request from said designating means;
(2) a first calculation part coupled to said storage means for calculating a relevance of keywords which indicate the amplitude of a relation between (1) a group of designated keywords from said designating means and (2) each of the keywords registered in said storage means, in response to the document retrieval request processed by said request processor;
(3) a second calculation part coupled to said storage means for calculating a relevance of text which indicates the amplitude of a relation between (1) the group of designated keywords from said designating means and (2) each of the documents registered in said storage means, in response to the document retrieval request processed in said request processor, said second calculation part calculating the relevance of text by (1) obtaining a sum of the values of the keyword connections between (i) the designated keywords and (ii) the keywords registered in said storage means, and (2) normalizing said sum; and
(4) a display controller, coupled to said first and second calculation parts, for providing information including the registered documents from said storage means to an output means, depending on calculation results obtained in said first and second calculation parts; and
(f) said output means, said output means coupled to said selecting means for outputting said one or more registered documents selected by said selecting means.
1 Assignment
0 Petitions
Accused Products
Abstract
A document retrieval system employs a keyword connection table which contains relation information of keyword connections respectively coupling two arbitrary keywords which are used for retrieving registered documents. The relation information at least includes a relation name and a relationship describing the relation between the two arbitrary keywords. The relation information may dynamically change depending on a frequency of use of the keywords, that is, by a learning function.
-
Citations
9 Claims
-
1. A document retrieval system for retrieving one or more registered documents from a document database responsive to designated retrieval conditions including one or more designated keywords, said document retrieval system comprising:
-
(a) inverted file generating means supplied with the documents and keywords for generating an inverted file which indicates a relation between the keywords and each of the documents; (b) table generating means supplied with the keywords for generating a keyword connection table containing keyword connections which describe relations between two arbitrary keywords in said inverted file, said keyword connection table containing relation information, the relation information including at least (1) a relation name and (2) a value of a relationship of the keyword connections, the value of the relationship indicating an amplitude of the relations between the two arbitrary keywords, the amplitudes of the relations between the two arbitrary keywords being dependent on whether the two arbitrary keywords are (1) directly coupled by a direct keyword connection or (2) are coupled indirectly by an indirect keyword connection through one or more keywords other than said two arbitrary keywords, the values of the relationships contained in said keyword connection table dynamically changing depending on (1) the direct keyword connection each time a document is registered in a storage means and (2) the indirect keyword connection; (c) said storage means, said storage means for storing said inverted file, said keyword connection table, the keywords and the documents as registered documents, in a form of the document database; (d) designating means for designating retrieval conditions including one or more designated keywords, said retrieval conditions determining one or more of said registered documents to be retrieved from said storage means; (e) selecting means for looking up said keyword connection table by use of one or more of designated keywords and referring to said inverted file so as to select said one or more registered documents which satisfy said retrieval conditions, said selecting means including; (1) a request processor for processing requests including a document retrieval request from said designating means; (2) a first calculation part coupled to said storage means for calculating a relevance of keywords which indicate the amplitude of a relation between (1) a group of designated keywords from said designating means and (2) each of the keywords registered in said storage means, in response to the document retrieval request processed by said request processor; (3) a second calculation part coupled to said storage means for calculating a relevance of text which indicates the amplitude of a relation between (1) the group of designated keywords from said designating means and (2) each of the documents registered in said storage means, in response to the document retrieval request processed in said request processor, said second calculation part calculating the relevance of text by (1) obtaining a sum of the values of the keyword connections between (i) the designated keywords and (ii) the keywords registered in said storage means, and (2) normalizing said sum; and (4) a display controller, coupled to said first and second calculation parts, for providing information including the registered documents from said storage means to an output means, depending on calculation results obtained in said first and second calculation parts; and (f) said output means, said output means coupled to said selecting means for outputting said one or more registered documents selected by said selecting means. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A document retrieval system for retrieving one or more registered documents from a document database responsive to designated retrieval conditions including one or more designated keywords, said document retrieval system comprising:
-
(a) inverted file generating means supplied with the documents and keywords for generating an inverted file which indicates a relation between the keywords and each of the documents; (b) table generating means supplied with the keywords for generating a keyword connection table containing keyword connections which describe relations between two arbitrary keywords in said inverted file, said keyword connection table containing relation information, the relation information including at least (1) a relation name and (2) a value of a relationship of the keyword connections, the value of the relationship indicating an amplitude of the relations between the two arbitrary keywords, the amplitudes of the relations between the two arbitrary keywords being dependent on whether the two arbitrary keywords are (1) directly coupled by a direct keyword connection or (2) are coupled indirectly by an indirect keyword connection through one or more keywords other than said two arbitrary keywords, the values of the relationships contained in said keyword connection table dynamically changing depending on (1) the direct keyword connection each time a document is registered in a storage means and (2) the indirect keyword connection; (c) said storage means, said storage means for storing said inverted file, said keyword connection table, the keywords and the documents as registered documents, in a form of the document database; (d) designating means for designating retrieval conditions including one or more designated keywords, said retrieval conditions determining one or more of said registered documents to be retrieved from said storage means; (e) selecting means for looking up said keyword connection table by use of one or more of designated keywords and referring to said inverted file so as to select said one or more registered documents which satisfy said retrieval conditions, said selecting means including; (1) a request processor for processing request including a document retrieval request from said designating means; (2) a first calculation part coupled to said storage means for calculating a relevance of keywords which indicate the amplitude of a relation between (1) a group of designated keywords from said designating means and (2) each of the keywords registered in said storage means, in response to the document retrieval request processed by said request processor; (3) a second calculation part coupled to said storage means for calculating a relevance of text which indicates the amplitude of a relation between (1) the group of designated keywords from said designating means and (2) each of the documents registered in said storage means, in response to the document retrieval request processed in said request processor, said second calculation part calculating the relevance of text by (1) obtaining a sum of the values of the keyword connections between (i) the designated keywords and (ii) the keywords registered in said storage means, and (2l ) normalizing said sum; and (4) a display controller, coupled to said first and second calculation parts, for providing information including the registered documents from said storage means to an output means, depending on calculation results obtained in said first and second calculation parts; and (f) said output means, said output means coupled to said selecting means for outputting said one or more registered documents selected by said selecting means; wherein; (A) the amplitude of the relation between the two arbitrary keywords is dependent on whether; (1) the two arbitrary keywords are directly coupled by a direct keyword connection or (2) coupled indirectly by an indirect keyword connection through one or a plurality of keywords other than said two arbitrary keywords; (B) the value of a relationship KREL [Kp, Kq ] of a keyword connection (p, q) between two arbitrary keywords Kp and Kq is defined as
space="preserve" listing-type="equation">K.sub.REL [K.sub.p, K.sub.q ]=f[S.sub.ij, N.sub.ij, p, q]by use of a relation name Nij and a relationship value Sij of a keyword connection (i, j)ε
UCALL between keywords Ki and Kj existing in a keyword layer;
wherein;(1) UCALL denotes a set of keyword connections between the keywords Ki and Kj in the keyword layer; and (2) Kp, Kq, Ki and Kj respectively denote keywords p, q, i and j; (C) the function f[Sij, Nij, p, q] is equal to Sij xAMP [Nij ] when a direct keyword connection exists between the two keywords Kp and Kq, where AMP [Nij ] denotes a weighting coefficient which varies depending on the relation name; and (D) the function f[Sij, Nij, p, q] is equal to ##EQU5## when no direct keyword connection exists between the two keywords Kp and Kq but one or more indirect keyword connections exist between the two keywords Kp and Kq, wherein; (1) C1 through Cn denote numbers of indirect keyword connections; (2) UCk denotes a set of direct keyword connections constituting indirect keyword connections Ck ; (3) Σ
denotes a sum total of the direct keyword connections (i, j) included in UCk ; and(4) MIN denotes a minimum value of the relationship values of the n indirect keyword connections C1 through Cn. - View Dependent Claims (8, 9)
-
Specification