Hierarchical presearch type text search method and apparatus and magnetic disk unit used in the apparatus
First Claim
1. A document information search system for searching document information, the system comprising:
- a plurality of document information search apparatuses connected to a communication network, each of the document information search apparatuses operating in parallel to each other in accordance with search conditions broadcast from a plurality of search terminals connected to said communication network, said plurality of search terminals receiving and displaying search results transmitted from said plurality of document information search apparatuses, each of the plurality of document information search apparatuses including;
a text data storage means for storing a document text data group;
a search expression input means for inputting a search conditional expression in which a complex condition is designated including key words for searching said document text data group, and positional and logical relationships among said key words;
a search expression analysis means for analyzing the inputted search conditional expression to extract a search subject key word and a complex condition descriptive portion;
a synonym development means for generating synonyms of said search subject key word based on said search subject key word outputted from said search expression analysis means;
a different notation development means for generating different notation words of said search subject key word, and said synonyms;
a complex condition analysis means for analyzing said complex condition descriptive portion outputted from said search expression analysis means and developing said complex condition descriptive portion into said positional and logical relationships;
at least one term comparator means for reading text data from said text data storage means and collectively comparing said text data with respect to said different notation words generated by said different notation development means;
at least one complex condition judgment means for detecting documents corresponding to conditions designated by said complex condition analysis means based on results of a comparison outputted from said at least one term comparator means and outputting identifiers of the detected documents;
a search result output means for outputting identifier information of the documents corresponding to said search conditional expression based on the identifiers outputted from said at least one complex condition judgment means; and
,a communication means connected to said communication network for receiving said search conditional expression and supplying the search conditional expression to said search expression input means and for forwarding the identifier information from said search result output means to an operatively associated search conversation terminal from which a search request corresponding to the identifier information is transmitted.
2 Assignments
0 Petitions
Accused Products
Abstract
A method and apparatus for performing a document information search to uncover specified text data containing a given search subject key word from a group of document text data stored in a memory. In the document information search method, two stages of presearch are carried out to perform the document search with respect to a desired subject key word. In a first stage of presearch, a character component table is generated in which the existence of character codes for every document is set forth with respect to all the character codes contained in the group of document text data of stored documents. The character component table is searched for all the character codes comprising a designated search subject key word to thereby extract all documents containing all the character codes comprising the search subject key word. Further, in the presearch step, all texts without the possibility of containing the search subject key word are eliminated. A comprehensive, narrowed text search is thereby performed in accordance with the search subject key word.
61 Citations
8 Claims
-
1. A document information search system for searching document information, the system comprising:
a plurality of document information search apparatuses connected to a communication network, each of the document information search apparatuses operating in parallel to each other in accordance with search conditions broadcast from a plurality of search terminals connected to said communication network, said plurality of search terminals receiving and displaying search results transmitted from said plurality of document information search apparatuses, each of the plurality of document information search apparatuses including; a text data storage means for storing a document text data group; a search expression input means for inputting a search conditional expression in which a complex condition is designated including key words for searching said document text data group, and positional and logical relationships among said key words; a search expression analysis means for analyzing the inputted search conditional expression to extract a search subject key word and a complex condition descriptive portion; a synonym development means for generating synonyms of said search subject key word based on said search subject key word outputted from said search expression analysis means; a different notation development means for generating different notation words of said search subject key word, and said synonyms; a complex condition analysis means for analyzing said complex condition descriptive portion outputted from said search expression analysis means and developing said complex condition descriptive portion into said positional and logical relationships; at least one term comparator means for reading text data from said text data storage means and collectively comparing said text data with respect to said different notation words generated by said different notation development means; at least one complex condition judgment means for detecting documents corresponding to conditions designated by said complex condition analysis means based on results of a comparison outputted from said at least one term comparator means and outputting identifiers of the detected documents; a search result output means for outputting identifier information of the documents corresponding to said search conditional expression based on the identifiers outputted from said at least one complex condition judgment means; and
,a communication means connected to said communication network for receiving said search conditional expression and supplying the search conditional expression to said search expression input means and for forwarding the identifier information from said search result output means to an operatively associated search conversation terminal from which a search request corresponding to the identifier information is transmitted. - View Dependent Claims (2, 3)
-
4. A document information search system for searching document information, the system comprising:
a plurality of document information search apparatuses connected to a communication network, each of the document information search apparatuses operating in parallel to each other in accordance with search conditions broadcast from a plurality of search terminals connected to said communication network, said plurality of search terminals receiving and displaying search results transmitted from said plurality of document information search apparatuses, each of the plurality of document information search apparatuses including; a text data storage means for storing a document text data group; a search expression input means for inputting a search conditional expression in which a complex condition is designated including key words for searching said document text data group, and positional and logical relationships among said key words; a search expression analysis means for analyzing the inputted search conditional expression to extract a search subject key word and a complex condition descriptive portion; a synonym development means for generating synonyms of said search subject key word based on said search subject key word outputted from said search expression analysis means; a different notation development means for generating different notation words of said search subject key word, and said synonyms; a complex condition analysis means for analyzing said complex condition descriptive portion outputted from said search expression analysis means and developing said complex condition descriptive portion into said positional and logical relationships; at least one term comparator means for reading text data from said text data storage means and collectively comparing said text data with respect to said different notation words generated by said different notation development means; at least one complex condition judgment means for detecting documents corresponding to conditions designated by said complex condition analysis means based on results of a comparison outputted from said at least one term comparator means and outputting identifiers of the detected documents; a search result output means for outputting identifier information of the documents corresponding to said search conditional expression based on the identifiers outputted from said at least one complex condition judgment means; an image data storage means storing image information relating to stored document information, for reading and outputting image information together with bibliographic information and text data based on said identifier information outputted from said search result output means; and
,a communication means connected to said communication network for receiving said search conditional expression through said communication means and supplying the search conditional expression to said search expression input means and for forwarding the identifier information from said search result output means to an operatively associated search conversation terminal from which a search request corresponding to the identifier information is transmitted.
-
5. A document information search apparatus comprising:
-
a text data storage means for storing a document text data group; a search expression input means for inputting a search conditional expression in which a complex condition is designated including key words for searching said document text data group, and positional and logical relationships among said key words; a search expression analysis means for analyzing the inputted search conditional expression to extract a search subject key word and a complex condition descriptive portion; a synonym development means for generating synonyms of said search subject key word based on said search subject key word outputted from said search expression analysis means; a different notation development means for generating different notation words of said search subject key word, and said synonyms, said different notation development means including a conversion rule table in which head terms constituted by terms of at least one character and at least one other term corresponding to every head term are stored, the different notation development means converting all portions of inputted terms coincident with said head terms in said conversion rule table into term lists each including one head term and the at least one other term corresponding to the head term, and developing terms inputted in combination with the term lists into a group of other terms; a complex condition analysis means for analyzing said complex condition descriptive portion outputted from said search expression analysis means and developing said complex condition descriptive portion into said positional and logical relationships; at least one term comparator means for reading text data from said text data storage means and collectively comparing said text data with respect to said different notation words generated by said different notation development means; at least one complex condition judgment means for detecting documents corresponding to conditions designated by said complex condition analysis means based on results of a comparison outputted from said at least one term comparator means and outputting identifiers of the detected documents; a search output means for outputting identifier information of the documents corresponding to said search conditional expression based on the identifiers outputted from said at least one complex condition judgment means; and
,a presearch means for preparing in advance contracted text data for every document in which adjuncts and duplication of repeatedly stated words are eliminated in the text data and for extracting documents in which key words in the designated search conditional expression are contained by words, so that the text data with respect to the document collection reduced by said presearch means are read out from said text data storage means and search is executed on said text data by said term comparator means and said complex condition judgment means, wherein said at least one term comparator means is used also as said presearch means. - View Dependent Claims (6, 7, 8)
-
Specification