Document information search apparatus and method and recording medium storing document information search program therein
First Claim
1. A document information search apparatus for searching document information on the basis of a search request transmitted through a computer network and responding, wherein:
- a search condition designating unit, which designates a file as a search condition and transmits contents of said designated file via the network, is provided for a search requesting source; and
a document search unit having a morpheme analyzing unit which extracts nouns by a morpheme analysis of a text document and which forms a keyword from the file contents transmitted from said search condition designating unit and searches similar documents from a database is provided on a search side, wherein said document search unit comprises a search executing unit which searches for similar documents by searching the database by using said keyword and notifies the search requesting source of a search result.
1 Assignment
0 Petitions
Accused Products
Abstract
An apparatus for searching document information in response to a search request from a client is disclosed. When a document file is designated as a search condition by a search condition designating unit of the client, the unit causes contents of the designated file to be transmitted via a network. A document search unit of a search machine forms a keyword based on the file contents transmitted from the search condition designating unit, and searches similar documents from an index or selection of important words extracted from search target documents provided in a search database.
-
Citations
10 Claims
-
1. A document information search apparatus for searching document information on the basis of a search request transmitted through a computer network and responding, wherein:
-
a search condition designating unit, which designates a file as a search condition and transmits contents of said designated file via the network, is provided for a search requesting source; and
a document search unit having a morpheme analyzing unit which extracts nouns by a morpheme analysis of a text document and which forms a keyword from the file contents transmitted from said search condition designating unit and searches similar documents from a database is provided on a search side, wherein said document search unit comprises a search executing unit which searches for similar documents by searching the database by using said keyword and notifies the search requesting source of a search result. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A document information search apparatus for searching document information on the basis of a search request transmitted through a network and responding, wherein:
-
a search condition designating unit which designates a file as a search condition and transmits contents of said designated file via the network is provided for a search requesting source; and
a document search unit which forms a keyword from the file contents transmitted from said search condition designating unit and searches similar documents from a database is provided on a search side;
wherein index information describing a list of important words extracted from search target documents is stored for every document in said database;
and said document search unit on the search side comprises;
a text extraction processing unit which extracts a text document from the file contents received in response to the search request;
a morpheme analyzing unit which extracts nouns by a morpheme analysis of said text document;
a keyword forming unit which extracts important words from said nouns and forms a keyword in which said important words are coupled by OR; and
a search executing unit which searches for similar documents by searching the search database by using said keyword and notifies the search requesting source of a search result; and
wherein said keyword forming unit allows property information extracted from the file received in response to the search request to be included in said keyword, thereby allowing the similar documents to be searched.
-
Specification