INFORMATION PROCESSING APPARATUS, FULL TEXT RETRIEVAL METHOD, AND COMPUTER-READABLE ENCODING MEDIUM RECORDED WITH A COMPUTER PROGRAM THEREOF
First Claim
Patent Images
1. An information processing apparatus for creating a retrieval result displaying a list of retrieval documents, comprising:
- a document retrieval part configured to retrieve the retrieval documents corresponding to a retrieval condition by conducting a full text retrieval of documents;
a document scoring part configured to order the retrieval documents by scores indicating degrees of relevance to the retrieval condition;
a feature word file database configured to register a document identification identifying a document, feature words extracted from full text data of the document, and weight values indicating weights of the feature words in which the feature words and the weight values are corresponded to the document identification, for each of the documents; and
a document clustering part configured to conduct a clustering process with respect to the retrieval documents based on the feature words and the weight values of the feature words acquired from the feature word file database, by using the document identifications of the retrieval documents as keys,wherein said information processing apparatus further comprises a document grouping part configured to group the retrieval documents based on the scores,wherein the document clustering part conducts the clustering process with respect to the retrieval documents in a group, for each of groups to which the retrieval documents are grouped by the document grouping part.
1 Assignment
0 Petitions
Accused Products
Abstract
An information processing apparatus for creating a retrieval result displaying a list of retrieval documents is disclosed. Retrieval documents corresponding to a retrieval condition are classified into groups based on scores indicating degrees of relevance to the retrieval condition. A clustering process is conducted with respect to the retrieval documents in a group, for each of groups to which the retrieval documents belong.
35 Citations
8 Claims
-
1. An information processing apparatus for creating a retrieval result displaying a list of retrieval documents, comprising:
-
a document retrieval part configured to retrieve the retrieval documents corresponding to a retrieval condition by conducting a full text retrieval of documents; a document scoring part configured to order the retrieval documents by scores indicating degrees of relevance to the retrieval condition; a feature word file database configured to register a document identification identifying a document, feature words extracted from full text data of the document, and weight values indicating weights of the feature words in which the feature words and the weight values are corresponded to the document identification, for each of the documents; and a document clustering part configured to conduct a clustering process with respect to the retrieval documents based on the feature words and the weight values of the feature words acquired from the feature word file database, by using the document identifications of the retrieval documents as keys, wherein said information processing apparatus further comprises a document grouping part configured to group the retrieval documents based on the scores, wherein the document clustering part conducts the clustering process with respect to the retrieval documents in a group, for each of groups to which the retrieval documents are grouped by the document grouping part. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A full text retrieval method in an information processing apparatus for creating a retrieval result displaying a list of retrieval documents, said information processing apparatus comprising:
-
a document retrieval part configured to retrieve the retrieval documents corresponding to a retrieval condition by conducting a full text retrieval of documents; a document scoring part configured to order the retrieval documents by scores indicating degrees of relevance to the retrieval condition; and a document clustering part configured to conduct a clustering process with respect to the retrieval documents based on feature words of the documents extracted from full text data of the documents and weight values indicating weights of the feature words, wherein said full text retrieval method comprises a step for grouping the retrieval documents based on the scores, wherein the document clustering part conducts the clustering process with respect to the retrieval documents in a group, for each of groups to which the retrieval documents are grouped in the step for grouping the retrieval documents.
-
-
8. A computer-readable encoding medium recorded with a computer program for causing an information processing apparatus to create a retrieval result displaying a list of retrieval documents, said information processing apparatus comprising:
-
a document retrieval part configured to retrieve the retrieval documents corresponding to a retrieval condition by conducting a full text retrieval of documents; a document scoring part configured to order the retrieval documents by scores indicating degrees of relevance to the retrieval condition; and a document clustering part configured to conduct a clustering process with respect to the retrieval documents based on feature words of the documents extracted from full text data of the documents and weight values indicating weights of the feature words, wherein said computer program comprises a code of grouping the retrieval documents based on the scores, wherein the document clustering part conducts the clustering process with respect to the retrieval documents in a group, for each of groups to which the retrieval documents are grouped in said code of grouping the retrieval documents.
-
Specification