Generating usage report in a question answering system based on question categorization
Abstract
A mechanism is provided in a question answering system for generating a usage report based on question categorization. The mechanism clusters documents from a corpus of documents to form a nested structure of clusters of documents. The mechanism records questions answered by the question answering system in a database in association with answers returned and answer confidence values. The mechanism maps the questions to the nested structure of clusters of documents to form a nested structure of clusters of questions. The mechanism generates a usage report based on the nested structure of clusters of questions and presents the usage report responsive to a requesting user.
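The mapping step described in the abstract can be illustrated with a minimal sketch. The text does not specify how documents are clustered or how a question is matched to a cluster, so everything here is an assumption made for illustration: the `Cluster` class, the `map_question` function, and the use of simple keyword overlap as a stand-in for the matching technique are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Cluster:
    """One node in a nested structure of document clusters (hypothetical layout)."""
    topic: str
    keywords: set[str]
    children: list["Cluster"] = field(default_factory=list)
    questions: list[str] = field(default_factory=list)


def map_question(root: Cluster, question: str) -> list[str]:
    """Map a question onto the nested clusters and return the topic path.

    Assumption: keyword overlap stands in for whatever similarity measure
    a real system would use. The question is recorded at every cluster it
    passes through, so each level ends up with its own cluster of questions.
    """
    words = set(question.lower().replace("?", "").split())
    node = root
    node.questions.append(question)
    path = [node.topic]
    while node.children:
        # Pick the child cluster sharing the most keywords with the question.
        best = max(node.children, key=lambda c: len(c.keywords & words))
        if not best.keywords & words:
            break  # no child matches; the question stays at the current level
        best.questions.append(question)
        path.append(best.topic)
        node = best
    return path
```

Because each question is stored at every level it passes through, the number of questions per cluster can be read directly from `len(cluster.questions)` at any depth of the nesting.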
14 Claims
1. A method, in a question answering system, for generating a usage report based on question categorization, the method comprising:
clustering documents from a corpus of documents to form a nested structure of clusters of documents;
recording questions answered by a question answering system in a database in association with answers returned and answer confidence values;
mapping the questions to the nested structure of clusters of documents to form a nested structure of clusters of questions;
ranking the nested structure of clusters of questions according to number of questions in each cluster;
associating each cluster of questions with a topic;
generating a usage report based on the nested structure of clusters of questions, wherein generating the usage report comprises:
identifying one or more topics having a number of questions that is below a predetermined threshold, and recommending removing or replacing documents from the corpus in the one or more clusters corresponding to the one or more topics;
presenting the usage report responsive to a requesting user; and
subsequent to the user removing or replacing documents from the corpus, thereby creating a modified corpus of documents, utilizing the modified corpus of documents to generate an answer to a question submitted to the question answering system.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A computer program product comprising a computer readable storage medium having a computer readable program stored therein, wherein the computer readable program, when executed on a computing device, causes the computing device to:
cluster documents from a corpus of documents to form a nested structure of clusters of documents;
record questions answered by a question answering system in a database in association with answers returned and answer confidence values;
map the questions to the nested structure of clusters of documents to form a nested structure of clusters of questions;
rank the nested structure of clusters of questions according to number of questions in each cluster;
associate each cluster of questions with a topic;
generate a usage report based on the nested structure of clusters of questions, wherein generating the usage report comprises:
identifying one or more topics having a number of questions that is below a predetermined threshold, and recommending removing or replacing documents from the corpus in the one or more clusters corresponding to the one or more topics;
present the usage report responsive to a requesting user; and
subsequent to the user removing or replacing documents from the corpus, thereby creating a modified corpus of documents, utilize the modified corpus of documents to generate an answer to a question submitted to the question answering system.
Dependent claims: 9, 10, 11
12. An apparatus comprising:
a processor; and
a memory coupled to the processor, wherein the memory comprises instructions which, when executed by the processor, cause the processor to:
cluster documents from a corpus of documents to form a nested structure of clusters of documents;
record questions answered by a question answering system in a database in association with answers returned and answer confidence values;
map the questions to the nested structure of clusters of documents to form a nested structure of clusters of questions;
rank the nested structure of clusters of questions according to number of questions in each cluster;
associate each cluster of questions with a topic;
generate a usage report based on the nested structure of clusters of questions, wherein generating the usage report comprises:
identifying one or more topics having a number of questions that is below a predetermined threshold, and recommending removing or replacing documents from the corpus in the one or more clusters corresponding to the one or more topics;
present the usage report responsive to a requesting user; and
subsequent to the user removing or replacing documents from the corpus, thereby creating a modified corpus of documents, utilize the modified corpus of documents to generate an answer to a question submitted to the question answering system.
Dependent claims: 13, 14
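The ranking and threshold steps recited in the independent claims can be sketched as follows. This is only an illustration under stated assumptions, not the claimed implementation: the function name `usage_report`, the dictionary-based inputs, and the shape of the returned report are all hypothetical, and the per-topic question counts are assumed to have been computed already.

```python
def usage_report(
    question_counts: dict[str, int],
    cluster_docs: dict[str, list[str]],
    threshold: int,
) -> dict:
    """Build a simple usage report from per-topic question counts.

    question_counts: topic -> number of questions mapped to that topic's cluster
    cluster_docs:    topic -> documents in the corresponding document cluster
    threshold:       the predetermined threshold; topics with fewer questions
                     than this are flagged
    """
    # Rank topics by number of questions, most-asked first.
    ranked = sorted(question_counts.items(), key=lambda kv: kv[1], reverse=True)

    # Flag topics strictly below the threshold and recommend removing or
    # replacing the documents in the corresponding clusters.
    recommendations = [
        {
            "topic": topic,
            "questions": count,
            "documents": cluster_docs.get(topic, []),
            "action": "remove or replace",
        }
        for topic, count in ranked
        if count < threshold
    ]
    return {"ranking": ranked, "recommendations": recommendations}
```

The report only recommends; acting on it is left to the requesting user, which matches the claims' wording that the modified corpus is used after the user removes or replaces documents.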
Specification