Categorizing questions in a question answering system
First Claim
1. A method, in a question answering system, for categorizing questions, the method comprising:
storing a corpus of documents in a first database;
clustering documents from the corpus of documents to form a nested structure of clusters of documents;
recording questions answered by the question answering system in a second database that is different from the first database, wherein the questions are answered using supporting evidence from the corpus of documents, and wherein the questions are recorded in association with answers returned and answer confidence values;
mapping the questions to the nested structure of clusters of documents to form a nested structure of clusters of questions, wherein the mapping includes, for a given question, determining a document containing supporting evidence for the answer to the question, and mapping the question to a cluster containing the document;
mapping a question history of a user and context information of the user to the nested structure of clusters of questions;
ranking the nested structure of clusters of questions based on the mapping of the question history and context information, and on the number of questions in each cluster;
presenting, in a user interface, one or more topics corresponding to one or more of the highest ranking clusters of questions; and
responsive to receiving user selection of a topic of the presented one or more topics, presenting recommended questions in the user interface, wherein the recommended questions are mapped to a cluster corresponding to the selected topic.
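The mapping and ranking steps of the claim can be illustrated with a minimal sketch. It is not the patented implementation: the cluster paths, the sample questions, the weights, and the function names `map_questions_to_clusters` and `rank_clusters` are all hypothetical, and the scoring formula is only one plausible way to combine question history, context information, and cluster size.

```python
from collections import defaultdict

# Hypothetical nested document clusters: each document maps to a path
# running from the broadest topic down to the narrowest one.
DOC_CLUSTERS = {
    "doc1": ("health", "cardiology"),
    "doc2": ("health", "nutrition"),
    "doc3": ("finance", "loans"),
}

# Hypothetical question records: (question, supporting document, confidence).
QUESTION_LOG = [
    ("What lowers blood pressure?", "doc1", 0.91),
    ("What is a normal heart rate?", "doc1", 0.84),
    ("How much protein per day?", "doc2", 0.77),
    ("What is an APR?", "doc3", 0.88),
]


def map_questions_to_clusters(question_log, doc_clusters):
    """Place each question in every nested cluster containing its supporting document."""
    clusters = defaultdict(list)
    for question, doc_id, _confidence in question_log:
        path = doc_clusters[doc_id]
        # The question joins the leaf cluster and each of its ancestors.
        for depth in range(1, len(path) + 1):
            clusters[path[:depth]].append(question)
    return dict(clusters)


def rank_clusters(question_clusters, user_history, user_context,
                  history_weight=2.0, context_weight=1.0):
    """Rank cluster paths by history overlap, context match, and cluster size.

    The weights are illustrative assumptions, not values from the source.
    """
    def score(item):
        path, questions = item
        overlap = sum(1 for q in questions if q in user_history)
        context_hit = 1 if set(path) & set(user_context) else 0
        return history_weight * overlap + context_weight * context_hit + len(questions)

    ranked = sorted(question_clusters.items(), key=score, reverse=True)
    return [path for path, _questions in ranked]


question_clusters = map_questions_to_clusters(QUESTION_LOG, DOC_CLUSTERS)
ranking = rank_clusters(question_clusters,
                        user_history={"What is an APR?"},
                        user_context={"finance"},
                        history_weight=5.0)
# Recommended questions for a topic the user selects from the top of the ranking.
recommended = question_clusters[ranking[0]]
```

Because a question is added to its leaf cluster and to every ancestor, selecting a broad topic returns more recommended questions than selecting a narrow one.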
Abstract
A mechanism is provided in a question answering system for categorizing questions. The mechanism clusters documents from a corpus of documents to form a nested structure of clusters of documents. The mechanism records questions answered by the question answering system in a database in association with answers returned and answer confidence values. The mechanism maps the questions to the nested structure of clusters of documents to form a nested structure of clusters of questions. The mechanism presents recommended questions based on the nested structure of clusters of questions in a user interface of the question answering system.
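As a rough illustration of the recording step, the sketch below keeps the corpus and the answered questions in two separate stores, as the claims require. The use of SQLite, the table and column names, and the helper functions `open_stores` and `record_answered_question` are assumptions made for this example. Storing the supporting document identifier alongside each question is also an assumption, chosen because it makes the later mapping to document clusters straightforward.

```python
import sqlite3


def open_stores():
    """Create two independent in-memory databases: one for documents, one for questions."""
    corpus_db = sqlite3.connect(":memory:")
    question_db = sqlite3.connect(":memory:")
    corpus_db.execute(
        "CREATE TABLE documents (doc_id TEXT PRIMARY KEY, body TEXT)"
    )
    question_db.execute(
        "CREATE TABLE answered_questions ("
        "question TEXT, answer TEXT, confidence REAL, supporting_doc_id TEXT)"
    )
    return corpus_db, question_db


def record_answered_question(question_db, question, answer, confidence, supporting_doc_id):
    """Store a question together with the answer returned and its confidence value."""
    question_db.execute(
        "INSERT INTO answered_questions VALUES (?, ?, ?, ?)",
        (question, answer, confidence, supporting_doc_id),
    )
    question_db.commit()


corpus_db, question_db = open_stores()
corpus_db.execute(
    "INSERT INTO documents VALUES (?, ?)",
    ("doc1", "Regular exercise can lower blood pressure."),
)
record_answered_question(
    question_db, "What lowers blood pressure?", "Regular exercise", 0.91, "doc1"
)
```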
14 Claims
1. A method, in a question answering system, for categorizing questions, the method comprising:
storing a corpus of documents in a first database;
clustering documents from the corpus of documents to form a nested structure of clusters of documents;
recording questions answered by the question answering system in a second database that is different from the first database, wherein the questions are answered using supporting evidence from the corpus of documents, and wherein the questions are recorded in association with answers returned and answer confidence values;
mapping the questions to the nested structure of clusters of documents to form a nested structure of clusters of questions, wherein the mapping includes, for a given question, determining a document containing supporting evidence for the answer to the question, and mapping the question to a cluster containing the document;
mapping a question history of a user and context information of the user to the nested structure of clusters of questions;
ranking the nested structure of clusters of questions based on the mapping of the question history and context information, and on the number of questions in each cluster;
presenting, in a user interface, one or more topics corresponding to one or more of the highest ranking clusters of questions; and
responsive to receiving user selection of a topic of the presented one or more topics, presenting recommended questions in the user interface, wherein the recommended questions are mapped to a cluster corresponding to the selected topic.
Dependent claims: 2, 3, 4, 5, 6, 8
7. The method of claim 6, wherein generating the type-ahead suggestion output comprises generating a set of suggested questions based on the user input in a range of topics; and
wherein presenting the commonly asked questions further comprises narrowing the range of topics and modifying the set of suggested questions responsive to receiving subsequent user input.
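The narrowing behavior described in claim 7 can be sketched as follows. This is a simplified illustration only: the claim does not specify how suggestions are matched, so the prefix matching, the sample data, and the function name `type_ahead` are all assumptions.

```python
# Hypothetical store of commonly asked questions, grouped by topic.
QUESTIONS_BY_TOPIC = {
    "cardiology": ["what lowers blood pressure", "what is a normal heart rate"],
    "nutrition": ["what is a balanced diet"],
    "loans": ["what is an apr"],
}


def type_ahead(user_input, questions_by_topic):
    """Return suggested questions per topic that start with the text typed so far."""
    needle = user_input.strip().lower()
    suggestions = {}
    for topic, questions in questions_by_topic.items():
        matches = [q for q in questions if q.startswith(needle)]
        if matches:
            # Topics with no matching question drop out of the range.
            suggestions[topic] = matches
    return suggestions


broad = type_ahead("what", QUESTIONS_BY_TOPIC)          # spans all three topics
narrow = type_ahead("what is a n", QUESTIONS_BY_TOPIC)  # only cardiology remains
```

Each additional keystroke re-runs the match, so the range of topics and the set of suggested questions shrink together as the input becomes more specific.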
9. A computer program product comprising a computer readable storage medium having a computer readable program stored therein, wherein the computer readable program, when executed on a computing device, causes the computing device to:
store a corpus of documents in a first database;
cluster documents from the corpus of documents to form a nested structure of clusters of documents;
record questions answered by the question answering system in a second database that is different from the first database, wherein the questions are answered using supporting evidence from the corpus of documents, and wherein the questions are recorded in association with answers returned and answer confidence values;
map the questions to the nested structure of clusters of documents to form a nested structure of clusters of questions, wherein the mapping includes, for a given question, determining a document containing supporting evidence for the answer to the question, and mapping the question to a cluster containing the document;
map a question history of a user and context information of the user to the nested structure of clusters of questions;
rank the nested structure of clusters of questions based on the mapping of the question history and context information, and on the number of questions in each cluster;
present, in a user interface, one or more topics corresponding to one or more of the highest ranking clusters of questions; and
responsive to receiving user selection of a topic of the presented one or more topics, present recommended questions in the user interface, wherein the recommended questions are mapped to a cluster corresponding to the selected topic.
Dependent claims: 10, 11
12. An apparatus comprising:
a processor; and
a memory coupled to the processor, wherein the memory comprises instructions which, when executed by the processor, cause the processor to:
store a corpus of documents in a first database;
cluster documents from the corpus of documents to form a nested structure of clusters of documents;
record questions answered by the question answering system in a second database that is different from the first database, wherein the questions are answered using supporting evidence from the corpus of documents, and wherein the questions are recorded in association with answers returned and answer confidence values;
map the questions to the nested structure of clusters of documents to form a nested structure of clusters of questions, wherein the mapping includes, for a given question, determining a document containing supporting evidence for the answer to the question, and mapping the question to a cluster containing the document;
map a question history of a user and context information of the user to the nested structure of clusters of questions;
rank the nested structure of clusters of questions based on the mapping of the question history and context information, and on the number of questions in each cluster;
present, in a user interface, one or more topics corresponding to one or more of the highest ranking clusters of questions; and
responsive to receiving user selection of a topic of the presented one or more topics, present recommended questions in the user interface, wherein the recommended questions are mapped to a cluster corresponding to the selected topic.
Dependent claims: 13, 14
Specification