Corpus augmentation system
First Claim
Patent Images
1. A method, in an information handling system comprising a processor and a memory, for ingesting additional content in a knowledge base, the method comprising:
- mining, by the system, an interaction history comprising a plurality of questions and answer results to identify a first question by performing a natural language processing (NLP) analysis of the plurality of questions and answer results to detect the first question that meets specified answer deficiency criteria;
generating, by the system, a second question which is correlated to the first question by extracting a text sentence from one or more documents correlated to the first question and parsing the text sentence to populate a defined question template used to construct the second question requesting additional answer information for answering the first question;
selecting, by the system, at least one persona to post the second question;
posting, by the system, the second question to a forum using the at least one persona;
monitoring, by the system, the forum for responses to the second question; and
ingesting, by the system, any response to the second question as additional content in the knowledge base.
1 Assignment
0 Petitions
Accused Products
Abstract
An approach is provided for automatically ingesting additional corpus based on an interaction history that is mined to identify a question that meets specified answer deficiency criteria, and then generate a second question which is correlated to the first question by requesting additional answer information for answering the first question, where the second question is posted to a forum using a selected persona so that forum responses can be monitored and ingested as additional content in the knowledge base.
9 Citations
7 Claims
-
1. A method, in an information handling system comprising a processor and a memory, for ingesting additional content in a knowledge base, the method comprising:
-
mining, by the system, an interaction history comprising a plurality of questions and answer results to identify a first question by performing a natural language processing (NLP) analysis of the plurality of questions and answer results to detect the first question that meets specified answer deficiency criteria; generating, by the system, a second question which is correlated to the first question by extracting a text sentence from one or more documents correlated to the first question and parsing the text sentence to populate a defined question template used to construct the second question requesting additional answer information for answering the first question; selecting, by the system, at least one persona to post the second question; posting, by the system, the second question to a forum using the at least one persona; monitoring, by the system, the forum for responses to the second question; and ingesting, by the system, any response to the second question as additional content in the knowledge base. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
Specification