×

Generating secondary questions in an introspective question answering system

  • US 10,621,880 B2
  • Filed: 09/11/2012
  • Issued: 04/14/2020
  • Est. Priority Date: 09/11/2012
  • Status: Active Grant
First Claim
Patent Images

1. A method of generating secondary questions in a question-answer system comprising a processor running software for performing a plurality of question answering processes and a corpus of data;

  • the processor comparing the first question to the corpus of data;

    obtaining, from the question-answer system, candidate answers to the first question posed to the question-answer system, the candidate answers for the first question being generated from the corpus of data, each candidate answer being associated with evidence from the corpus of data;

    analyzing the evidence from the corpus of data associated with each candidate answer, the evidence supporting or refuting each candidate answer, the analyzing the evidence further comprises assigning an evidence score to the evidence based on how well the evidence matches the first question, wherein the evidence comprises good evidence, marginal evidence, and bad evidence;

    calculating confidence scores for the candidate answers to the first question based on the evidence from the corpus of data;

    identifying information to supplement the marginal evidence, the information improves the confidence scores for the candidate answers to the first question, the information not being in the corpus of data;

    automatically generating a plurality of hypotheses concerning the information that supplements the marginal evidence and improves the confidence scores for the candidate answers to-the first question, each hypothesis of the plurality of hypotheses being related to the information to improve the ability of the question-answer system to understand and evaluate the evidence associated with the candidate answers to the first question;

    automatically generating at least one secondary question based on each hypothesis of the plurality of hypotheses concerning the information that supplements the marginal information and improves the confidence scores for the candidate answers to the first question, each the at least one secondary question being formulated as a natural language inquiry in a human understandable format;

    ranking the hypotheses based on relative utility to determine an order in which to output the at least one secondary question to external sources to obtain responses;

    outputting, in rank order of the hypotheses, the at least one secondary question to the external sources in natural language format, the external sources comprising a community of human respondents;

    receiving responses to the at least one secondary question from the external sources;

    validating the responses to the at least one secondary question to extract a piece of data, fact, syntactical relationship, grammatical relationship, logical rule, or taxonomy rule that improves said confidence scores for the candidate answers to the first question, the validating comprises validating that the responses are supported by a threshold number of external sources; and

    adding the piece of data, fact, syntactical relationship, grammatical relationship, logical rule, or taxonomy rule extracted from the responses to the at least one secondary question to the corpus of data.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×