×

Post-processing for identifying nonsense passages in a question answering system

  • US 10,169,328 B2
  • Filed: 05/12/2016
  • Issued: 01/01/2019
  • Est. Priority Date: 05/12/2016
  • Status: Active Grant
First Claim
Patent Images

1. A method, in a data processing system, for identifying nonsense passages, the method comprising:

  • annotating, by an annotator in a nonsense identification component within a natural language processing pipeline configured to execute in the data processing system, an input passage with linguistic features to form an annotated passage;

    counting, by metric counters component in the nonsense identification component, a number of instances of each type of linguistic feature in the annotated passage to form a set of feature counts;

    determining, by the metric counters component, a value for a metric based on the set of feature counts;

    comparing, by a comparator component of the nonsense identification component, the value for the metric to a predetermined model threshold;

    determining, by a filter component of the nonsense identification component, whether the input passage is a nonsense passage based on a result of the comparison;

    responsive to the filter component determining the given evidence passage is a nonsense passage, sending, by the filter component of the nonsense identification component, the input passage to a semi-structured data pipeline configured to execute in the data processing system and preventing the input passage from proceeding in the natural language processing pipeline; and

    responsive to the filter component not determining that the input passage is a nonsense passage, passing, by the filter component, the input passage to the natural language processing pipeline.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×