×

Ingesting documents using multiple ingestion pipelines

  • US 10,318,591 B2
  • Filed: 06/02/2015
  • Issued: 06/11/2019
  • Est. Priority Date: 06/02/2015
  • Status: Active Grant
First Claim
Patent Images

1. A method for analyzing a primary ingestion pipeline, the primary ingestion pipeline configured for using natural language processing (NLP) to populate a corpus with documents annotated with metadata tags such that the corpus becomes usable by a computer system for generating answers to questions as they are posed by users, the primary ingestion pipeline including a plurality of annotators configured for annotating documents passing through the primary ingestion pipeline, wherein each annotator of the plurality of annotators is configured to annotate the documents with a different defined subset of the metadata tags, the method comprising:

  • evaluating the plurality of annotators;

    evaluating a plurality of documents to be annotated by the plurality of annotators;

    generating, based on the evaluating the plurality of annotators and further based on the evaluating the plurality of documents, an ingestion risk score for each document of the plurality of documents, wherein each ingestion risk score represents a likelihood that an associated document will not successfully be annotated by all of the plurality of annotators while passing sequentially through each annotator in the primary ingestion pipeline;

    comparing each ingestion risk score to a set of risk criteria;

    determining, based on the comparing, that each document of a first set of documents of the plurality of documents satisfies the set of risk criteria and that each document of a second set of documents of the plurality of documents does not satisfy the set of risk criteria;

    entering, in response to the determining, the first set of documents into the primary ingestion pipeline; and

    providing, in response to the determining, special handling to the second set of documents.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×