Natural language processing with dynamic pipelines

  • US 10,380,253 B2
  • Filed: 03/04/2014
  • Issued: 08/13/2019
  • Est. Priority Date: 03/04/2014
  • Status: Active Grant
First Claim

1. A method for natural language processing, the method comprising:

    selecting, by a computer processor, a dynamic pipeline based, at least in part, on a corpus, wherein the dynamic pipeline links a first human language technology component and a second human language technology component, wherein the first human language technology component comprises a first set of algorithms and the second human language technology component comprises a second set of algorithms and wherein the corpus includes at least text, audio, and video;

    identifying, by a computer processor, a first algorithm of the first set of algorithms associated with the first human language technology component and a second algorithm of the second set of algorithms associated with the second human language technology component;

    applying, by the computer processor, the first algorithm based, at least in part, on the corpus to generate a first cluster space that reflects a dynamic determination of relationships within the corpus, wherein the first cluster space includes probabilities that each respective relationship within the corpus is true or untrue;

    amending, by the computer processor, an evidence chain that includes one or more findings of true relationships associated with the corpus in response to applying the first algorithm, to reflect a most recent finding of a true relationship of the true relationships that supersedes a previous finding in light of a probabilistic determination from new determined relationships in the first cluster space;

    standardizing, by the computer processor, a first ontology of the first cluster space, wherein the first ontology is a data structure on a computer;

    applying, by the computer processor, the second algorithm based, at least in part, on the corpus and the first ontology of the first cluster space to generate a second cluster space that is associated with the corpus;

    identifying, by the computer processor, a set of information of one or more corpora that has a relevance to the corpus that exceeds a pre-determined threshold based, at least in part, on the first and second cluster spaces of corpus; and

    generating, by the computer processor, a summary report based, at least in part, on the set of information of the one or more corpora.
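
The data flow recited in the claim can be illustrated with a minimal Python sketch. It is not the patented implementation. Every function name, the 0.5 cutoff, and the toy algorithms (term co-occurrence for the first cluster space, grouping by shared terms for the second) are assumptions made purely for illustration, and the sketch handles text only, whereas the claimed corpus also includes audio and video.

```python
from itertools import combinations

CUTOFF = 0.5  # assumed probability above which a relationship counts as true


def tokens(doc):
    """Lower-cased set of whitespace-separated words in a document."""
    return set(doc.lower().split())


def first_algorithm(corpus):
    """Toy first cluster space: each term pair maps to the fraction of
    documents in which both terms occur (a stand-in for a probability)."""
    docs = [tokens(d) for d in corpus]
    terms = sorted(set().union(*docs))
    return {(a, b): sum(a in d and b in d for d in docs) / len(docs)
            for a, b in combinations(terms, 2)}


def amend_evidence_chain(chain, first_space):
    """Record relationships found true; the most recent finding for a
    pair supersedes any earlier finding stored under the same key."""
    for pair, prob in first_space.items():
        if prob > CUTOFF:
            chain[pair] = prob
    return chain


def standardize_ontology(first_space):
    """Toy ontology: sorted list of terms in at least one likely relationship."""
    return sorted({t for pair, p in first_space.items() if p > CUTOFF for t in pair})


def second_algorithm(corpus, ontology):
    """Toy second cluster space: document indices grouped by the
    ontology terms each document contains."""
    space = {}
    for i, doc in enumerate(corpus):
        key = tuple(t for t in ontology if t in tokens(doc))
        space.setdefault(key, []).append(i)
    return space


def relevant_information(other_corpora, first_space, second_space, threshold):
    """Keep documents from other corpora whose share of the terms drawn
    from both cluster spaces exceeds the threshold."""
    terms = {t for pair, p in first_space.items() if p > CUTOFF for t in pair}
    terms |= {t for key in second_space for t in key}
    hits = []
    for name, docs in other_corpora.items():
        for doc in docs:
            score = len(terms & tokens(doc)) / len(terms) if terms else 0.0
            if score > threshold:
                hits.append((name, doc, score))
    return hits


def run_pipeline(corpus, other_corpora, chain=None, threshold=0.5):
    """Run the steps in claim order and return (chain, ontology, report)."""
    chain = {} if chain is None else chain
    first_space = first_algorithm(corpus)
    amend_evidence_chain(chain, first_space)
    ontology = standardize_ontology(first_space)
    second_space = second_algorithm(corpus, ontology)
    hits = relevant_information(other_corpora, first_space, second_space, threshold)
    report = [f"{name}: {doc!r} (relevance {score:.2f})" for name, doc, score in hits]
    return chain, ontology, report
```

Calling run_pipeline with a small list of strings as the corpus and a dictionary of named document lists as the other corpora returns the amended evidence chain, the standardized ontology, and the summary report lines. Passing an existing chain shows the superseding step: an earlier probability stored for a term pair is overwritten by the newer finding from the first cluster space.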

  • 2 Assignments