×

Methods for generating natural language processing systems

  • US 10,127,214 B2
  • Filed: 12/09/2015
  • Issued: 11/13/2018
  • Est. Priority Date: 12/09/2014
  • Status: Active Grant
First Claim
Patent Images

1. A method for generating a natural language model, the method comprising:

  • ingesting, by a natural language platform comprising at least one processor coupled to at least one memory, training data representative of documents to be analyzed by the natural language model, wherein the training data includes at least one of a first document and a portion of the first document;

    generating, by the natural language platform and based on topical content within the training data, a hierarchical data structure, the hierarchical data structure comprising at least two topical nodes, wherein the at least two topical nodes represent partitions organized by two or more topical themes among the topical content of the training data within which the training data is to be subdivided into;

    selecting among the training data, by the natural language platform, a plurality of documents to be annotated;

    determining, by the natural language platform, for each document among the plurality of documents, a level of ambiguity in interpreting said document that the natural language platform is trying to resolve, wherein the level of ambiguity is dependent upon information currently possessed by the natural language platform;

    generating, by the natural language platform, an annotation prompt for each document among the plurality of documents to be annotated, said annotation prompt being dynamically generated as either a first level prompt corresponding to a first level of specificity or a second level prompt corresponding to a second level of specificity, wherein both the first level prompt and the second level prompt comprise a human readable textual instruction generated by the natural language platform worded according to the first level of specificity or the second level of specificity, and the first level prompt and the second level prompt are presented alternatively,the first level of specificity and the second level of specificity corresponding to the level of ambiguity of said document,said annotation prompt configured to elicit an annotation about said document designed to resolve said level of ambiguity and indicating which node among the at least two topical nodes of the hierarchical data structure said document is to be classified into,wherein the first level of specificity comprises a first level of true-or-false question and the second level of specificity comprises a multiple-choice question comprising at least three options, wherein the first level of specificity corresponds to a lower level of ambiguity than the second level of specificity;

    causing display of, by the natural language platform, the annotation prompt for each document among the plurality of documents to be annotated;

    receiving, by the natural language platform, for each document among the plurality of documents to be annotated, the annotation in response to the displayed annotation prompt; and

    generating, by the natural language platform, the natural language model using an adaptive machine learning process configured to determine, among the received annotations, patterns for how the documents in the training data are to be subdivided according to the at least two topical nodes of the hierarchical data structure.

View all claims
  • 12 Assignments
Timeline View
Assignment View
    ×
    ×