×

Scalable ground truth disambiguation

  • US 10,572,826 B2
  • Filed: 04/18/2017
  • Issued: 02/25/2020
  • Est. Priority Date: 04/18/2017
  • Status: Active Grant
First Claim
Patent Images

1. A computer implemented method for disambiguating training data in natural language classification (NLC), comprising:

  • obtaining, by one or more processor of a computer, an utterance input from a user agent;

    collecting, by the one or more processor, context data of the utterance input from the user agent, wherein the context data describes circumstances of the utterance input;

    generating, by the one or more processor, a context tag of one or more context tag based on the context data, wherein the one or more context tag corresponds to the utterance input;

    selecting, by the one or more processor, one or more ground truth from the training data by use of the utterance input and the context tag, wherein each of the one or more ground truth respectively includes an utterance and an intent, wherein the utterance of each ground truth is semantically identical to the utterance input, and wherein the intent of each ground truth is semantically consistent with the context tag; and

    updating, by the one or more processor, the one or more ground truth by attaching the context tag, wherein the selecting is performed by invoking a machine learning process with the utterance input and the context tag so that the machine learning process provides a first ground truth having a first utterance and a first intent, wherein the updating the one or more ground truth by attaching the context tag includes updating the first ground truth so that the first ground truth includes the context tag, and training the machine learning process using first training data, wherein the first training data used to train the machine learning process includes the first ground truth tagged with the context tag and having the first utterance and the first intent.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×