×

System and method for learning latent representations for natural language tasks

  • US 9,135,241 B2
  • Filed: 12/08/2010
  • Issued: 09/15/2015
  • Est. Priority Date: 12/08/2010
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • analyzing, for a first natural language processing task, a first natural language corpus to generate a latent representation for words in the first natural language corpus;

    calculating, for each word in the latent representation, a Euclidian distance between a left context of the each word and a right context of the each word, to yield a centroid of latent vectors for each word in the latent representation;

    analyzing, for a second natural language processing task, a second natural language corpus having a target word, the target word being a word that is not in the first natural language corpus; and

    predicting, via a processor, a label for the target word based on the latent representation and the centroid of latent vectors for each word in the latent representation, wherein the predicting comprises iteratively executing an alternating gradient descent algorithm until convergence, the alternating gradient descent algorithm comprising, for each iteration, computing a low dimensional continuous embedding and passing the low dimensional continuous embedding through a multi-layer perceptron.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×