Translating terms using numeric representations

  • US 9,805,028 B1
  • Filed: 09/17/2015
  • Issued: 10/31/2017
  • Est. Priority Date: 09/17/2014
  • Status: Active Grant
First Claim

1. A method comprising:

  • maintaining data that associates each term in a vocabulary of terms in a first language with a respective high-dimensional representation of the term, wherein the high-dimensional representation of the term is a numeric representation of the term in a high-dimensional space, and wherein positions of high-dimensional representations of terms from the vocabulary of terms in the first language in the high-dimensional space reflect syntactic similarities, semantic similarities, or both between the terms from the vocabulary of terms in the first language;

    maintaining data that associates each term in a vocabulary of terms in a second language with a respective high-dimensional representation of the term, wherein the high-dimensional representation of the term is a numeric representation of the term in the high-dimensional space, and wherein positions of high-dimensional representations of terms from the vocabulary of terms in the second language in the high-dimensional space reflect syntactic similarities, semantic similarities, or both between the terms from the vocabulary of terms in the second language;

    receiving a first language term, wherein the first language term is a term from the vocabulary of terms in the first language; and

    determining a translation into the second language of the first language term from the high-dimensional representation of the first language term and the high-dimensional representations of terms in the vocabulary of terms in the second language, wherein determining the translation into the second language of the first language term comprises:

    identifying a high-dimensional representation of the first language term;

    applying a transformation to the high-dimensional representation of the first language term to generate a transformed representation, wherein applying the transformation to the high-dimensional representation of the first language term comprises applying the transformation in accordance with trained values of a set of parameters, the trained values of the set of parameters having been determined through applying a machine learning training procedure on training terms in the first language and a respective translation of each of the training terms into the second language;

    selecting, from the high-dimensional representations of the terms in the vocabulary of terms in the second language, a closest high-dimensional representation to the transformed representation; and

    selecting the term in the second language that is associated with the closest high-dimensional representation as the translation into the second language of the first language term.
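
The steps recited in the claim can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the toy two-dimensional vectors, the English and Spanish vocabularies, the choice of a linear map as the transformation, plain gradient descent on squared error as the training procedure, cosine similarity as the measure of "closest", and the helper names `train`, `translate`, `matvec`, and `cosine`.

```python
import math

# Hypothetical toy embeddings; real representations would have far more dimensions.
FIRST_LANG = {"one": [1.0, 0.0], "two": [0.0, 1.0], "three": [1.0, 1.0]}
SECOND_LANG = {"uno": [0.0, 2.0], "dos": [-2.0, 0.0], "tres": [-2.0, 2.0]}

# Training terms in the first language paired with their known translations.
TRAINING_PAIRS = [("one", "uno"), ("two", "dos")]


def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * xj for w, xj in zip(row, x)) for row in W]


def train(pairs, dim, lr=0.1, epochs=200):
    """Fit W so that W @ x is close to z for each (x, z) pair.

    Uses stochastic gradient descent on the squared error ||W x - z||^2.
    """
    W = [[0.0] * dim for _ in range(dim)]
    for _ in range(epochs):
        for x, z in pairs:
            pred = matvec(W, x)
            for i in range(dim):
                err = pred[i] - z[i]
                for j in range(dim):
                    W[i][j] -= lr * 2.0 * err * x[j]
    return W


def cosine(a, b):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(p * q for p, q in zip(a, b))
    norm_a = math.sqrt(sum(p * p for p in a))
    norm_b = math.sqrt(sum(q * q for q in b))
    return dot / (norm_a * norm_b)


def translate(term, W):
    """Transform the term's vector and return the closest second-language term."""
    transformed = matvec(W, FIRST_LANG[term])
    return max(SECOND_LANG, key=lambda t: cosine(transformed, SECOND_LANG[t]))


W = train(
    [(FIRST_LANG[src], SECOND_LANG[dst]) for src, dst in TRAINING_PAIRS],
    dim=2,
)
print(translate("three", W))
```

In this sketch "three" is not among the training pairs, yet the learned map carries its vector next to the vector for "tres". That only works because the toy second-language vectors were constructed as a rotated and scaled copy of the first-language ones; real embedding spaces would be related only approximately.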
