MACHINE TRANSLATION IN CONTINUOUS SPACE
First Claim
Patent Images
1. A method for training a statistical machine translation model, comprising:
- creating a source word versus target word co-occurrence matrix to define word pairs;
reducing dimensionality of the matrix;
mapping word pairs as vectors into continuous space where the word pairs are vectors of continuous real numbers and not discrete entities in the continuous space; and
training a machine translation parametric model using an acoustic model training method based on word pair vectors in the continuous space.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method for training a statistical machine translation model and decoding or translating using the same is disclosed. A source word versus target word co-occurrence matrix is created to define word pairs. Dimensionality of the matrix may be reduced. Word pairs are mapped as vectors into continuous space where the word pairs are vectors of continuous real numbers and not discrete entities in the continuous space. A machine translation parametric model is trained using an acoustic model training method based on word pair vectors in the continuous space.
41 Citations
18 Claims
-
1. A method for training a statistical machine translation model, comprising:
-
creating a source word versus target word co-occurrence matrix to define word pairs; reducing dimensionality of the matrix; mapping word pairs as vectors into continuous space where the word pairs are vectors of continuous real numbers and not discrete entities in the continuous space; and training a machine translation parametric model using an acoustic model training method based on word pair vectors in the continuous space. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer readable medium comprising a computer readable program for training a statistical machine translation model, wherein the computer readable program when executed on a computer causes the computer to perform the steps of:
-
creating a source word versus target word co-occurrence matrix to define word pairs; reducing dimensionality of the matrix; mapping word pairs as vectors into continuous space where the word pairs are vectors of continuous real numbers and not discrete entities in the continuous space; and training a machine translation parametric model using an acoustic model training method based on word pair vectors in the continuous space.
-
-
11. A method for machine translation, comprising:
-
adapting a parametric model in accordance with a given state, the model including mapped word pairs as vectors in continuous space where the word pairs are vectors of continuous real numbers and not discrete entities in the continuous space, and the model being trained using an acoustic model training method based on word pair vectors in the continuous space; computing a pair-probability of a word or phrase based on word-pair translation probabilities using the parametric model; and translating the word or phrase in accordance with a machine translation search that employs the translation probabilities. - View Dependent Claims (12, 13, 14)
-
-
15. A computer readable medium comprising a computer readable program for machine translation, wherein the computer readable program when executed on a computer causes the computer to perform the steps of:
-
adapting a parametric model in accordance with a given state, the model including mapped word pairs as vectors in continuous space where the word pairs are vectors of continuous real numbers and not discrete entities in the continuous space, and the model being trained using an acoustic model training method based on word pair vectors in the continuous space; computing a pair-probability of a word or phrase based on word-pair translation probabilities using the parametric model; and translating the word or phrase in accordance with a machine translation search that employs the translation probabilities. - View Dependent Claims (16, 17, 18)
-
Specification