Discriminative Syntactic Word Order Model for Machine Translation
First Claim
1. A method comprising:
- forming a source dependency tree for a source sentence in a source language, the source dependency tree indicating the syntactic hierarchy of source words in the source sentence;
forming a target dependency tree indicating the syntactic hierarchy of target words in a target language that are translations of source words in the source sentence;
identifying a set of target word orders, where each target word order that is identified is projective with respect to the target dependency tree;
using a discriminatively trained word order model to identify a most likely target word order from the set of target word orders, wherein for each set of target word orders, the discriminatively trained word order model uses features based on information in the source dependency tree and the target dependency tree and features based on the order of words in the target word order.
2 Assignments
0 Petitions
Accused Products
Abstract
A discriminatively trained word order model is used to identify a most likely word order from a set of word orders for target words translated from a source sentence. For each set of word orders, the discriminatively trained word order model uses features based on information in a source dependency tree and a target dependency tree and features based on the order of words in the word order. The discriminatively trained statistical model is trained by determining a translation metric for each of a set of N-best word orders for a set of target words. Each of the N-best word orders are projective with respect to a target dependency tree and the N-best word orders are selected using a combination of an n-gram language model and a local tree order model.
38 Citations
20 Claims
-
1. A method comprising:
-
forming a source dependency tree for a source sentence in a source language, the source dependency tree indicating the syntactic hierarchy of source words in the source sentence; forming a target dependency tree indicating the syntactic hierarchy of target words in a target language that are translations of source words in the source sentence; identifying a set of target word orders, where each target word order that is identified is projective with respect to the target dependency tree; using a discriminatively trained word order model to identify a most likely target word order from the set of target word orders, wherein for each set of target word orders, the discriminatively trained word order model uses features based on information in the source dependency tree and the target dependency tree and features based on the order of words in the target word order. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method comprising:
-
receiving a target dependency tree comprising target words translated from source words of a source dependency tree, the target dependency tree and the source dependency tree providing hierarchical relationships between words; scoring a plurality of word orders for the target words that are projective with respect to the target dependency tree to form word order scores, wherein scoring a word order comprises; determining an n-gram language model probability for the word order; and determining a local tree order model probability that is based on information from the source dependency tree, the target dependency tree and the word order; using the word order scores to select a smaller set of word orders for the target words from the plurality of word orders for the target words; using the smaller set of word orders of the target words to discriminatively train a model for selecting orders of target words; and storing the parameters for the model on a computer-readable storage medium for use in ordering words in translations. - View Dependent Claims (8, 9, 10, 11)
-
-
12. A computer-readable storage medium having computer-executable instructions for performing steps comprising:
-
receiving a target dependency tree for a target sentence having words ordered in a first word order, wherein the target dependency tree is formed based on a machine translation from a source language to a target language, reordering words in the target sentence to identify a second word order through steps comprising; while limiting consideration of word orders to those word orders that are projective with respect to the target dependency tree, using feature values derived from the target dependency tree to determine likelihoods for a plurality of possible orders for the target words; and selecting one of the orders for the target words based on the likelihoods. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20)
-
Specification