Word alignment score computing apparatus, word alignment apparatus, and computer program
First Claim
1. A word alignment score computing apparatus for computing a score of word alignment of a bilingual sentence pair of first and second languages, comprising a processor configured to:
responsive to said bilingual sentence pair and a word alignment for the bilingual sentence pair, successively select words of a sentence in said first language of said bilingual sentence pair in a prescribed order; and
compute, for every word of the sentence in said first language of said bilingual sentence pair, a score representing a probability of the selected word and a word in said second language aligned with said word by said word alignment forming a correct word pair, and based on this score, for computing a score of said word alignment;
wherein;
in computing a score of a certain word pair, said processor computes the score of the certain word pair based on all alignments of the selected words preceding that word in said first language which forms the certain word pair;
utilize a recurrent neural network having a first input receiving a word selected by said processor and a second input receiving a word in said second language aligned with said word by said word alignment, and apply the selected word and the word aligned with the word by said word alignment to said first and second inputs, respectively;
wherein said recurrent neural network includes;
an input layer having said first and second inputs, and computing and outputting word embedding vectors from words respectively applied to said first and second inputs, a hidden layer connected to receive outputs of said input layer for generating, by a predetermined non-linear operation, a vector representing a relation between two outputs from said input layer, and an output layer computing and outputting said score based on the output of said hidden layer; and
wherein the output of said hidden layer is applied as an input to said hidden layer when a next word pair is given to said word alignment score computing apparatus.
Abstract
[Object] An object is to provide an apparatus for attaining highly precise word alignment.
[Solution] The apparatus includes: selecting means receiving a bilingual sentence pair and a word alignment for the bilingual sentence pair, for successively selecting words f_j of a sentence in a first language in a prescribed order; and a recurrent neural network (RNN) 100, computing, for all words of the sentence in the first language, a score 102 representing a probability that a word pair consisting of the word f_j and a word e_{a_j} aligned with the word f_j by a word alignment a_j in a second language of the bilingual sentence pair is a correct word pair, and based on this score, for computing a score of the word alignment a_j. When computing a score of word pair (f_j, e_{a_j}), RNN 100 computes a score 102 of the word pair (f_j, e_{a_j}) based on all word alignments a_1^{j-1} selected by the selecting means prior to the word f_j of the word pair (f_j, e_{a_j}), of the word alignments a_j, by means of a recurrent connection 118.
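The scoring scheme summarized in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class name `TinyAlignmentRNN`, the layer sizes, the random untrained weights, the `tanh` non-linearity, the sigmoid output, and the choice to multiply per-pair scores into an alignment score (the combination recited in claim 6) are all assumptions made for illustration only.

```python
import math
import random


class TinyAlignmentRNN:
    """Toy scorer: embedding input layer -> recurrent hidden layer -> scalar score.

    All weights are random and untrained; every name and size is hypothetical.
    """

    def __init__(self, src_vocab, tgt_vocab, emb_dim=4, hid_dim=5, seed=0):
        rng = random.Random(seed)

        def rand(rows, cols):
            return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

        self.src_index = {w: i for i, w in enumerate(src_vocab)}
        self.tgt_index = {w: i for i, w in enumerate(tgt_vocab)}
        self.src_emb = rand(len(src_vocab), emb_dim)   # first-input embeddings
        self.tgt_emb = rand(len(tgt_vocab), emb_dim)   # second-input embeddings
        self.w_in = rand(hid_dim, 2 * emb_dim)         # input layer -> hidden layer
        self.w_rec = rand(hid_dim, hid_dim)            # recurrent connection
        self.b_hid = [0.0] * hid_dim
        self.w_out = [rng.uniform(-0.5, 0.5) for _ in range(hid_dim)]
        self.b_out = 0.0
        self.hid_dim = hid_dim

    def step(self, f_word, e_word, prev_hidden):
        """Score one word pair given the hidden state left by earlier pairs."""
        # Input layer: look up both embeddings and concatenate them.
        x = self.src_emb[self.src_index[f_word]] + self.tgt_emb[self.tgt_index[e_word]]
        # Hidden layer: non-linear mix of the current pair and the previous state.
        hidden = []
        for row_in, row_rec, bias in zip(self.w_in, self.w_rec, self.b_hid):
            z = bias
            z += sum(w * v for w, v in zip(row_in, x))
            z += sum(w * h for w, h in zip(row_rec, prev_hidden))
            hidden.append(math.tanh(z))
        # Output layer: squash to (0, 1) so the score can be read as a probability.
        logit = self.b_out + sum(w * h for w, h in zip(self.w_out, hidden))
        return 1.0 / (1.0 + math.exp(-logit)), hidden

    def score_alignment(self, f_sentence, e_sentence, alignment):
        """alignment[j] is the index in e_sentence aligned with f_sentence[j]."""
        hidden = [0.0] * self.hid_dim
        total = 1.0
        for j, f_word in enumerate(f_sentence):  # prescribed order: left to right
            score, hidden = self.step(f_word, e_sentence[alignment[j]], hidden)
            total *= score
        return total


model = TinyAlignmentRNN(["das", "haus"], ["the", "house"])
f_sent, e_sent = ["das", "haus"], ["the", "house"]
print(model.score_alignment(f_sent, e_sent, [0, 1]))
print(model.score_alignment(f_sent, e_sent, [1, 0]))
```

Because the hidden state is fed back at each step, the score of a later word pair depends on every pair scored before it, which is the role the abstract assigns to recurrent connection 118. With untrained weights the printed values carry no linguistic meaning.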
14 Claims
1. A word alignment score computing apparatus for computing a score of word alignment of a bilingual sentence pair of first and second languages, comprising a processor configured to:
responsive to said bilingual sentence pair and a word alignment for the bilingual sentence pair, successively select words of a sentence in said first language of said bilingual sentence pair in a prescribed order; and compute, for every word of the sentence in said first language of said bilingual sentence pair, a score representing a probability of the selected word and a word in said second language aligned with said word by said word alignment forming a correct word pair, and based on this score, for computing a score of said word alignment;
wherein; in computing a score of a certain word pair, said processor computes the score of the certain word pair based on all alignments of the selected words preceding that word in said first language which forms the certain word pair; utilize a recurrent neural network having a first input receiving a word selected by said processor and a second input receiving a word in said second language aligned with said word by said word alignment, and apply the selected word and the word aligned with the word by said word alignment to said first and second inputs, respectively; wherein said recurrent neural network includes; an input layer having said first and second inputs, and computing and outputting word embedding vectors from words respectively applied to said first and second inputs, a hidden layer connected to receive outputs of said input layer for generating, by a predetermined non-linear operation, a vector representing a relation between two outputs from said input layer, and an output layer computing and outputting said score based on the output of said hidden layer; and wherein the output of said hidden layer is applied as an input to said hidden layer when a next word pair is given to said word alignment score computing apparatus. - View Dependent Claims (2, 3, 10, 14)
4. A word alignment score computing apparatus for computing a score of word alignment of a bilingual sentence pair of first and second languages, comprising a processor configured to:
responsive to said bilingual sentence pair and a word alignment for the bilingual sentence pair, successively select words of a sentence in said first language of said bilingual sentence pair in a prescribed order; and compute, for every word of the sentence in said first language of said bilingual sentence pair, a score representing a probability of the selected word and a word in said second language aligned with said word by said word alignment forming a correct word pair, and based on this score, for computing a score of said word alignment;
wherein; in computing a score of a certain word pair, said processor computes the score of the certain word pair based on all alignments of the selected words preceding that word in said first language which forms the certain word pair; successively select words of the sentence in said first language of said bilingual sentence pair starting from a beginning of the sentence in said first language; utilize a recurrent neural network having a first input receiving the selected word and a second input receiving a word in said second language aligned with said word by said word alignment, and apply the selected word and the word aligned with the word by said word alignment to said first and second inputs, respectively; wherein said recurrent neural network includes; an input layer having said first and second inputs, and computing and outputting word embedding vectors from words respectively applied to said first and second inputs, a hidden layer connected to receive outputs of said input layer, for generating, by a predetermined non-linear operation, a vector representing a relation between two outputs from said input layer, and an output layer computing and outputting said score based on the output of said hidden layer; and wherein the output of said hidden layer is applied as an input to said hidden layer when a next word pair is given to said word alignment score computing apparatus. - View Dependent Claims (7, 11)
5. A word alignment score computing apparatus for computing a score of word alignment of a bilingual sentence pair of first and second languages, comprising a processor configured to:
responsive to said bilingual sentence pair and a word alignment for the bilingual sentence pair, successively select words of a sentence in said first language of said bilingual sentence pair in a prescribed order; and compute, for every word of the sentence in said first language of said bilingual sentence pair, a score representing a probability of the selected word and a word in said second language aligned with said word by said word alignment forming a correct word pair, and based on this score, for computing a score of said word alignment;
wherein; in computing a score of a certain word pair, said processor computes the score of the certain word pair based on all alignments of selected words preceding that word in said first language which forms the certain word pair; compute a score representing a probability of a word pair consisting of the selected word and a word in a sentence in said second language of said bilingual sentence pair aligned with said word by said word alignment being a correct word pair; compute, based on scores of all words of the sentence in said first language of said bilingual sentence pair, the score of said word alignment; utilize a recurrent neural network having a first input receiving the selected word and a second input receiving a word in said second language aligned with said word by said word alignment, and apply the selected word and the word aligned with the word by said word alignment to said first and second inputs, respectively; wherein said recurrent neural network includes; an input layer having said first and second inputs, and computing and outputting word embedding vectors from words respectively applied to said first and second inputs, a hidden layer connected to receive outputs of said input layer, for generating, by a predetermined non-linear operation, a vector representing a relation between two outputs from said input layer, and an output layer computing and outputting said score based on the output of said hidden layer; and wherein the output of said hidden layer is applied as an input to said hidden layer when a next word pair is given to said word alignment score computing apparatus. - View Dependent Claims (8, 12)
6. A word alignment score computing apparatus for computing a score of word alignment of a bilingual sentence pair of first and second languages comprising a processor configured to,
responsive to said bilingual sentence pair and a word alignment for the bilingual sentence pair, successively select words of a sentence in said first language of said bilingual sentence pair in a prescribed order; and
compute, for every word of the sentence in said first language of said bilingual sentence pair, a score representing a probability of the selected word and a word in said second language aligned with said word by said word alignment forming a correct word pair, and based on this score, for computing a score of said word alignment;
wherein; in computing a score of a certain word pair, said processor computes the score of the certain word pair based on all alignments of the selected words preceding that word in said first language which forms the certain word pair; compute a score representing a probability of a word pair consisting of the selected word and a word in a sentence in said second language of said bilingual sentence pair aligned with said word by said word alignment being a correct word pair; compute, based on scores of all words of the sentence in said first language of said bilingual sentence pair computed by said processor, the score of said word alignment; utilize a recurrent neural network having a first input receiving the selected word and a second input receiving a word in said second language aligned with said word by said word alignment, and apply the selected word and the word aligned with the word by said word alignment to said first and second inputs, respectively; compute the score of said word alignment by multiplying the scores of all words of the sentence in said first language of said bilingual sentence pair computed by said processor; wherein said recurrent neural network includes; an input layer having said first and second inputs, and computing and outputting word embedding vectors from words respectively applied to said first and second inputs, a hidden layer connected to receive outputs of said input layer, for generating, by a predetermined non-linear operation, a vector representing a relation between two outputs from said input layer, and an output layer computing and outputting said score based on the output of said hidden layer; and wherein the output of said hidden layer is applied as an input to said hidden layer when a next word pair is given to said word alignment score computing apparatus. - View Dependent Claims (9, 13)
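Read together, the limitations of claim 6 describe the alignment score as a product of per-word scores, each conditioned on the alignments chosen for the preceding words. One way to write this is shown below. The notation is an assumption made here for illustration and is not recited in the claim: s is the score of the whole alignment, t is the per-pair score output by the recurrent neural network, J and I are the lengths of the first- and second-language sentences, f_j is the j-th first-language word, e_{a_j} is the second-language word aligned with it, and a_1^{j-1} is the alignment history.

```latex
s\left(a_1^J \mid f_1^J, e_1^I\right) = \prod_{j=1}^{J} t\left(a_j \mid a_1^{j-1}, f_j, e_{a_j}\right)
```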
Specification