
Machine translation device and machine translation method in which a syntax conversion model and a word translation model are combined

  • US 10,198,437 B2
  • Filed: 07/20/2011
  • Issued: 02/05/2019
  • Est. Priority Date: 11/05/2010
  • Status: Active Grant
First Claim

1. A statistical machine translation device, comprising:

  • a language model generator configured to generate a language model by extracting a creation probability of a language from a single corpus configured by a target language;

    a syntax conversion knowledge extractor configured to:

    extract syntax conversion knowledge for the target language by using word reordering information between a source language and the target language in a plurality of parallel corpora that does not include the single corpus, and syntax analysis information of the source language, and

    calculate a syntax conversion probability with respect to the syntax conversion knowledge corresponding to the plurality of parallel corpora that does not include the single corpus;

    a word translation knowledge extractor configured to:

    extract word translation knowledge by using the word reordering information and the syntax analysis information, and

    calculate a word translation probability with respect to the word translation knowledge based on a feature function in which a predetermined constraint condition is defined in the word reordering information and the syntax analysis information;

    a translation model learning device configured to generate a syntax conversion model and a word translation model by learning the syntax conversion knowledge, the word translation knowledge, the syntax conversion probability and the word translation probability; and

    a translated sentence generator configured to:

    decode a source sentence into the target sentence by applying the syntax conversion model and the word translation model; and

    generate a target vocabulary string having a high probability into a final translation sentence by combining the syntax conversion probability and the creation probability,

    wherein the syntax conversion knowledge extractor includes:

    a tree generator configured to generate a target tree of the target language by using the word reordering information and the syntax analysis information,

    a tree node reorderer configured to reorder nodes based on the target tree and a source tree depending on the syntax analysis information of the source language,

    a tree conversion knowledge extractor configured to extract the syntax conversion knowledge of a sub-tree at each reordered node of the target tree and the source tree, and

    a probability calculator configured to calculate the syntax conversion probability with respect to the syntax conversion knowledge,

    wherein the feature function is a function configured to constrain, from a syntax of the target language and a syntax of the source language, and intersyntax arrangement information between the syntax of the target language and the syntax of the source language:

    a part of speech string of the target language, and

    a translation order of words included in the source language,

    and output the constrained part of speech string and translation order as a feature.
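
As an informal illustration only, and not the patented implementation, the following Python sketch shows one way the probabilities named in the claim could be estimated and combined. The claim does not specify an estimation method, so the unigram language model, the relative-frequency estimate of the syntax conversion probability, the log-space combination, and every function and parameter name here are assumptions made for the example.

```python
import math
from collections import Counter


def unigram_language_model(target_corpus):
    """Assumed stand-in for the 'creation probability': relative-frequency
    unigram probabilities from a target-language-only corpus."""
    counts = Counter(w for sentence in target_corpus for w in sentence.split())
    total = sum(counts.values())
    return {word: n / total for word, n in counts.items()}


def conversion_probabilities(rule_pairs):
    """Assumed estimate of P(target sub-tree | source sub-tree): relative
    frequency over (source, target) sub-tree pairs taken from parallel data."""
    pair_counts = Counter(rule_pairs)
    source_counts = Counter(src for src, _ in rule_pairs)
    return {
        (src, tgt): n / source_counts[src]
        for (src, tgt), n in pair_counts.items()
    }


def score(words, conversion_prob, lm, floor=1e-6):
    """Combine the two probabilities in log space; unseen words get a small
    floor probability (a hypothetical smoothing choice)."""
    log_lm = sum(math.log(lm.get(w, floor)) for w in words)
    return math.log(conversion_prob) + log_lm


def best_candidate(candidates, lm):
    """Pick the highest-scoring (words, conversion_prob) candidate."""
    return max(candidates, key=lambda c: score(c[0], c[1], lm))
```

Under these assumptions, a candidate target string is preferred when both its reordering pattern was frequent in the parallel corpora and its words are likely under the monolingual model. A real decoder would also incorporate the word translation probability and search over many more candidates.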
