×

Extracting sentence translations from translated documents

  • US 7,054,803 B2
  • Filed: 12/19/2000
  • Issued: 05/30/2006
  • Est. Priority Date: 12/19/2000
  • Status: Expired due to Fees
First Claim
Patent Images

1. Method of extracting translations from translated texts, the method comprising the steps of:

  • accessing a first text in a first language;

    accessing a second text in a second language, the second language being different from the first language, the second text being a translation of the first text;

    dividing the first text and the second text each into a plurality of textual elements;

    forming a sequence of pairs of text portions from said plurality of textual elements, each pair comprising a text portion of the first text and a text portion of the second text, each text portion comprising zero or more adjacent textual elements, each textual element of the first and the second text being comprised in a text portion of the sequence;

    calculating a pair score of each pair in the sequence using a number of occurrences of each of a plurality of features in the text portions of the respective pair and using a plurality of weights, each weight being assigned to one feature of said plurality of features, wherein the pair scores are calculated by taking, for each feature occurring in the pair, a minimum number of the number of occurrences of the respective feature in the paired text portions, taking a product of the minimum number and the weight assigned to the respective feature, and summing up all the products of all features;

    calculating an alignment score of the sequence using said pair scores, said alignment score indicating the translation quality of the sequence; and

    optimizing said alignment score by systematically searching through the space of alternatives and combining optimal alignments for subsequences into optimal alignments for longer sequences.

View all claims
  • 5 Assignments
Timeline View
Assignment View
    ×
    ×