×

Method and system for automatic management of reputation of translators

  • US 10,261,994 B2
  • Filed: 05/25/2012
  • Issued: 04/16/2019
  • Est. Priority Date: 05/25/2012
  • Status: Active Grant
First Claim
Patent Images

1. A method for reducing processor time and memory during automated scoring of a translation using computation of a hybrid translation edit rate (HyTER) score calculation for a result word set and an exponentially sized reference set in a computing environment, the method comprising:

  • receiving a translation hypothesis at a processor of the computing environment, the translation hypothesis comprising a result word set generated by a human or machine translation system in a target language, the result word set representing a translation of a test word set in a source language;

    developing a search space for automated computation of the HyTER score, the search space comprising a lazy composition of;

    a weighted finite-state acceptor (FSA) executable by the processor of the computing environment that represents a set of allowed permutations of the translation hypothesis and associated distance costs, the allowed permutations of the translation hypothesis constructed on demand according to local window constraints on movement of words within a fixed window size,the exponentially sized reference set of meaning equivalents encoded as a Recursive Transition Network stored in memory of the computing environment and expanded by the processor of the computing environment on demand, anda Levenshtein distance calculation between pairs of the search space comprising allowed permutations of the translation hypothesis and parts of the exponentially sized reference set that do not remain unexpanded, the calculation performed by the processor of the computing environment;

    calculating using the processor of the computing environment the HyTER score for pairs in the search space to identify a pair in the search space having a minimum edit distance, and reducing the number of pairs for the composition for which the Levenshtein distance is calculated to save processor computation time and computer memory used for automated calculations of the HyTER score by constraining a number of paths constructed by the processor on demand by the weighted FSA using the fixed window size, and not constructing permutation paths of the composition outside the window; and

    outputting the HyTER score for the human or machine translation system for the identified pair in the search space having a minimum edit distance, wherein a perfect score indicates that the result word set is an exact match of an acceptable translation in the exponentially sized reference set.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×