TRANSLITERATION USING INDICATOR AND HYBRID GENERATIVE FEATURES
First Claim
Patent Images
1. In a computing environment, a method performed on at least one processor, comprising:
- receiving a source string;
transliterating the source string using one or more discriminatively trained models into a target string; and
outputting the target string.
2 Assignments
0 Petitions
Accused Products
Abstract
Described is a transliteration engine/substring decoder that back-transliterates an input string from a source language into an output string in a target language. The transliteration engine may be based upon discriminately weighted indicator features and/or generative models in which the decoder'"'"'s discriminative parameters are learned. The training data may be based on source-target pairs, which may be transformed into derivations. Features extracted from these derivations include indicator features and hybrid generative model features.
34 Citations
20 Claims
-
1. In a computing environment, a method performed on at least one processor, comprising:
-
receiving a source string; transliterating the source string using one or more discriminatively trained models into a target string; and outputting the target string. - View Dependent Claims (2, 3, 4, 5, 6)
-
- 7. In a computing environment, a system, comprising, a transliteration engine that processes an input string in one language into an output string in another language, the transliteration engine including a decoder that uses one or more generative models, the models corresponding to weighted probabilities, with the weights learned as parameters via discriminative training based upon training data.
- 18. One or more computer-readable media having computer-executable instructions, which when executed perform steps, comprising, discriminatively training generative models for to tune parameters for transliteration, including learning relative weights of probabilities for generative features extracted from training data corresponding to derivations, the generative features comprising hybrid generative models, the probabilities representing emission information, emission information and lexicon related information, and using the discriminatively training generative models in transliteration of a source string to a target string.
Specification