System and method for reverse transliteration using statistical alignment
Abstract
The present invention obtains a set of word pairs. Each word in the set is broken into its component characters, or into clusters of commonly co-occurring characters, and transliteration models are generated from these units using a conventional statistical machine translation algorithm. The transliteration models are then used to recover the correct original-language spelling of a source word from its transliterated form.
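The training step described in the abstract can be sketched as follows. This is only an illustration under stated assumptions: it uses IBM Model 1, one well-known statistical alignment model, which the text does not name; it treats single characters (not clusters) as tokens; it omits the NULL token; and the function names `train_model1` and `align` and the toy word pairs are hypothetical.

```python
from collections import defaultdict


def train_model1(word_pairs, iterations=10):
    """Estimate t[(tgt_char, src_char)] = P(tgt_char | src_char) with EM.

    Assumption: IBM Model 1 stands in for the "conventional statistical
    machine translation algorithm"; characters play the role of words.
    """
    tgt_vocab = {ch for _, tgt in word_pairs for ch in tgt}
    uniform = 1.0 / len(tgt_vocab)
    t = defaultdict(lambda: uniform)  # uniform initialisation

    for _ in range(iterations):
        count = defaultdict(float)  # expected (tgt_char, src_char) counts
        total = defaultdict(float)  # expected counts per src_char
        for src, tgt in word_pairs:
            for f in tgt:
                # E-step: share one unit of mass for f among the source chars
                z = sum(t[(f, e)] for e in src)
                for e in src:
                    c = t[(f, e)] / z
                    count[(f, e)] += c
                    total[e] += c
        # M-step: renormalise the expected counts into probabilities
        t = defaultdict(float, {(f, e): c / total[e] for (f, e), c in count.items()})
    return t


def align(src, tgt, t):
    """Link each target character to its most probable source character."""
    return [(f, max(src, key=lambda e: t[(f, e)])) for f in tgt]


# Hypothetical toy data: "a" is written "x" and "b" is written "y".
pairs = [("ab", "xy"), ("a", "x")]
table = train_model1(pairs)
links = align("ab", "xy", table)
```

On this toy data the table comes to favour the links a-x and b-y, because the second pair disambiguates the first. A fuller model would also need to handle character clusters and unaligned characters.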
15 Claims
1. A method of training a transliteration processing system, comprising:
receiving a set of word pairs from different languages; and
using statistical textual alignment to align characters of each of the word pairs; and
identifying the transliteration relationships based on the aligned characters.
8. A transliteration processing system, comprising:
a textual alignment component configured to receive a set of sentences and identify transliteration relationships between words in the set of words based on alignment of characters of the words.
15. A transliteration processing system, comprising:
a transliteration generator receiving a textual input and generating a transliteration of the textual input based on a transliteration relationship received from a textual alignment component configured to receive a set of sentences and identify transliteration relationships between words in the set of sentences based on statistical alignment of characters in the words in the form of machine translation models.
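As a rough illustration of the transliteration generator in claim 15, the sketch below takes a textual input and a table of transliteration relationships and returns the most probable output. Everything concrete here is an assumption: the table format (cluster to candidate-with-probability), the probabilities, and the names `transliterate` and `toy_relationships` are hypothetical, and the search is a plain dynamic program over segmentations rather than a full machine translation decoder with a language model.

```python
import math


def transliterate(text, relationships, max_cluster=3):
    """Return the highest-probability output for `text`, or None if no
    segmentation of `text` is covered by `relationships`.

    relationships: {input_cluster: {output_string: probability}}
    (hypothetical format, e.g. derived from character alignments).
    """
    # best[i] = (log-probability, output) for the first i input characters
    best = [(-math.inf, "")] * (len(text) + 1)
    best[0] = (0.0, "")
    for end in range(1, len(text) + 1):
        for start in range(max(0, end - max_cluster), end):
            prev_score, prev_out = best[start]
            if prev_score == -math.inf:
                continue  # this prefix cannot be produced at all
            cluster = text[start:end]
            for out, p in relationships.get(cluster, {}).items():
                score = prev_score + math.log(p)
                if score > best[end][0]:
                    best[end] = (score, prev_out + out)
    score, out = best[-1]
    return out if score > -math.inf else None


# Hypothetical relationship table with made-up probabilities.
toy_relationships = {
    "x": {"a": 0.9, "e": 0.1},
    "y": {"b": 1.0},
    "xy": {"o": 0.5},
}

result = transliterate("xy", toy_relationships)
```

Here the two-step segmentation x, y scores higher than the single cluster xy, so the single-character mappings win. Returning None for uncovered input is a design choice of this sketch; a production system would more likely back off to copying the character.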
Specification