System, method to generate transliteration and method for generating decision tree to obtain transliteration
First Claim
1. A method to generate transliteration of source language script into target language script using decision tree based technique with automated supervised learning, said method comprising steps of
i. selecting text to identify predetermined character to train particular instance of usage of identified character;
ii. representing various possible phonetic and/or pseudo-phonetic translations of identified character to generate a multitude of transliterated words;
iii. grouping generated words followed by annotating them with predetermined information;
iv. analyzing annotated words to generate possible decision questions to map the characters under predetermined focus number, wherein the step of analyzing is performed by a computer system; and
v. applying predetermined rules on analyzed words to generate transliteration of the source language into the target language.
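As an illustration only, the decision-question idea in step iv can be sketched as a small decision tree that asks which character sits at a given offset from the focus character. This is a minimal sketch, not the patented method: the training triples, the window size, the `#` padding symbol, the information-gain criterion, and every function name below are assumptions made for the example, and the English-to-Hindi mappings are toy data.

```python
from collections import Counter
import math

PAD = "#"  # hypothetical padding symbol for positions outside the word


def context(word, i, offset):
    """Character at `offset` from focus position `i`, or PAD if out of range."""
    j = i + offset
    return word[j] if 0 <= j < len(word) else PAD


def entropy(labels):
    """Shannon entropy (in bits) of a list of labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())


def candidate_questions(examples, window):
    """All (offset, character) questions observed around the focus character."""
    questions = set()
    for word, i, _ in examples:
        for off in range(-window, window + 1):
            if off != 0:
                questions.add((off, context(word, i, off)))
    return sorted(questions)


def build_tree(examples, window=2):
    """Return a target label (leaf) or a (question, yes_subtree, no_subtree) tuple."""
    labels = [target for _, _, target in examples]
    majority = Counter(labels).most_common(1)[0][0]
    if len(set(labels)) == 1:
        return majority
    base, best = entropy(labels), None
    for off, ch in candidate_questions(examples, window):
        yes = [e for e in examples if context(e[0], e[1], off) == ch]
        no = [e for e in examples if context(e[0], e[1], off) != ch]
        if not yes or not no:
            continue  # the question does not split the data
        remainder = (len(yes) * entropy([e[2] for e in yes])
                     + len(no) * entropy([e[2] for e in no])) / len(examples)
        gain = base - remainder
        if best is None or gain > best[0]:
            best = (gain, (off, ch), yes, no)
    if best is None or best[0] <= 0:
        return majority  # no informative question left
    _, question, yes, no = best
    return (question, build_tree(yes, window), build_tree(no, window))


def classify(tree, word, i):
    """Walk the tree for the focus character at position `i` of `word`."""
    while isinstance(tree, tuple):
        (off, ch), yes, no = tree
        tree = yes if context(word, i, off) == ch else no
    return tree


# Toy annotated examples (assumed): (source word, focus index, target character)
# for the focus character 't'.
EXAMPLES = [
    ("tum", 0, "त"), ("tha", 0, "थ"), ("bat", 2, "त"), ("hath", 2, "थ"),
    ("tota", 0, "त"), ("tota", 2, "त"), ("sath", 2, "थ"),
]

tree = build_tree(EXAMPLES)
print(tree)
print(classify(tree, "thoda", 0), classify(tree, "tel", 0))
```

On this toy data the single question "is the next character `h`?" separates the two targets, so the learned tree has one decision node. A realistic system would need far more annotated words and a tree per focus character.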
1 Assignment
0 Petitions
Abstract
The transliteration scheme developed as the present invention is contingent upon two languages, namely the keyboard language and the target language. While the invention is independent of the actual languages in question, save to the extent that they must be phonetic or pseudo-phonetic and the keyboard language must possess a script, it is nonetheless convenient to cast the invention within the framework of two specific languages. In this regard, English has been chosen as the keyboard language, and Hindi (a language widely spoken in India) has been chosen as the target language. An understanding of the current invention is possible without any knowledge of Hindi by a practitioner skilled in the art.
332 Citations
27 Claims
1. A method to generate transliteration of source language script into target language script using decision tree based technique with automated supervised learning, said method comprising steps of
i. selecting text to identify predetermined character to train particular instance of usage of identified character;
ii. representing various possible phonetic and/or pseudo-phonetic translations of identified character to generate a multitude of transliterated words;
iii. grouping generated words followed by annotating them with predetermined information;
iv. analyzing annotated words to generate possible decision questions to map the characters under predetermined focus number, wherein the step of analyzing is performed by a computer system; and
v. applying predetermined rules on analyzed words to generate transliteration of the source language into the target language.
Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 21, 22, 23, 24, 25, 26, 27)
20. The method as claimed in claim 1, wherein the method, with heuristics to decide whether the input is for predictive or non-predictive rule-based transliteration, provides options for rule-based input as well without the user explicitly having to choose a user interface element to switch between the predictive and non-predictive modes.
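The claim does not state which heuristics are used, so the following is only a hypothetical sketch of how such a mode decision might look. It assumes a rule-based input convention in which capitals inside a word and a few marker characters carry meaning; the marker set, the function names, and the mode labels are all invented for illustration.

```python
# Hypothetical marker characters assumed to signal rule-based input.
RULE_MARKERS = set("^~.")


def looks_rule_based(token: str) -> bool:
    """Guess whether a typed token follows an explicit rule-based scheme.

    Assumed heuristic: a marker character anywhere in the token, or a capital
    letter after the first position, suggests deliberate rule-based typing.
    """
    has_marker = any(ch in RULE_MARKERS for ch in token)
    has_inner_capital = any(ch.isupper() for ch in token[1:])
    return has_marker or has_inner_capital


def choose_mode(token: str) -> str:
    """Pick a transliteration mode without an explicit user-interface switch."""
    return "rule-based" if looks_rule_based(token) else "predictive"


for word in ["namaste", "Delhi", "rAm", "j~na"]:
    print(word, "->", choose_mode(word))
```

A leading capital alone is treated as ordinary typing here, so a word such as `Delhi` stays in predictive mode under these assumptions.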
Specification