System and method for writing Indian languages using English alphabet
First Claim
1. A transliteration system comprising:
an input device inputting at least one English character;
an output device outputting at least one Indian language character;
a computer program converting the said input to the said output based on a character mapping table and a set of character mapping rules;
the said character mapping table;
specifying a superset of Indian language characters, where;
English characters, A, E, I, O, U, AA, EE, II, OO, UU, AI, AU, RW, RWU, LW, LWU, AMX, ANX, AM, and AHX are assigned respectively to Indian language vowels, and , where Indian language vowels are shown in Malayalam except for RWU, LW, LWU, AMX, and ANX, which are shown in Devanagari;
English characters, K, KH, G, GH, NG, C, CH, J, JH, NJ, T, TD, D, DT, NH, TH, TJ, DJ, DH, N, P, F, B, BH, M, Y, R, RH, L, LH, ZH, V, S, SH, Z, H, Q, KQ, GW, JW, DW, DTW, NN, PF, YW, TT, and NT are assigned respectively to Indian language consonants, S, O, and , where Indian language consonants are shown in Malayalam except for KQ, GW, JW, DW, DTW, PF, and YW, which are shown in Devanagari, and NN, which is shown in Tamil;
English characters, A, E, I, O, U, AA, EE, II, OO, UU, AI, AU, RW, RWU, LW, LWU, AMX, ANX, AM, and AHX are also assigned respectively to Indian language vowel symbols, none, .., . . . . . and , where Indian language vowel symbols are shown in Malayalam, with dots showing position of accompanying consonant, except for RWU, LW, LWU, AMX, and ANX, which are shown in Devanagari;
English characters, L, LH, N, NH, and R are also assigned respectively to Indian language semi-consonants, and , where Indian language semi-consonants are shown in Malayalam; and
period, also known as dot character, is assigned to Indian language character I, which is shown in Devanagari;
specifying for each Indian language vowel, its full form and its corresponding vowel symbol;
identifying for each Indian language vowel symbol, part of the vowel symbol to be placed before or after a consonant;
identifying for each Indian language vowel symbol, part of the vowel symbol to be placed before and part to be placed after a consonant;
specifying for each Indian language consonant, its full form, its short form when appearing before another consonant, and its short form when appearing after another consonant;
identifying set of characters supported by an Indian language;
mapping several English character sequences to same character in an Indian language;
specifying English character sequences to identify special characters in an Indian language; and
specifying Indian language character formation on a per font basis;
the said character mapping rules include;
when choosing between the said Indian language vowels and vowel symbols, selecting Indian language vowels when they are not preceded by one of the said consonants;
when choosing between the said Indian language vowels and vowel symbols, selecting Indian language vowel symbols when they are preceded by one of the said consonants;
when placing Indian language vowel symbols with a consonant, placing them before, after, or before and after the consonant they are joined with according to the said character mapping table;
selecting the said full form of Indian language consonants when they are not combined with another consonant;
selecting the said short form of Indian language consonants when they are combined with another consonant according to the said character mapping table;
placing the said short form of Indian language consonants before or after the consonants they are joined with according to the said character mapping table;
selecting the said full form of the consonant for output when no short form of Indian language consonant is given in the said character mapping table;
placing Indian language character , which is shown in Malayalam, following first consonant, indicated by two dots, when two consonants appear next to each other in the said output;
when converting input character X, selecting Indian language character , which is shown in Malayalam, if it is immediately preceded by one of the said Indian language consonants;
when converting input character X, selecting Indian language character , which is shown in Devanagari, if it appears at end of a word and is not immediately preceded by one of the said Indian language consonants;
when converting input character X, selecting Indian language character , which is shown in Devanagari, if character A follows input character X;
when choosing between multiple characters in Indian language, selecting Indian language character that matches longest input string;
not converting input character sequences, which are not specified in the said character mapping table;
converting input character X to no character if character X is not mapped according to the said character mapping table and other character mapping rules;
converting Indian language character , which is shown in Malayalam, to Indian language character , which is shown in Malayalam, where two dots indicate position of accompanying consonant, if corresponding input character is not followed by English character, A, E, I, O, or U, and Indian language character , which is shown in Malayalam, is supported by chosen output language;
not converting Indian language character , which is shown in Malayalam, to Indian character , which is shown in Malayalam, where two dots indicate position of accompanying consonant, if following character in input string is M, P, Y, R, or V;
converting Indian language characters, and , which are shown in Malayalam, respectively to and , which are shown in Malayalam, when they appear at end of a string;
outputting Indian language character , which is shown in Malayalam, when final character in output string is one of the said Indian language consonants;
not outputting Indian language character , which is shown in Malayalam, following one of the said semi-consonants;
optionally replacing special symbol , which is shown in Malayalam, appearing at end of a word by no vowel symbol when outputting into Indian language characters;
optionally converting the said input to a specific Indian language output; and
optionally converting the said input to a specific Indian language font output;
the said computer program accepting input English characters as upper case characters, lower case characters, or a mixture of both upper case and lower case characters and treating them without regard to their upper case or lower case status when converting them into Indian language characters; and
a microprocessor coupled to said input device, output device, and computer program, converting inputted at least one English character into corresponding output at least one Indian language character in accordance with the said character mapping table lookup and character mapping rules.
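A few of the mapping rules in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes Unicode Malayalam code points as output (the claim describes per-font character forms instead), and the table entries below are a small hypothetical subset, because the glyphs of the actual mapping table did not survive in this text. The names VOWELS, CONSONANTS, VIRAMA, and transliterate are illustrative only. The sketch covers case-insensitive input, longest-match lookup, the choice between a vowel and its vowel symbol, a joining mark between adjacent consonants, and leaving unmapped input unconverted. The end-of-word, semi-consonant, and X rules are not covered.

```python
# Hypothetical subset of a character mapping table (assumed Unicode values).
# Each vowel maps to (full form, vowel symbol); "a" has no vowel symbol.
VOWELS = {
    "a": ("\u0d05", ""),         # MALAYALAM LETTER A
    "aa": ("\u0d06", "\u0d3e"),  # LETTER AA, VOWEL SIGN AA
    "i": ("\u0d07", "\u0d3f"),   # LETTER I, VOWEL SIGN I
    "u": ("\u0d09", "\u0d41"),   # LETTER U, VOWEL SIGN U
}
CONSONANTS = {
    "k": "\u0d15",   # LETTER KA
    "kh": "\u0d16",  # LETTER KHA
    "m": "\u0d2e",   # LETTER MA
    "l": "\u0d32",   # LETTER LA
    "p": "\u0d2a",   # LETTER PA
}
VIRAMA = "\u0d4d"    # assumed joining mark placed after the first of two consonants
MAX_KEY = max(len(k) for k in list(VOWELS) + list(CONSONANTS))


def transliterate(text):
    """Convert English letters to Malayalam using the subset tables above."""
    out = []
    i = 0
    after_consonant = False  # True while the last output is a bare consonant
    while i < len(text):
        step = 1
        for length in range(MAX_KEY, 0, -1):  # try the longest key first
            if i + length > len(text):
                continue
            key = text[i:i + length].lower()  # input case is ignored
            if key in VOWELS:
                full_form, symbol = VOWELS[key]
                # Vowel symbol after a consonant, full vowel otherwise.
                out.append(symbol if after_consonant else full_form)
                after_consonant = False
                step = length
                break
            if key in CONSONANTS:
                if after_consonant:
                    out.append(VIRAMA)  # two consonants next to each other
                out.append(CONSONANTS[key])
                after_consonant = True
                step = length
                break
        else:
            out.append(text[i])  # unmapped input is left unconverted
            after_consonant = False
        i += step
    return "".join(out)


print(transliterate("Kaakka"))
print(transliterate("amma"))
```

Because the loop tries longer keys before shorter ones, "KH" is read as one consonant rather than as "K" followed by an unmapped "H", which is the longest-match rule in miniature.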
Abstract
A system and method for writing Indian languages using the English writing scheme is provided that includes specifying a script using the English alphabet to represent the various characters and character combinations in various Indian languages. The specified script follows the writing conventions of English. This script is based on how the Indian languages are spoken, and rules are specified to facilitate mapping the sounds represented in English characters to the native language in its written form. This common method is intended for writing Hindi and related languages, such as Sanskrit, Marathi, Gujarati, and Bengali, and the somewhat distant but closely related Dravidian languages, such as Malayalam, Tamil, Kannada, and Telugu.
11 Claims
7. A transliteration system comprising:
an input device inputting at least one Indian language character;
an output device outputting at least one English character;
a computer program converting at least one Indian language character into corresponding at least one English character based on a character mapping table and a set of character mapping rules;
the said character mapping table;
specifying a superset of Indian language characters, where;
Indian language vowels, and , where Indian language vowels are shown in Malayalam except for RWU, LW, LWU, AMX, and ANX, which are shown in Devanagari, are assigned respectively to English characters A, E, I, O, U, AA, EE, II, OO, UU, AI, AU, RW, RWU, LW, LWU, AMX, ANX, AM, and AHX;
Indian language consonants, , and , where Indian language consonants are shown in Malayalam except for KQ, GW, DW, DTW, PF, and YW, which are shown in Devanagari and NN, which is shown in Tamil, are assigned respectively to English characters K, KH, G, GH, NG, C, CH, J, JH, NJ, T, TD, D, DT, NH, TH, TJ, DJ, DH, N, P, F, B, BH, M, Y, R, RH, L, LH, ZH, V, S, SH, Z, H, Q, KQ, GW, JW, DW, DTW, NN, PF, YW, TT, and NT;
Indian language vowel symbols, none, and , where Indian language vowel symbols are shown in Malayalam except for RWU, LW, LWU, AMX, and ANX, which are shown in Devanagari, are also assigned respectively to English characters, A, E, I, O, U, AA, EE, II, OO, UU, AI, AU, RW, RWU, LW, LWU, AMX, ANX, AM, and AHX;
Indian language semi-consonants, and , where all Indian language semi-consonants are shown in Malayalam, are also assigned respectively to English characters, L, LH, N, NH, and R;
Indian language special symbols, .., , and , which are shown in Devanagari, are assigned respectively to English characters, X, XA, and X; and
Indian language symbol I, which is shown in Devanagari, is assigned to dot character;
when converting short form of a consonant appearing in front of another consonant, if it is to be placed following the other consonant in output;
when converting short form of a consonant appearing following another consonant, if it is to be placed in front of the other consonant in output;
identifying set of characters supported by an Indian language;
identifying Indian language character representation in a font; and
specifying English character sequences assigned to special characters in an Indian language;
the said character mapping rules include;
when mapping vowel symbols to English characters, placing them following consonant they are joined with;
converting semi-consonants, and , where all Indian language semi-consonants are shown in Malayalam, appearing in middle of an input string respectively to English characters, LX, LHX, NX, NHX, and RX;
converting Indian language character , which is shown in Malayalam, where two dots indicate position of an accompanying consonant, to no character if it is followed by another consonant;
when matching Indian language characters and their corresponding English character sequences, choosing longest English character sequence for output;
converting characters not specified in the said character mapping table or the said character mapping rules to no character in the said output;
optionally not converting no vowel symbol, indicated by the absence of any symbol along with a consonant, when it appears at end of a word;
optionally accepting the said input in a specific Indian language; and
optionally accepting the said input in a specific Indian language font; and
a microprocessor coupled to the said input device, output device, and computer program, converting inputted at least one Indian language character into corresponding output at least one English character in accordance with the said character mapping table lookup and character mapping rules.
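The reverse direction in claim 7 can be sketched in the same hedged way. The Python below is an illustration under assumptions, not the claimed system: it reads Unicode Malayalam text, uses a small hypothetical subset of the mapping table (the original glyphs are missing from this text), and the names CONSONANT_TO_EN, VOWEL_TO_EN, SYMBOL_TO_EN, VIRAMA, and to_english are invented for the example. It shows vowel symbols written after the consonant they belong to, the joining mark dropped between consonants, the absent vowel symbol rendered as A, and unspecified characters converted to no character. As a simplification, the joining mark is dropped everywhere, so the special-symbol mappings to X and XA and the semi-consonant rules are not covered.

```python
# Hypothetical subset of the reverse mapping table (assumed Unicode input).
CONSONANT_TO_EN = {
    "\u0d15": "K",   # MALAYALAM LETTER KA
    "\u0d16": "KH",  # LETTER KHA
    "\u0d2e": "M",   # LETTER MA
    "\u0d32": "L",   # LETTER LA
    "\u0d2a": "P",   # LETTER PA
}
VOWEL_TO_EN = {
    "\u0d05": "A",   # LETTER A
    "\u0d06": "AA",  # LETTER AA
    "\u0d07": "I",   # LETTER I
    "\u0d09": "U",   # LETTER U
}
SYMBOL_TO_EN = {
    "\u0d3e": "AA",  # VOWEL SIGN AA
    "\u0d3f": "I",   # VOWEL SIGN I
    "\u0d41": "U",   # VOWEL SIGN U
}
VIRAMA = "\u0d4d"    # assumed joining mark between two consonants


def to_english(text):
    """Convert Malayalam text to English letters using the subset tables."""
    out = []
    for i, ch in enumerate(text):
        nxt = text[i + 1] if i + 1 < len(text) else ""
        if ch in CONSONANT_TO_EN:
            out.append(CONSONANT_TO_EN[ch])
            # No vowel symbol and no joining mark: write the vowel A.
            if nxt not in SYMBOL_TO_EN and nxt != VIRAMA:
                out.append("A")
        elif ch in SYMBOL_TO_EN:
            out.append(SYMBOL_TO_EN[ch])  # placed after its consonant
        elif ch in VOWEL_TO_EN:
            out.append(VOWEL_TO_EN[ch])
        # The joining mark and any unspecified character produce no output.
    return "".join(out)


print(to_english("\u0d15\u0d3e\u0d15\u0d4d\u0d15"))
print(to_english("\u0d05\u0d2e\u0d4d\u0d2e"))
```

This sketch handles only vowel symbols that follow their consonant in the input text. Symbols that are displayed before, or on both sides of, the consonant would need the reordering that the mapping rule on vowel symbol placement describes.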
Specification