Transliteration of text entry across scripts
First Claim
1. A method for transliterating text entries across different script systems, comprising:
- receiving an input string in a first script system input using a keyboard;
identifying segmentation candidates in the input string using a probabilistic model, the segmentation candidates dividing the input string into phonemes that correspond to characters or sets of characters in a script system, the probabilistic model based on a probability distribution of a particular phoneme in the first script system being transliterated into a particular character or set of characters in the second script system;
selecting a segmentation from the segmentation candidates using an additional model, the additional model based on a probability of the word or the word prefix in the second script system appearing in prior texts written in the second script system;
converting the phonemes in the first script system into the character or sets of characters in the second script system, the character or sets of characters forming a word or a word prefix in the second script system; and
outputting the word or the word prefix in the second script system.
1 Assignment
0 Petitions
Accused Products
Abstract
Embodiments are disclosed for transliterating text entries across different script systems. A method according to some embodiments includes steps of: receiving an input string in a first script system input using a keyboard; segmenting, using a probabilistic model, the input string into phonemes that correspond to characters or sets of characters in a second script system; converting the phonemes in the first script system into the characters or sets of characters in the second script system, the characters or sets of characters forming a word or a word prefix in the second script system; and outputting the word or the word prefix in the second script system.
8 Citations
16 Claims
-
1. A method for transliterating text entries across different script systems, comprising:
-
receiving an input string in a first script system input using a keyboard; identifying segmentation candidates in the input string using a probabilistic model, the segmentation candidates dividing the input string into phonemes that correspond to characters or sets of characters in a script system, the probabilistic model based on a probability distribution of a particular phoneme in the first script system being transliterated into a particular character or set of characters in the second script system; selecting a segmentation from the segmentation candidates using an additional model, the additional model based on a probability of the word or the word prefix in the second script system appearing in prior texts written in the second script system; converting the phonemes in the first script system into the character or sets of characters in the second script system, the character or sets of characters forming a word or a word prefix in the second script system; and outputting the word or the word prefix in the second script system. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A non-transitory machine-readable storage medium storing instructions configured to be executed by a processor to cause the processor to:
-
receive an input string in a first script system input using a keyboard; identify segmentation candidates in the input string using a probabilistic model, the segmentation candidates dividing the input string into phonemes that correspond to characters or sets of characters in a script system, the probabilistic model based on a probability distribution of a particular phoneme in the first script system being transliterated into a particular character or set of characters in the second script system; select a segmentation from the segmentation candidates using an additional model, the additional model based on a probability of the word or the word prefix in the second script system appearing in prior texts written in the second script system; convert the phonemes in the first script system into the character or sets of characters in the second script system, the character or sets of characters forming a word or a word prefix in the second script system; and output the word or the word prefix in the second script system. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A computing device, comprising:
-
a processor; and a memory storing instructions configured to be executed by a processor to cause the processor to; identify segmentation candidates in an input string using a probabilistic model, the segmentation candidates dividing the input string into phonemes that correspond to characters or sets of characters in a script system, the probabilistic model based on a probability distribution of a particular phoneme in the first script system being transliterated into a particular character or set of characters in the second script system; and select a segmentation from the segmentation candidates using an additional model, the additional model based on a probability of the word or the word prefix in the second script system appearing in prior texts written in the second script system; convert the phonemes in the first script system into the character or sets of characters in the second script system, a word or a word prefix in the second script system comprising the character; and store the word or the word prefix in the second script system. - View Dependent Claims (16)
-
Specification