Systems and methods for multi-user multi-lingual communications
First Claim
1. A method implemented by at least one computer processor, the method comprising:
- obtaining, from a client device of a user, a text message in a first language, the text message comprising at least one word;
providing, using the at least one computer processor, the text message to a machine translation system;
obtaining, using the at least one computer processor, a translation of the text message from the machine translation system;
determining, using the at least one computer processor, that the text message and the translation both comprise the at least one word in the first language and that the at least one word is correctly spelled; and
performing one or more of the following;
(a) determining, using the at least one computer processor, Bayesian probabilities for neighboring words that appear before and after the at least one word and, when the Bayesian probabilities exceed a threshold value, adding the at least one word to a lexicon in a data store; and
(b) performing, using the at least one computer processor, k-means clustering to identify a cluster of words comprising synonyms and, when the cluster comprises the at least one word, adding the at least one word to the lexicon in the data store.
6 Assignments
0 Petitions
Accused Products
Abstract
Various embodiments described herein facilitate multi-lingual communications. The systems and methods of some embodiments enable multi-lingual communications through different modes of communication including, for example, Internet-based chat, e-mail, text-based mobile phone communications, postings to online forums, postings to online social media services, and the like. Certain embodiments implement communication systems and methods that translate text between two or more languages. Users of the systems and methods may be incentivized to submit corrections for inaccurate or erroneous translations, and may receive a reward for these submissions. Systems and methods for assessing the accuracy of translations are described.
-
Citations
30 Claims
-
1. A method implemented by at least one computer processor, the method comprising:
-
obtaining, from a client device of a user, a text message in a first language, the text message comprising at least one word; providing, using the at least one computer processor, the text message to a machine translation system; obtaining, using the at least one computer processor, a translation of the text message from the machine translation system; determining, using the at least one computer processor, that the text message and the translation both comprise the at least one word in the first language and that the at least one word is correctly spelled; and performing one or more of the following; (a) determining, using the at least one computer processor, Bayesian probabilities for neighboring words that appear before and after the at least one word and, when the Bayesian probabilities exceed a threshold value, adding the at least one word to a lexicon in a data store; and (b) performing, using the at least one computer processor, k-means clustering to identify a cluster of words comprising synonyms and, when the cluster comprises the at least one word, adding the at least one word to the lexicon in the data store. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system comprising:
-
a non-transitory computer readable medium having instructions stored thereon; and a data processing apparatus configured to execute the instructions to perform operations comprising; obtaining, from a client device of a user, a text message in a first language, the text message comprising at least one word; providing the text message to a machine translation system; obtaining a translation of the text message from the machine translation system; determining that the text message and the translation both comprise the at least one word in the first language and that the at least one word is correctly spelled; and performing one or more of the following; (a) determining Bayesian probabilities for neighboring words that appear before and after the at least one word and, when the Bayesian probabilities exceed a threshold value, adding the at least one word to a lexicon in a data store; and (b) performing k-means clustering to identify a cluster of words comprising synonyms and, when the cluster comprises the at least one word, adding the at least one word to the lexicon in the data store. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A computer program product stored in one or more non-transitory storage media for controlling a processing mode of a data processing apparatus, the computer program product being executable by the data processing apparatus to cause the data processing apparatus to perform operations comprising:
-
obtaining, from a client device of a user, a text message in a first language, the text message comprising at least one word; providing the text message to a machine translation system; obtaining a translation of the text message from the machine translation system; determining that the text message and the translation both comprise the at least one word in the first language and that the at least one word is correctly spelled; and performing one or more of the following; (a) determining Bayesian probabilities for neighboring words that appear before and after the at least one word and, when the Bayesian probabilities exceed a threshold value, adding the at least one word to a lexicon in a data store; and (b) performing k-means clustering to identify a cluster of words comprising synonyms and, when the cluster comprises the at least one word, adding the at least one word to the lexicon in the data store. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29, 30)
-
Specification