Systems and methods for determining translation accuracy in multi-user multi-lingual communications
First Claim
1. A method comprising:
- performing by one or more computer processors;
obtaining a translation of a message in a first language to a second language wherein the translation was submitted by a user;
generating a part-of-speech (POS) n-gram representation of the translation comprising a plurality of POS n-grams of two or more different lengths;
determining a respective probability for each POS n-gram as a ratio of a count of occurrences of the POS n-gram in a corpus for the second language to a count of all POS n-grams in the corpus having a same length as the POS n-gram; and
determining an accuracy of the translation based on a combination of the probabilities.
6 Assignments
0 Petitions
Accused Products
Abstract
Various embodiments described herein facilitate multi-lingual communications. The systems and methods of some embodiments enable multi-lingual communications through different modes of communication including, for example, Internet-based chat, e-mail, text-based mobile phone communications, postings to online forums, postings to online social media services, and the like. Certain embodiments implement communication systems and methods that translate text between two or more languages. Users of the systems and methods may be incentivized to submit corrections for inaccurate or erroneous translations, and may receive a reward for these submissions. Systems and methods for assessing the accuracy of translations are described.
-
Citations
21 Claims
-
1. A method comprising:
performing by one or more computer processors; obtaining a translation of a message in a first language to a second language wherein the translation was submitted by a user; generating a part-of-speech (POS) n-gram representation of the translation comprising a plurality of POS n-grams of two or more different lengths; determining a respective probability for each POS n-gram as a ratio of a count of occurrences of the POS n-gram in a corpus for the second language to a count of all POS n-grams in the corpus having a same length as the POS n-gram; and determining an accuracy of the translation based on a combination of the probabilities. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
8. A system comprising:
-
a non-transitory computer readable medium having instructions stored thereon; and at least one processor configured to execute the instructions to perform operations comprising; obtaining a translation of a message in a first language to a second language wherein the translation was submitted by a user; generating a part-of-speech (POS) n-gram representation of the translation comprising a plurality of POS n-grams of two or more different lengths; determining a respective probability for each POS n-gram as a ratio of a count of occurrences of the POS n-gram in a corpus for the second language to a count a all POS n-grams in the corpus having a same length as the POS n-gram; and determining an accuracy of the translation based on a combination of the probabilities. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A manufacture comprising:
-
non-transitory computer readable media comprising executable instructions, the executable instructions being executable by one or more processors to perform operations comprising; obtaining a translation of a message in a first language to a second language wherein the translation was submitted by a user; generating a part-of-speech (POS) n-gram representation of the translation comprising a plurality of POS n-grams of two or more different lengths; determining a respective probability for each POS n-gram as a ratio of a count of occurrences of the POS n-grams in a corpus for the second language to a count of all POS n-grams in the corpus having a same length as the POS n-gram and determining an accuracy of the translation based on a combination of the probabilities. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
Specification