Translation confidence scores
First Claim
1. A method for training and applying a confidence scoring model, comprising:
- receiving multiple training items, wherein a training item comprises;
a source content item, a translation of the source content item, and one or more user scores for the translation of the source content item;
training a confidence scoring model by, for a selected training item of the multiple training items;
extracting features of the selected training item;
combining the extracted features of the selected training item into an input for the confidence scoring model to produce an intermediate confidence score, wherein the intermediate confidence score is computed based on parameters or weights of the confidence scoring model;
comparing the intermediate confidence score to the one or more user scores for the translation of the source content item of the selected training item; and
based on the comparison of the intermediate confidence score to the one or more user scores, modifying one or more of the parameters or weights of the confidence scoring model, wherein the modification of the parameters or weights of the confidence scoring model adjusts the confidence scoring model in favor of the one or more user scores;
computing a confidence score for a given translation generated by a first machine translation system applying first translation logic using the trained confidence scoring model;
determining that the confidence score is below a threshold; and
in response to determining that the confidence score is below the threshold;
submitting request for an updated version of the translation, the request comprising one or more of a request for a translation by a second machine translation system different from the first translation system, a request for a translation by second translation logic different from the first translation logic, or a request for a translation by a translator user;
receiving the updated version of the translation in response to the request; and
providing the updated version of the translation to a receiving user.
2 Assignments
0 Petitions
Accused Products
Abstract
A confidence scoring system can include a model trained using features extracted from translations that have received user translation ratings. The features can include, e.g. sentence length, an amount of out-of-vocabulary or rare words, language model probability scores of the source or translation, or a semantic similarity between the source and a translation. Parameters of the confidence model can then be adjusted based on a comparison of the confidence model output and user translation ratings, where the user translation ratings can be selected or weighted based on a determination of individual user fluentness. After the confidence model has been trained, it can produce confidence scores for new translations. If a confidence score is higher than a threshold, it can indicate the translation should be selected for automatic presentation to users. If the confidence score is below another threshold, it can indicate the translation should be updated.
208 Citations
20 Claims
-
1. A method for training and applying a confidence scoring model, comprising:
-
receiving multiple training items, wherein a training item comprises;
a source content item, a translation of the source content item, and one or more user scores for the translation of the source content item;training a confidence scoring model by, for a selected training item of the multiple training items; extracting features of the selected training item; combining the extracted features of the selected training item into an input for the confidence scoring model to produce an intermediate confidence score, wherein the intermediate confidence score is computed based on parameters or weights of the confidence scoring model; comparing the intermediate confidence score to the one or more user scores for the translation of the source content item of the selected training item; and based on the comparison of the intermediate confidence score to the one or more user scores, modifying one or more of the parameters or weights of the confidence scoring model, wherein the modification of the parameters or weights of the confidence scoring model adjusts the confidence scoring model in favor of the one or more user scores; computing a confidence score for a given translation generated by a first machine translation system applying first translation logic using the trained confidence scoring model; determining that the confidence score is below a threshold; and in response to determining that the confidence score is below the threshold; submitting request for an updated version of the translation, the request comprising one or more of a request for a translation by a second machine translation system different from the first translation system, a request for a translation by second translation logic different from the first translation logic, or a request for a translation by a translator user; receiving the updated version of the translation in response to the request; and providing the updated version of the translation to a receiving user. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A non-transitory computer-readable storage medium storing instructions that, when executed by a computing system, cause the computing system to perform operations for applying a confidence scoring model, the operations comprising:
-
receiving a translation of a source content item; extracting features of the translation; combining the extracted features of the translation into an input for the confidence scoring model; applying the confidence scoring model to the input for the confidence scoring model to produce a confidence score, wherein the confidence score is computed based on parameters or weights of the confidence scoring model; determining that the confidence score is above an auto-translate threshold; and in response to determining that the confidence score is above the auto-translate threshold, causing the translation to be automatically displayed in a user interface of a user of a social media website. - View Dependent Claims (11, 12, 13, 14, 15)
-
-
16. A system for training and applying a confidence scoring model, comprising:
-
a memory; one or more processors; an interface configured to receive multiple training items, wherein a training item comprises;
a translation of a source content item and one or more user scores for the translation;a confidence model trainer configured to train a confidence scoring model by, for a selected training item of the multiple training items; using a translation feature extractor to extract features of the selected training item; combining the extracted features of the selected training item into an input for the confidence scoring model to produce an intermediate confidence score, wherein the intermediate confidence score is computed based on parameters or weights of the confidence scoring model; comparing the intermediate confidence score to the one or more user scores for the translation of the selected training item; and based on the comparison of the intermediate confidence score to the one or more user scores, modifying one or more of the parameters or weights of the confidence scoring model, wherein the modification of the parameters or weights of the confidence scoring model adjusts the confidence scoring model using the input in favor of the one or more user scores; one or more confidence models comprising at least the trained confidence scoring model; and a translation sorter configured to; receive, from the one or more confidence models, multiple scores each corresponding to one of multiple translations of a content item; and select, from the multiple translations, the translation with the highest corresponding score to use as a translation of the content item. - View Dependent Claims (17, 18, 19, 20)
-
Specification