System and method for capitalizing machine translated text
First Claim
Patent Images
1. A method for capitalizing translated text comprising:
- executing a translator module stored on a device to automatically translate a capitalized source text to a target text, wherein prior to translation the capitalized source text is converted to lower case and then translated; and
capitalizing the target text according to capitalization information in the capitalized source text and the target text, wherein the step of capitalizing the target text according to capitalization information in the capitalized source text includes;
generating one or more capitalization configurations for the target text;
computing a configuration probability for each of the one or more capitalization configurations, the configuration probability computed from capitalization information in the capitalized source text and at least one capitalization model feature function based on an alignment between the capitalized source text and the target text or the capitalized source text and the capitalization configuration; and
selecting the best capitalization configuration based on the highest configuration probability.
3 Assignments
0 Petitions
Accused Products
Abstract
A system and method for capitalizing translated text is provided. A capitalized source text is automatically translated to a target text. The target text is capitalized according to information in the capitalized source text.
395 Citations
24 Claims
-
1. A method for capitalizing translated text comprising:
-
executing a translator module stored on a device to automatically translate a capitalized source text to a target text, wherein prior to translation the capitalized source text is converted to lower case and then translated; and capitalizing the target text according to capitalization information in the capitalized source text and the target text, wherein the step of capitalizing the target text according to capitalization information in the capitalized source text includes; generating one or more capitalization configurations for the target text; computing a configuration probability for each of the one or more capitalization configurations, the configuration probability computed from capitalization information in the capitalized source text and at least one capitalization model feature function based on an alignment between the capitalized source text and the target text or the capitalized source text and the capitalization configuration; and selecting the best capitalization configuration based on the highest configuration probability. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A translation system comprising:
-
a device; an automatic translator module executable and stored on the device and configured to automatically convert a capitalized source text to lower case text and translate the lower case text to a target text; and a capitalization module configured to recover a capitalized text from the target text according to capitalization information in the capitalized source text and the target text, and capitalize the target text, the capitalization of the target text including; generating a plurality of capitalization configurations for the target text; for each capitalization configuration, computing a feature probability for each of a plurality of capitalization model feature functions; associating a feature weight with each capitalization model feature function; applying the associated feature weight to the respective computed feature probability for each of the plurality of capitalization model feature functions; for each capitalization configuration, calculating a capitalization configuration probability based on a weighted sum of the computed feature probabilities and applied feature weights, and based on an alignment between the capitalized source text and the target text or the capitalized source text and the capitalization configuration; and selecting the best capitalization configuration from the plurality of capitalization configurations based on the highest calculated capitalization configuration probability. - View Dependent Claims (16)
-
-
17. A translation system comprising:
-
a device; an automatic translator module executable and stored on the device and configured to automatically convert a capitalized source text to lower case text and translate the lower case text to a target text; an aligner configured to determine an alignment between one or more phrases in the capitalized source text and one or more respective phrases in the target text of a capitalization configuration; and a capitalization module configured to recover a capitalized text from the target text according to capitalization information in the capitalized source text and the target text and the alignment determined by the aligner, and to capitalize the target text, the capitalization of the target text including; generating a plurality of capitalization configurations for the target text; for each capitalization configuration, computing a feature probability for each of a plurality of capitalization model feature functions; associating a feature weight with each capitalization model feature function; applying the associated feature weight to the respective computed feature probability for each of the plurality of capitalization model feature functions; for each capitalization configuration, calculating a capitalization configuration probability based on a weighted sum of the computed feature probabilities and applied feature weights, and based on the alignment between the one or more phrases in the capitalized source text and the one or more phrases in the target text or between the capitalized source text and the capitalization configuration; assigning the calculated capitalization configuration probability to each respective capitalization configuration; and selecting the best capitalization configuration from the plurality of capitalization configurations based on the highest calculated capitalization configuration probability. - View Dependent Claims (18, 19, 20, 21, 22, 23)
-
-
24. A computer program embodied on a non-transitory computer readable medium having instructions for capitalizing translated text, comprising:
-
executing a translator module stored on a device to automatically translate a capitalized source text to a target text, the translation of the capitalized source text including converting source text to lower case and translating the lower case source text; and capitalizing the target text according to capitalization information in the capitalized source text, the step of capitalizing the target text according to the capitalized source text including; generating one or more capitalization configurations for the target text; computing a configuration probability for each of the one or more capitalization configurations, the configuration probability computed from capitalization information in the capitalized source text and at least one capitalization model feature function based on an alignment between the capitalized source text and the target text or the capitalized source text and the capitalization configuration; and selecting the best capitalization configuration based on the highest computed configuration probability.
-
Specification