System and method for translating chat messages
First Claim
1. A method, comprising:
- receiving an electronic text message from a client device of a user;
normalizing the electronic text message to generate a normalized text message comprising characters having a consistent width, the normalizing comprising at least one of;
converting a full-width character into a half-width character;
orconverting a half-width character into a full-width character;
tagging at least one phrase in the normalized text message with a marker to generate a tagged text message, the marker indicating that the at least one phrase will be translated using a rule-based system;
translating the tagged text message using the rule-based system and a machine translation system to generate an initial translation,wherein translating the tagged text message comprises (i) translating the at least one phrase using the rule-based system and (ii) translating other words or phrases using the machine translation system, andwherein the machine translation system is trained using training data comprising characters having a consistent width; and
post-processing the initial translation to generate a final translation.
2 Assignments
0 Petitions
Accused Products
Abstract
Implementations of the present disclosure are directed to a method, a system, and an article for translating chat messages. An example method can include: receiving an electronic text message from a client device of a user; normalizing the electronic text message to generate a normalized text message; tagging at least one phrase in the normalized text message with a marker to generate a tagged text message, the marker indicating that the at least one phrase will be translated using a rule-based system; translating the tagged text message using the rule-based system and a machine translation system to generate an initial translation; and post-processing the initial translation to generate a final translation.
324 Citations
18 Claims
-
1. A method, comprising:
-
receiving an electronic text message from a client device of a user; normalizing the electronic text message to generate a normalized text message comprising characters having a consistent width, the normalizing comprising at least one of; converting a full-width character into a half-width character;
orconverting a half-width character into a full-width character; tagging at least one phrase in the normalized text message with a marker to generate a tagged text message, the marker indicating that the at least one phrase will be translated using a rule-based system; translating the tagged text message using the rule-based system and a machine translation system to generate an initial translation, wherein translating the tagged text message comprises (i) translating the at least one phrase using the rule-based system and (ii) translating other words or phrases using the machine translation system, and wherein the machine translation system is trained using training data comprising characters having a consistent width; and post-processing the initial translation to generate a final translation. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system, comprising:
one or more computer processors programmed to perform operations comprising; receiving an electronic text message from a client device of a user; normalizing the electronic text message to generate a normalized text message comprising characters having a consistent width, the normalizing comprising at least one of; converting a full-width character into a half-width character;
orconverting a half-width character into a full-width character; tagging at least one phrase in the normalized text message with a marker to generate a tagged text message, the marker indicating that the at least one phrase will be translated using a rule-based system; translating the tagged text message using the rule-based system and a machine translation system to generate an initial translation, wherein translating the tagged text message comprises (i) translating the at least one phrase using the rule-based system and (ii) translating other words or phrases using the machine translation system, and wherein the machine translation system is trained using training data comprising characters having a consistent width; and post-processing the initial translation to generate a final translation. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17)
-
18. An article, comprising:
a non-transitory computer-readable medium having instructions stored thereon that, when executed by one or more computer processors, cause the computer processors to perform operations comprising; receiving an electronic text message from a client device of a user; normalizing the electronic text message to generate a normalized text message comprising characters having a consistent width, the normalizing comprising at least one of; converting a full-width character into a half-width character;
orconverting a half-width character into a full-width character; tagging at least one phrase in the normalized text message with a marker to generate a tagged text message, the marker indicating that the at least one phrase will be translated using a rule-based system; translating the tagged text message using the rule-based system and a machine translation system to generate an initial translation, wherein translating the tagged text message comprises (i) translating the at least one phrase using the rule-based system and (ii) translating other words or phrases using the machine translation system, and wherein the machine translation system is trained using training data comprising characters having a consistent width; and post-processing the initial translation to generate a final translation.
Specification