Multi-lingual semantic parser based on transferred learning
First Claim
1. A system of generating a multi-lingual semantic parser based on transferred learning from a first language having sufficient training data to a second language having insufficient training data, the system comprising:
- a computer system programmed to;
obtain a corpus of words in the first language;
generate a plurality of cross-lingual word features that each predicts a target word in the second language that corresponds to a respective source word in the corpus of words based on contextual information in the corpus of words and a translation of the respective source word from the first language to the second language;
obtain at least a first predefined utterance in the first language;
encode, using at least a first encoder, the first predefined utterance as a plurality of first vectors, wherein each first vector is based on a lookup of the plurality of cross-lingual word features using a corresponding word in the first predefined utterance;
decode, using a decoder, the plurality of first vectors to generate one or more words in the second language corresponding to the first predefined utterance in the first language; and
adapt a semantic parser trained on the first language based at least on the decoded plurality of first vectors to generate the multi-lingual semantic parser that parses utterances having one or more words in the second language and utterances having one or more words in the first language.
5 Assignments
0 Petitions
Accused Products
Abstract
The disclosure relates to transferred learning from a first language (e.g., a source language for which a semantic parser has been defined) to a second language (e.g., a target language for which a semantic parser has not been defined). A system may use knowledge from a trained model in one language to model another language. For example, the system may transfer knowledge of a semantic parser from a first (e.g., source) language to a second (e.g., target) language. Such transfer of knowledge may occur and be useful when the first language has sufficient training data but the second language has insufficient training data. The foregoing transfer of knowledge may extend the semantic parser for multiple languages (e.g., the first language and the second language).
-
Citations
16 Claims
-
1. A system of generating a multi-lingual semantic parser based on transferred learning from a first language having sufficient training data to a second language having insufficient training data, the system comprising:
-
a computer system programmed to; obtain a corpus of words in the first language; generate a plurality of cross-lingual word features that each predicts a target word in the second language that corresponds to a respective source word in the corpus of words based on contextual information in the corpus of words and a translation of the respective source word from the first language to the second language; obtain at least a first predefined utterance in the first language; encode, using at least a first encoder, the first predefined utterance as a plurality of first vectors, wherein each first vector is based on a lookup of the plurality of cross-lingual word features using a corresponding word in the first predefined utterance; decode, using a decoder, the plurality of first vectors to generate one or more words in the second language corresponding to the first predefined utterance in the first language; and adapt a semantic parser trained on the first language based at least on the decoded plurality of first vectors to generate the multi-lingual semantic parser that parses utterances having one or more words in the second language and utterances having one or more words in the first language. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A computer-implemented method of generating a multi-lingual semantic parser based on transferred learning from a first language having sufficient training data to a second language having insufficient training data, the method being implemented on a computer system, the method comprising:
-
obtaining a corpus of words in the first language; generating a plurality of cross-lingual word features that each predicts a target word in the second language that corresponds to a respective source word in the corpus of words based on contextual information in the corpus of words and a translation of the respective source word from the first language to the second language; obtaining at least a first predefined utterance in the first language; encoding, using at least a first encoder, the first predefined utterance as a plurality of first vectors, wherein each first vector is based on a lookup of the plurality of cross-lingual word features using a corresponding word in the first predefined utterance; decoding, using a decoder, the plurality of first vectors to generate one or more words in the second language corresponding to the first predefined utterance in the first language; and adapting a semantic parser trained on the first language based at least on the decoded plurality of first vectors to generate the multi-lingual semantic parser that parses utterances having one or more words in the second language and utterances having one or more words in the first language. - View Dependent Claims (16)
-
Specification