Translating sentences between languages using language-independent semantic structures and ratings of syntactic constructions
First Claim
1. A method for a computer system to represent the meaning of a source sentence from a source language, the method comprising:
- obtaining a language-independent semantic structure to represent the meaning of the source sentence;
synthesizing a syntactic structure of an output sentence from the language independent semantic structure using information which includes semantic descriptions, lexical descriptions of an output language, syntactic descriptions of the output language, and morphological descriptions of the output language, wherein the syntactic structure is built at least in part based on ratings of syntactic constructions for each element of the source sentence; and
constructing the output sentence to represent the meaning of the source sentence in the output language by performing a lexical selection on the language-independent semantic structure of the sentence using lexical descriptions of the output language and semantic descriptions, wherein performing the lexical selection further comprises applying one or more semantic structure correction rules to overcome asymmetries between the language-independent semantic structure and the syntactic structure of the output sentence in the output language, and wherein applying one or more semantic structure correction rules to overcome asymmetries includes the use of one or more semanteme calculating rules and one or more semanteme normalization rules.
6 Assignments
0 Petitions
Accused Products
Abstract
A method and computer system for translating sentences between languages from an intermediate language-independent semantic representation is provided. On the basis of comprehensive understanding about languages and semantics, exhaustive linguistic descriptions are used to analyze sentences, to build syntactic structures and language independent semantic structures and representations, and to synthesize one or more sentences in a natural or artificial language. A computer system is also provided to analyze and synthesize various linguistic structures and to perform translation of a wide spectrum of various sentence types. As result, a generalized data structure, such as a semantic structure, is generated from a sentence of an input language and can be transformed into a natural sentence expressing its meaning correctly in an output language. The method and computer system can be applied to in automated abstracting, machine translation, natural language processing, control systems, Internet information retrieval, etc.
145 Citations
19 Claims
-
1. A method for a computer system to represent the meaning of a source sentence from a source language, the method comprising:
-
obtaining a language-independent semantic structure to represent the meaning of the source sentence; synthesizing a syntactic structure of an output sentence from the language independent semantic structure using information which includes semantic descriptions, lexical descriptions of an output language, syntactic descriptions of the output language, and morphological descriptions of the output language, wherein the syntactic structure is built at least in part based on ratings of syntactic constructions for each element of the source sentence; and constructing the output sentence to represent the meaning of the source sentence in the output language by performing a lexical selection on the language-independent semantic structure of the sentence using lexical descriptions of the output language and semantic descriptions, wherein performing the lexical selection further comprises applying one or more semantic structure correction rules to overcome asymmetries between the language-independent semantic structure and the syntactic structure of the output sentence in the output language, and wherein applying one or more semantic structure correction rules to overcome asymmetries includes the use of one or more semanteme calculating rules and one or more semanteme normalization rules. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method for a computer system to generate an output sentence in an output language, the method comprising:
-
performing a lexical selection on a language-independent semantic structure of a source sentence using lexical descriptions of the output language and semantic descriptions, wherein the lexical descriptions and the semantic descriptions are combined into lexical-semantic descriptions, and wherein performing the lexical selection includes applying one or more semantic structure correction rules to overcome asymmetries between the language-independent semantic structure and the syntactic structure of the output sentence in the output language; building a syntactic structure of the output sentence from the language independent semantic structure using syntactic descriptions and morphological descriptions of the output language, wherein the syntactic structure is built at least in part based on ratings of syntactic constructions for elements of the source sentence; performing a morphological synthesis on the syntactic structure of the output sentence using morphological descriptions of the output language; and constructing the output sentence in the output language. - View Dependent Claims (11, 12, 13)
-
-
14. A method for a computer system to generate an output sentence in an output language from a language-independent semantic structure, the method comprising:
-
performing a lexical selection on the language-independent semantic structure of a source sentence using lexical descriptions of the output language and semantic descriptions, wherein performing the lexical selection includes applying one or more semantic structure correction rules to overcome asymmetries between the language-independent semantic structure and the syntactic structure of the output sentence in the output language; building a syntactic structure of the output sentence in the output language from the language-independent semantic structure using syntactic descriptions and lexical descriptions of the output language and using a priori rating scores related to a degree of correspondence of precise constituents of the source sentence to corresponding semantic descriptions; determining a linear order and restoring movements on the syntactic structure of the output sentence in the output language; performing a morphological synthesis on the syntactic structure of the output sentence in the output language using morphological descriptions of the output language; and constructing the output sentence in the output language. - View Dependent Claims (15)
-
-
16. One or more computer readable media encoded with instructions for causing a computing system to carry out steps comprising:
-
performing a lexical selection on a language-independent semantic structure using lexical descriptions of an output language and semantic descriptions, wherein performing the lexical selection includes applying one or more semantic structure correction rules to overcome asymmetries between the language-independent semantic structure and the syntactic structure of an output sentence in the output language; building a surface structure of the output sentence in the output language from the language-independent semantic structure using syntactic descriptions and lexical descriptions of the output language, wherein building the surface structure includes rating a linear order for each syntactic form (“
syntform”
) and creating a precise constituent for each syntform;performing a morphological synthesis on the surface structure of the output sentence in the output language using morphological descriptions of the output language; and constructing the output sentence in the output language. - View Dependent Claims (17, 18)
-
-
19. One or more computer readable media encoded with instructions for causing a computing system to carry out steps for translating a sentence from a source language into an output language, comprising:
-
analyzing a language-independent semantic structure for the sentence; performing a lexical selection on the language-independent semantic structure of the sentence using lexical descriptions of the output language and semantic descriptions, wherein performing the lexical selection includes applying one or more semantic structure correction rules to overcome asymmetries between the language-independent semantic structure and a syntactic structure of an output sentence in the output language, wherein the syntactic structure is built at least in part from ratings of syntactic constructions for elements of the source sentence, and wherein applying one or more semantic structure correction rules to overcome asymmetries includes the use of one or more semanteme calculating rules and one or more semanteme normalization rules; building a surface structure of the output sentence in the output language from the language-independent semantic structure using syntactic descriptions and lexical descriptions of the output language; determining a linear order and restoring movements on the syntactic structure of the output sentence in the output language; performing a morphological synthesis on the surface structure of the output sentence in the output language using morphological descriptions of the output language; and constructing the sentence in the output language.
-
Specification