Method and system for translating sentence between languages based on semantic structure of the sentence
First Claim
1. A method of translating a source sentence in a source language into an output sentence in an output language, the method comprising:
- performing, by a computer system, a lexico-morphological analysis of the source sentence to build a lexico-morphological structure representing the source sentence;
performing, using the lexico-morphological structure, a rough syntactic analysis of the source sentence to generate a graph of generalized constituents representing multiple possible syntactic relationships between words of the source sentence;
performing, using the graph of generalized constituents, a precise syntactic analysis of the source sentence to generate a plurality of syntactic trees representing the source sentence, wherein each syntactic tree comprises a subgraph of the graph of generalized constituents;
determine, for the plurality of syntactic trees, values of a syntactic tree rating reflecting a degree of correspondence of a filler of a surface slot of a constituent of the syntactic structure to a restriction associated with the surface slot;
selecting, based on the values of the syntactic tree rating, one or more syntactic trees among the plurality of the syntactic trees;
generating, based on the selected syntactic trees, a syntactic structure representing the source sentence;
using the syntactic structure to generate a language-independent semantic structure representing the source sentence; and
using the language-independent semantic structure to generate the output sentence.
3 Assignments
0 Petitions
Accused Products
Abstract
A method and computer system for translating sentences between languages from an intermediate language-independent semantic representation is provided. On the basis of comprehensive understanding about languages and semantics, exhaustive linguistic descriptions are used to analyze sentences, to build syntactic structures and language independent semantic structures and representations, and to synthesize one or more sentences in a natural or artificial language. A computer system is also provided to analyze and synthesize various linguistic structures and to perform translation of a wide spectrum of various sentence types. As result, a generalized data structure, such as a semantic structure, is generated from a sentence of an input language and can be transformed into a natural sentence expressing its meaning correctly in an output language. The method and computer system can be applied to in automated abstracting, machine translation, natural language processing, control systems, Internet information retrieval, etc.
196 Citations
22 Claims
-
1. A method of translating a source sentence in a source language into an output sentence in an output language, the method comprising:
-
performing, by a computer system, a lexico-morphological analysis of the source sentence to build a lexico-morphological structure representing the source sentence; performing, using the lexico-morphological structure, a rough syntactic analysis of the source sentence to generate a graph of generalized constituents representing multiple possible syntactic relationships between words of the source sentence; performing, using the graph of generalized constituents, a precise syntactic analysis of the source sentence to generate a plurality of syntactic trees representing the source sentence, wherein each syntactic tree comprises a subgraph of the graph of generalized constituents; determine, for the plurality of syntactic trees, values of a syntactic tree rating reflecting a degree of correspondence of a filler of a surface slot of a constituent of the syntactic structure to a restriction associated with the surface slot; selecting, based on the values of the syntactic tree rating, one or more syntactic trees among the plurality of the syntactic trees; generating, based on the selected syntactic trees, a syntactic structure representing the source sentence; using the syntactic structure to generate a language-independent semantic structure representing the source sentence; and using the language-independent semantic structure to generate the output sentence. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A non-transitory computer-readable storage medium encoded with instructions for causing a computing system to:
-
perform a lexico-morphological analysis of a source sentence to build a lexico-morphological structure representing the source sentence; perform, using the lexico-morphological structure, a syntactic analysis of the source sentence to generate a plurality of syntactic trees representing the source sentence; determine, for the plurality of syntactic trees, values of a syntactic tree rating reflecting a degree of correspondence of a filler of a surface slot of a constituent of the syntactic structure to a restriction associated with the surface slot; select, based on the values of the syntactic tree rating, a first syntactic tree among the plurality of the syntactic trees; responsive to failing to assign non-tree links to the first syntactic tree, selecting a second syntactic tree among the plurality of the syntactic trees; generate, based on the second syntactic tree, a syntactic structure representing the source sentence; using the syntactic structure to generate a language-independent semantic structure representing the source sentence; and using the language-independent semantic structure to generate an output sentence in a specified output language. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A computer system for translating a source sentence from a source language into an output sentence in an output language, the computer system comprising:
-
a memory; a processor coupled to the memory, the processor configured to; perform a lexico-morphological analysis of the source sentence to build a lexico-morphological structure representing the source sentence; perform, using the lexico-morphological structure, a syntactic analysis of the source sentence to generate a plurality of syntactic trees representing the source sentence; determine, for the plurality of syntactic trees, values of a syntactic tree rating reflecting a degree of correspondence of a filler of a surface slot of a first constituent of the syntactic structure to a restriction associated with the surface slot; select, based on the values of the syntactic tree rating, a first syntactic tree among the plurality of the syntactic trees; responsive to failing to assign non-tree links to the first syntactic tree, select a second syntactic tree among the plurality of the syntactic trees; generate, based on the second syntactic tree, a syntactic structure representing the source sentence; use the syntactic structure of the source sentence to generate a language-independent semantic structure representing the source sentence; and use the language-independent semantic structure to generate the output sentence. - View Dependent Claims (16, 17, 18, 19, 20, 21, 22)
-
Specification