Translating sentences between languages
First Claim
1. A method for a computer system to synthesize a sentence into an output language, the method comprising:
- obtaining a language-independent semantic structure of the sentence expressed as a language-independent directed acyclic graph including nodes, wherein the nodes correspond to semantic classes;
selecting at least one lexical meaning from multiple lexical meanings in a corresponding semantic class for each node of the acyclic graph using;
a lexical-semantic description of the output language,a rating related to lexical meanings in the output language,a rating related to combinations of the lexical meanings in the output language, anda most number of semantemes assigned to a constituent core of a corresponding word of the sentence, on the basis of the rating related to lexical meanings in the output language and the rating related to combinations of the lexical meanings in the output language;
building a surface structure of the output sentence in the output language from the language-independent semantic structure using a syntactic description of the output language, wherein the surface structure of the output sentence is built at least in part based on a rating of syntactic constructions of the output language;
restoring a movement and determining a linear order on the syntactic structure of the output sentence in the output language; and
performing a morphological synthesis of the output sentence using morphological descriptions of the output language.
4 Assignments
0 Petitions
Accused Products
Abstract
A method and computer system for translating sentences between languages from an intermediate language-independent semantic representation is provided. On the basis of comprehensive understanding about languages and semantics, exhaustive linguistic descriptions are used to analyze sentences, to build syntactic structures and language independent semantic structures and representations, and to synthesize one or more sentences in a natural or artificial language. A computer system is also provided to analyze and synthesize various linguistic structures and to perform translation of a wide spectrum of various sentence types. As result, a generalized data structure, such as a semantic structure, is generated from a sentence of an input language and can be transformed into a natural sentence expressing its meaning correctly in an output language. The method and computer system can be applied to in automated abstracting, machine translation, natural language processing, control systems, Internet information retrieval, etc.
-
Citations
27 Claims
-
1. A method for a computer system to synthesize a sentence into an output language, the method comprising:
-
obtaining a language-independent semantic structure of the sentence expressed as a language-independent directed acyclic graph including nodes, wherein the nodes correspond to semantic classes; selecting at least one lexical meaning from multiple lexical meanings in a corresponding semantic class for each node of the acyclic graph using; a lexical-semantic description of the output language, a rating related to lexical meanings in the output language, a rating related to combinations of the lexical meanings in the output language, and a most number of semantemes assigned to a constituent core of a corresponding word of the sentence, on the basis of the rating related to lexical meanings in the output language and the rating related to combinations of the lexical meanings in the output language; building a surface structure of the output sentence in the output language from the language-independent semantic structure using a syntactic description of the output language, wherein the surface structure of the output sentence is built at least in part based on a rating of syntactic constructions of the output language; restoring a movement and determining a linear order on the syntactic structure of the output sentence in the output language; and performing a morphological synthesis of the output sentence using morphological descriptions of the output language. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A device for translating a text fragment in an input language into an output language, the device comprising:
-
a processor; and a memory in electronic communication with the processor, wherein the memory is configured with instructions to cause the processor to perform actions comprising; obtaining a language-independent semantic structure of the sentence to be synthesized in language-independent terms wherein the meaning of the sentence is expressed as a language-independent directed acyclic graph including nodes, wherein the nodes correspond to semantic classes; selecting at least one lexical meaning from multiple lexical meanings in a corresponding semantic class for each node of the acyclic graph using; a lexical-semantic description of the output language, a rating related to lexical meanings in the output language, a rating related to combinations of the lexical meanings in the output language, and a most number of semantemes assigned to a constituent core of a corresponding word of the sentence, on the basis of the rating related to lexical meanings in the output language and the rating related to combinations of the lexical meanings in the output language; building a surface structure of the output sentence in the output language from the language-independent semantic structure using a syntactic description of the output language, wherein the surface structure of the output sentence is built at least in part based on a rating of syntactic constructions of the output language; restoring a movement and determining a linear order on the syntactic structure of the output sentence in the output language; and performing a morphological synthesis of the output sentence using morphological descriptions of the output language. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. One or more non-transitory computer-readable media having stored thereon instructions, which when executed by a processing system, cause the system to perform a method for correcting distortions in a scanned image of a page, the method comprising:
-
obtaining a language-independent semantic structure of the sentence to be synthesized in language-independent terms wherein the meaning of the sentence is expressed as a language-independent directed acyclic graph including nodes, wherein the nodes correspond to semantic classes; selecting at least one lexical meaning from multiple lexical meanings in a corresponding semantic class for each node of the acyclic graph using; a lexical-semantic description of the output language, a rating related to lexical meanings in the output language, a rating related to combinations of the lexical meanings in the output language, and a most number of semantemes assigned to a constituent core of a corresponding word of the sentence, on the basis of the rating related to lexical meanings in the output language and the rating related to combinations of the lexical meanings in the output language; building a surface structure of the output sentence in the output language from the language-independent semantic structure using a syntactic description of the output language, wherein the surface structure of the output sentence is built at least in part based on a rating of syntactic constructions of the output language; restoring a movement and determining a linear order on the syntactic structure of the output sentence in the output language; and performing a morphological synthesis of the output sentence using morphological descriptions of the output language. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27)
-
Specification