Translating texts between languages
First Claim
1. A computer-implemented method of translating a source text in a source language into an output language, comprising:
- obtaining, using a processor, the source text in the source language from an input detected in a device;
dividing, using a processor, the source text into sentences;
translating, using a processor, each sentence of the source text, said translating comprising;
executing a first syntactic analysis, said first syntactic analysis includinggenerating a set of possible relationships between words of the respective sentence and building a graph of generalized constituents;
executing a second syntactic analysis of each sentence, said second syntactic analysis comprising;
putting forward at least one hypothesis related to a syntactic structure of the sentence, wherein the putting forward the at least one hypothesis comprisesperforming a lexical selection to select a first lexical meaning from a first lexical class for a node of a syntactic tree of the sentence from a set of possible lexical classes; and
verifying the hypothesis about the syntactic structure of the sentence;
generating a language-independent semantic structure of the sentence based at least in part on the verified hypothesis;
generating a syntactic structure of an output sentence in the output language based at least in part on the generated language-independent semantic structure; and
performing a morphological synthesis of the output sentence in the output language based at least in part on the generated syntactic structure of the output sentence;
generating, using a processor, a text in the output language based on said translating of each of the sentences of the source text;
detecting, using a processor, a translation error associated with incorrect lexical selection comprisingcomparing the first lexical class of the selected first lexical meaning to a topic of the source text; and
repeating, using a processor, translation of the identified sentences with a second lexical meaning from a second lexical class, wherein the second lexical class is different from the first lexical class.
4 Assignments
0 Petitions
Accused Products
Abstract
Methods and computer systems for translating sentences between languages from an intermediate language-independent semantic representation are provided. Based on a comprehensive understanding about languages and semantics, exhaustive linguistic descriptions are used to analyze sentences, build syntactic structures and language independent semantic structures and representations, and synthesize one or more sentences in a natural or artificial language. A computer system is also provided to analyze and synthesize various linguistic structures and perform translation of a wide spectrum of various sentence types. As result, a generalized data structure, such as a semantic structure, is generated from a sentence of an input language and can be transformed into a natural sentence expressing its meaning correctly in an output language. The methods and systems can be applied to automated abstracting, machine translation, natural language processing, control systems, Internet information retrieval, etc.
87 Citations
20 Claims
-
1. A computer-implemented method of translating a source text in a source language into an output language, comprising:
-
obtaining, using a processor, the source text in the source language from an input detected in a device; dividing, using a processor, the source text into sentences; translating, using a processor, each sentence of the source text, said translating comprising; executing a first syntactic analysis, said first syntactic analysis including generating a set of possible relationships between words of the respective sentence and building a graph of generalized constituents; executing a second syntactic analysis of each sentence, said second syntactic analysis comprising; putting forward at least one hypothesis related to a syntactic structure of the sentence, wherein the putting forward the at least one hypothesis comprises performing a lexical selection to select a first lexical meaning from a first lexical class for a node of a syntactic tree of the sentence from a set of possible lexical classes; and verifying the hypothesis about the syntactic structure of the sentence; generating a language-independent semantic structure of the sentence based at least in part on the verified hypothesis; generating a syntactic structure of an output sentence in the output language based at least in part on the generated language-independent semantic structure; and performing a morphological synthesis of the output sentence in the output language based at least in part on the generated syntactic structure of the output sentence; generating, using a processor, a text in the output language based on said translating of each of the sentences of the source text; detecting, using a processor, a translation error associated with incorrect lexical selection comprising comparing the first lexical class of the selected first lexical meaning to a topic of the source text; and repeating, using a processor, translation of the identified sentences with a second lexical meaning from a second lexical class, wherein the second lexical class is different from the first lexical class. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A computer-implemented method for disambiguation during translating a source text from a source language into an output language, comprising:
-
obtaining, using a processor, the source text in the source language from an input detected in a device; dividing, using a processor, the source text into sentences; translating, using a processor, each sentence of the source text, wherein translating comprises; generating a set of possible relationships between words in respective sentences; building a graph of generalized constituents for respective sentences; putting forward at least one hypothesis related to a syntactic structure of each sentence; and verifying the hypothesis about the syntactic structure of the sentence; generating a language-independent semantic structure of the source text; generating a syntactic structure of an output text in the output language, comprising; making a lexical selection to select a first lexical meaning from a first lexical class for a node of the semantic structure from a set of possible lexical classes; and performing a morphological synthesis of an output sentence in the output language; obtaining, using a processor, a text in the output language based on the output sentences obtained as result of said translating; identifying, using a processor, the output sentences that include one or more errors by comparing the first lexical class of the selected first lexical meaning to a topic of the source text; repeating, using a processor, translation of the identified sentences with a second lexical meaning from a second lexical class, wherein the second lexical class is different from the first lexical class. - View Dependent Claims (16, 17, 18)
-
-
19. One or more physical non-transitory computer accessible media encoded with instructions for performing a method, the method comprising:
-
obtaining, using a processor, a source text in a source language from an input detected in a device; dividing, using a processor, the source text into sentences; translating, using a processor, each sentence of the source text, said translating comprising; executing a first syntactic analysis, said first syntactic analysis including generating a set of possible relationships between words of the respective sentence and building a graph of generalized constituents; executing a second syntactic analysis of each sentence, said second syntactic analysis comprising; putting forward at least one hypothesis related to a syntactic structure of the sentence, wherein the putting forward the at least one hypothesis comprises performing a lexical selection to select a first lexical meaning from a first lexical class for a node of a syntactic tree of the sentence from a set of possible lexical classes; and verifying the hypothesis about the syntactic structure of the sentence; generating a language-independent semantic structure of the sentence based at least in part on the verified hypothesis; generating a syntactic structure of an output sentence in an output language based at least in part on the generated language-independent semantic structure; and performing a morphological synthesis of the output sentence in the output language based at least in part on the generated syntactic structure of the output sentence; generating, using a processor, a text in the output language based on said translating of each of the sentences of the source text; detecting, using a processor, a translation error associated with incorrect lexical selection comprising comparing the first lexical class of the selected first lexical meaning to a topic of the source text; and repeating, using a processor, translation of the identified sentences with a second lexical meaning from a second lexical class, wherein the second lexical class is different from the first lexical class.
-
-
20. A computer-implemented method of translating a source text in a source language into an output language, comprising:
-
obtaining, using a processor, the source text in the source language from an input detected in a device; dividing, using a processor, the source text into sentences; translating, using a processor, each sentence of the source text, said translating comprising; executing a first syntactic analysis, said first syntactic analysis including generating a set of possible relationships between words of the respective sentence and building a graph of generalized constituents; executing a second syntactic analysis of each sentence, said second syntactic analysis comprising; putting forward at least one hypothesis related to a syntactic structure of the sentence, wherein the putting forward the at least one hypothesis comprises performing a lexical selection to select a first lexical meaning from a first lexical class for a node of a syntactic tree of the sentence from a set of possible lexical classes; and verifying the hypothesis about the syntactic structure of the sentence; generating a language-independent semantic structure of the sentence based at least in part on the verified hypothesis; generating a syntactic structure of an output sentence in the output language based at least in part on the generated language-independent semantic structure; and performing a morphological synthesis of the output sentence in the output language based at least in part on the generated syntactic structure of the output sentence; generating, using a processor, a text in the output language based on said translating of each of the sentences of the source text; detecting, using a processor, a translation error associated with incorrect lexical selection comprising comparing the first lexical class of the selected first lexical meaning to a lexical class of another word with the same lemma as a word corresponding to the node; and repeating, using a processor, translation of the identified sentences with a second lexical meaning from a second lexical class, wherein the second lexical class is different from the first lexical class.
-
Specification