Deep model statistics method for machine translation
First Claim
1. A method for machine translation of a source document in an input language to a target document in an output language, the method comprising:
- creating a language-independent semantic structure for at least portions of each sentence in the source document;
- generating by a processor translation options corresponding to at least portions of each sentence in the input language, wherein statistics are used in generating the translation options, wherein said generating the translation options includes:
  - performing a syntactic analysis to determine surface elements associated with the sentence in the input language, and
  - performing a semantic analysis to determine deep elements associated with the sentence in the input language;
- selecting a translation option for the said at least portions of each sentence based on ratings associated with the translation options, wherein the ratings include a use of statistics that can include a combinatorial rating for the surface elements associated with the sentence in the input language and the deep elements associated with the sentence in the input language, wherein the combinatorial rating includes a probability of objects of certain semantic classes being combined with objects of a same or another semantic class; and
- synthesizing one or more output sentences corresponding to the said at least portions of each sentence in the source document based at least in part on a respective language-independent semantic structure and respective selected translation option.
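The selecting step rates translation options using a probability of objects of certain semantic classes being combined with objects of a same or another semantic class. The following is a minimal sketch of one way such a combinatorial rating could be computed, not the patented implementation. The semantic class names, the probability values, the `UNSEEN` fallback, and the choice to sum pairwise log-probabilities are all assumptions made for illustration; the claim does not specify them. The sketch also covers only the deep (semantic class) side, not the surface elements.

```python
from itertools import combinations
from math import log

# Hypothetical statistics: probability that objects of two semantic classes
# are combined. All names and values are invented for illustration.
PAIR_PROBABILITY = {
    frozenset(["FINANCIAL_INSTITUTION", "MONEY"]): 0.30,
    frozenset(["FINANCIAL_INSTITUTION", "TO_DEPOSIT"]): 0.25,
    frozenset(["MONEY", "TO_DEPOSIT"]): 0.20,
    frozenset(["RIVER_SIDE", "MONEY"]): 0.01,
}

# Assumed fallback probability for class pairs absent from the statistics.
UNSEEN = 1e-4


def combinatorial_rating(semantic_classes):
    """Sum log-probabilities over every pair of semantic classes in an option.

    frozenset keys make the lookup order-independent; a pair of the same
    class collapses to a one-element key, covering the "same class" case.
    """
    total = 0.0
    for a, b in combinations(semantic_classes, 2):
        total += log(PAIR_PROBABILITY.get(frozenset((a, b)), UNSEEN))
    return total


def select_option(options):
    """Return the translation option with the highest combinatorial rating."""
    return max(options, key=lambda opt: combinatorial_rating(opt["classes"]))


# Two hypothetical readings of "bank" in "deposit money in the bank".
options = [
    {"text": "bank (financial institution)",
     "classes": ["FINANCIAL_INSTITUTION", "MONEY", "TO_DEPOSIT"]},
    {"text": "bank (river side)",
     "classes": ["RIVER_SIDE", "MONEY", "TO_DEPOSIT"]},
]

best = select_option(options)
print(best["text"])
```

Under these invented statistics the financial-institution reading is rated higher, because every pair of its classes is more probable than the corresponding pair for the river-side reading.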
Abstract
In one embodiment, the invention provides a method for machine translation of a source document in an input language to a target document in an output language, comprising generating translation options corresponding to at least portions of each sentence in the input language; and selecting a translation option for the sentence based on statistics associated with the translation options.
24 Claims
1. The independent method claim set out in full under First Claim above. Dependent claims: 2, 3, 4, 5, 6, 7, 8.
9. A physical, non-transitory computer storage medium having stored thereon a sequence of instructions which when executed by a computer system cause said computer system to perform a method for machine translation of a source document in an input language to a target document in an output language, the instructions comprising:
- creating a language-independent semantic structure for each sentence in the source document;
- generating by a processor of the computer system translation options corresponding to at least portions of each respective sentence in the input language, wherein said generating the translation options is based at least in part on statistics, and wherein said generating the translation options includes:
  - performing a syntactic analysis to determine surface elements associated with the sentence in the input language, and
  - performing a semantic analysis to determine deep elements associated with the sentence in the input language;
- selecting a translation option for one or more sentences based on ratings associated with the translation options, wherein the ratings associated with the translation options include the use of statistics, wherein the statistics include the use of a combinatorial rating for the surface elements associated with the sentence in the input language and the deep elements associated with the sentence in the input language, wherein the combinatorial rating includes a use of a probability of objects of certain semantic classes being combined with objects of a same or another semantic class;
- synthesizing an output sentence for the target document corresponding to one or more sentences in the source document based at least in part on one or more respective language-independent semantic structures and a selected translation option; and
- writing the output sentence for the target document to a component of the computing system.
Dependent claims: 10, 11, 12, 13, 14, 15, 16.
17. A computer system comprising:
- a processor; and
- a storage medium coupled to the processor, the storage medium storing instructions which when executed by the processor cause the computer system to perform a method for machine translation of a source document in an input language to a target document in an output language, the instructions comprising:
  - creating a language-independent semantic structure for one or more sentences in the source document;
  - generating translation options corresponding to at least portions of each sentence in the input language, wherein said generating the translation options is based at least in part on a use of statistics, and wherein said generating the translation options includes:
    - performing a syntactic analysis to determine surface elements associated with the sentence in the input language, and
    - performing a semantic analysis to determine deep elements associated with the sentence in the input language;
  - selecting a translation option for the said at least portions of each sentence based on ratings associated with the translation options, wherein the ratings associated with the translation options include the use of statistics, wherein said statistics include the use of a combinatorial rating for the surface elements associated with the sentence in the input language and the deep elements associated with the sentence in the input language, wherein the combinatorial rating includes a use of a probability of objects of certain semantic classes being combined with objects of a same or another semantic class;
  - synthesizing one or more output sentences corresponding to the said at least portions of each sentence in the source document based at least in part on one or more respective language-independent semantic structures and respective selected translation option; and
  - writing the target document to a component of the computer system.
Dependent claims: 18, 19, 20, 21, 22, 23, 24.
Specification