Machine translation
Abstract
A computer natural language translation system, comprising: means for inputting source language text; means for outputting target language text; transfer means for generating said target language text from said source language text using stored translation data generated from examples of source and corresponding target language texts, characterised in that said stored translation data comprises a plurality of translation components, each comprising: surface data representative of the order of occurrence of language units in said component; dependency data related to the semantic relationship between language units in said component; and link data linking dependency data of language components of said source language with corresponding dependency data of language components of said target language, and in that said transfer means is arranged to use said surface data of said source language in analysing the source language text, and said surface data of said target language in generating said target language text, and said dependency data and link data in transforming the analysis of said source text into an analysis for said target language.
19 Claims
1. A computer natural language translation system, comprising:
means for inputting source language text;
means for outputting target language text;
transfer means for generating said target language text from said source language text using stored translation data generated from examples of source and corresponding target language texts, characterised in that said stored translation data comprises a plurality of translation units, each comprising:
respective surface data representative of the order of occurrence of language units of said source and target languages;
respective dependency data related to the semantic relationship between said language units of said source and target languages; and
the dependency data of language units of said source language being aligned with corresponding dependency data of language units of said target language, and in that said transfer means is arranged to use said surface data of said source language in analysing said source language text, and said surface data of said target language in generating said target language text, and said dependency data in transforming the analysis of said source text into an analysis for said target language.
(Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 17)
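As an illustration only, the translation unit recited in claim 1 can be pictured as a small record holding surface order, dependency links, and a source-to-target alignment. The sketch below is a minimal Python model under stated assumptions: the class names `LanguageSide` and `TranslationUnit`, the index-based encoding, and the English/French example are all hypothetical and are not taken from the patent.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class LanguageSide:
    # Surface data: language units in their order of occurrence.
    surface: List[str]
    # Dependency data: daughter index -> head index (the root has no entry).
    heads: Dict[int, int] = field(default_factory=dict)


@dataclass
class TranslationUnit:
    source: LanguageSide
    target: LanguageSide
    # Alignment: source unit index -> target unit index.
    alignment: Dict[int, int] = field(default_factory=dict)

    def aligned_target(self, source_index: int) -> Optional[str]:
        """Return the target unit aligned with a source unit, if any."""
        target_index = self.alignment.get(source_index)
        if target_index is None:
            return None
        return self.target.surface[target_index]


# Hypothetical example pair: the adjective precedes the noun in English
# and follows it in French, so surface order differs while the
# head/daughter structure stays the same.
unit = TranslationUnit(
    source=LanguageSide(["the", "white", "horse"], {0: 2, 1: 2}),
    target=LanguageSide(["le", "cheval", "blanc"], {0: 1, 2: 1}),
    alignment={0: 0, 1: 2, 2: 1},
)
print(unit.aligned_target(1))
```

Keeping surface order and dependency links in separate fields mirrors the claim's distinction between data used for analysis and generation (surface) and data used for transfer (dependency plus alignment).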
13. A computer language translation development system, for developing data for use in translation, comprising:
means for allowing corresponding source and target example texts to be linked into source and target language dependency graphs;
means for allowing corresponding translatable nodes of said source and target language dependency graphs representing translatable parts of the source and target texts to be aligned; and
means for automatically generating, from said source and target language dependency graphs, respective associated surface representative graph having a tree structure.
(Dependent claims: 14, 15, 16)
18. A method for inferring a surface tree representation of the analysis of a text from a dependency graph comprising:
simplification of the dependency graph in a manner that is dependent on the alignment between source and target dependency graphs,
creation of an initial (not necessarily planar) surface tree from the above simplified dependency graph, and
creation of a final planar surface tree from the above surface tree.
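Claim 18 distinguishes an initial surface tree that need not be planar from a final planar one. One common way to test that property, offered here as an assumption about what "planar" means in practice rather than as the patent's own definition, is to check that every node's subtree covers a contiguous run of language units. The function name and the `heads` list encoding (parent index per unit, `None` for the root) are hypothetical.

```python
from typing import Dict, List, Optional


def subtree_spans_are_contiguous(heads: List[Optional[int]]) -> bool:
    """Return True if every subtree covers an unbroken range of positions.

    heads[i] is the parent position of unit i, or None for the root.
    The input is assumed to be acyclic.
    """
    n = len(heads)
    # Collect, for each node, every unit in its subtree (itself included).
    members: Dict[int, List[int]] = {i: [] for i in range(n)}
    for i in range(n):
        node: Optional[int] = i
        while node is not None:
            members[node].append(i)
            node = heads[node]
    # A subtree is contiguous when its size equals the width of its span.
    return all(max(m) - min(m) + 1 == len(m) for m in members.values())


# Unit 3 hangs from unit 1, but unit 2 (not under unit 1) sits between them.
print(subtree_spans_are_contiguous([None, 0, 0, 1]))
# The same units with unit 3 attached to the root instead.
print(subtree_spans_are_contiguous([None, 0, 0, 0]))
```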
19. A method of generating translation units for a computer natural language translation system, the method comprising the steps:
(a) providing examples of translations of source language text into target language text, and for each such example:
(b) generating for the source language text and target language text respective dependency graphs in which the respective language unit orders are retained;
(c) linking language units of the source language dependency graph with respective corresponding language units of the target language dependency graph where those language units are deemed directly translatable;
(d) processing the dependency graphs with a transformation algorithm to detect plural root nodes and, upon such detection, making one of the plural root nodes into a daughter, thereby producing corresponding trees having a single root node;
(e) processing said corresponding trees to detect any parent/daughter node dependency violating the planar tree projection constraint and, upon such detection, making the relevant daughter node dependent from a higher node of the tree; and
(f) for each linked pair of language units, generating a respective translation unit comprising those linked language units as respective head language units together with daughter language units, if any, and together with data relating to dependency change as the result of the operation of steps (d) and (e).
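Steps (d) and (e) above can be sketched in a few lines of Python. This is a rough illustration under several assumptions that the claim does not fix: each unit has at most one parent and the input is acyclic; the first root in surface order is kept and later roots are attached to it; and a violating daughter is re-attached to its grandparent, repeating until no violation remains. The function names and the `(daughter, old_head, new_head)` change records are hypothetical; the records stand in for the "data relating to dependency change" of step (f).

```python
from typing import List, Optional, Tuple

Heads = List[Optional[int]]               # heads[i]: parent of unit i, None for a root
Change = Tuple[int, Optional[int], int]   # (daughter, old_head, new_head)


def make_single_root(heads: Heads) -> Tuple[Heads, List[Change]]:
    """Step (d): attach every extra root to the first root (an assumption)."""
    roots = [i for i, h in enumerate(heads) if h is None]
    new_heads = list(heads)
    changes: List[Change] = []
    for extra in roots[1:]:
        new_heads[extra] = roots[0]
        changes.append((extra, None, roots[0]))
    return new_heads, changes


def is_descendant(heads: Heads, node: Optional[int], ancestor: int) -> bool:
    """True if ancestor lies on the path from node up to the root."""
    while node is not None:
        if node == ancestor:
            return True
        node = heads[node]
    return False


def violates_planarity(heads: Heads, daughter: int) -> bool:
    """True if a unit between head and daughter is outside the head's subtree."""
    head = heads[daughter]
    if head is None:
        return False
    lo, hi = sorted((head, daughter))
    return any(not is_descendant(heads, k, head) for k in range(lo + 1, hi))


def make_planar(heads: Heads) -> Tuple[Heads, List[Change]]:
    """Step (e): lift each violating daughter to a higher node until none remain."""
    new_heads = list(heads)
    changes: List[Change] = []
    changed = True
    while changed:
        changed = False
        for d in range(len(new_heads)):
            if violates_planarity(new_heads, d):
                old = new_heads[d]
                new_heads[d] = new_heads[old]  # re-attach to the grandparent
                changes.append((d, old, new_heads[d]))
                changed = True
    return new_heads, changes


# Units 0 and 2 are both roots; unit 3 depends on unit 1 across unit 2.
single, root_changes = make_single_root([None, 0, None, 1])
planar, lift_changes = make_planar(single)
print(planar, root_changes + lift_changes)
```

In a single-rooted tree every unit descends from the root, so a dependency on the root never violates the check and the lifting loop always has a grandparent to move to.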
Specification