Automatic extraction of transfer mappings from bilingual corpora
Abstract
A method of aligning nodes of dependency structures obtained from a bilingual corpus includes a two-phase approach wherein a first phase comprises associating nodes of the dependency structures to form tentative correspondences. The nodes of the dependency structures are then aligned as a function of the tentative correspondences and structural considerations. Mappings are obtained from the aligned dependency structures. The mappings can be expanded with varying types and amounts of local context in order that a more fluent translation can be obtained when translation is performed.
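The two-phase approach described in the abstract can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration, not the patented implementation: the `Node` class, the toy bilingual `lexicon`, and the specific structural test (accept an ambiguous pair only when the parents of the two nodes are already aligned to each other). Phase 1 forms tentative correspondences; phase 2 keeps the one-to-one pairs and then resolves the remaining ambiguity from the tree structure, without starting from the top or the bottom of either tree.

```python
from __future__ import annotations
from collections import Counter
from dataclasses import dataclass, field

@dataclass(eq=False)  # eq=False keeps identity hashing, so nodes can be dict keys
class Node:
    lemma: str
    children: list[Node] = field(default_factory=list)
    parent: Node | None = None

    def add(self, child: Node) -> Node:
        child.parent = self
        self.children.append(child)
        return child

def walk(node):
    """Yield a node and all of its descendants."""
    yield node
    for child in node.children:
        yield from walk(child)

def tentative_correspondences(src_root, tgt_root, lexicon):
    """Phase 1: pair every source node with every target node the lexicon allows."""
    return [(s, t) for s in walk(src_root) for t in walk(tgt_root)
            if t.lemma in lexicon.get(s.lemma, ())]

def align(pairs):
    """Phase 2: accept one-to-one pairs, then resolve the rest structurally."""
    src_count = Counter(s for s, _ in pairs)
    tgt_count = Counter(t for _, t in pairs)
    aligned = {s: t for s, t in pairs
               if src_count[s] == 1 and tgt_count[t] == 1}
    changed = True
    while changed:
        changed = False
        for s, t in pairs:
            if s in aligned or t in aligned.values():
                continue  # this tentative correspondence is eliminated
            # Assumed structural test: the parents are aligned to each other.
            if s.parent in aligned and aligned[s.parent] is t.parent:
                aligned[s] = t
                changed = True
    return aligned

# Hypothetical toy lexicon and trees: "copy the file, the folder".
lexicon = {"copy": {"copiar"}, "file": {"archivo"},
           "folder": {"carpeta"}, "the": {"el", "la"}}
copy = Node("copy")
file_ = copy.add(Node("file"))
the1 = file_.add(Node("the"))
folder = copy.add(Node("folder"))
the2 = folder.add(Node("the"))
copiar = Node("copiar")
archivo = copiar.add(Node("archivo"))
el = archivo.add(Node("el"))
carpeta = copiar.add(Node("carpeta"))
la = carpeta.add(Node("la"))

aligned = align(tentative_correspondences(copy, copiar, lexicon))
```

In this toy example both occurrences of "the" tentatively correspond to both "el" and "la"; the ambiguity is resolved only because their parents ("file" and "folder") were aligned first.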
63 Citations
54 Claims
1. A computer-implemented method of associating dependency structures from two different languages on a tangible computer readable medium, wherein the dependency structures comprise nodes organized in a hierarchical parent/child structure, the computer-implemented method comprising:
- associating nodes of the dependency structures with a computer to form tentative correspondences on the tangible medium, wherein associating includes forming tentative correspondences comprising translations of morphological bases and derivations;
- aligning nodes of the dependency structures as a function of at least one of eliminating at least one of the tentative correspondences and structural considerations on the tangible medium, wherein aligning does not require beginning with either a top or bottom node of the hierarchical parent/child structure of the dependency structures; and
- providing an output from the computer indicative of the alignment of the dependency structures.
Dependent claims: 2-22.
23. A computer-implemented method of associating dependency structures from two different languages stored on a tangible computer readable medium, wherein the dependency structures comprise nodes organized in a parent/child structure, the computer-implemented method comprising:
- aligning nodes of the dependency structures with correspondences on the tangible medium with a computer as a function of a set of rules comprising at least three different rules, wherein the dependency structures comprise a set of unaligned nodes and wherein after each of the rules are applied any aligned nodes are removed from the set of unaligned nodes before applying another rule, and wherein aligning does not require beginning with either a top or bottom node of the hierarchical parent/child structure of the dependency structures, and wherein aligning is not based on top-down processing or bottom-up processing of nodes; and
- providing an output from the computer indicative of the alignment of the dependency structures.
Dependent claims: 24-40.
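The rule-driven alignment recited in claim 23 can be sketched as a loop that applies each rule to the current set of unaligned nodes and removes newly aligned nodes before the next rule runs. The three rules below are hypothetical examples chosen for illustration; the claim only requires that there be at least three different rules. Nodes are plain string identifiers, and `src_parent` / `tgt_parent` are assumed dictionaries mapping each node to its parent (`None` for a root).

```python
def rule_one_to_one(us, ut, cand, aligned, sp, tp):
    """Hypothetical rule 1: the tentative correspondence is unambiguous."""
    live = [(s, t) for s, t in sorted(cand) if s in us and t in ut]
    return [(s, t) for s, t in live
            if sum(a == s for a, _ in live) == 1
            and sum(b == t for _, b in live) == 1]

def rule_aligned_parents(us, ut, cand, aligned, sp, tp):
    """Hypothetical rule 2: the parents of both nodes are aligned to each other."""
    return [(s, t) for s, t in sorted(cand)
            if s in us and t in ut
            and sp[s] in aligned and aligned[sp[s]] == tp[t]]

def rule_sole_children(us, ut, cand, aligned, sp, tp):
    """Hypothetical rule 3: aligned parents each have one unaligned child left."""
    out = []
    for ps, pt in aligned.items():
        kids_s = [s for s in us if sp[s] == ps]
        kids_t = [t for t in ut if tp[t] == pt]
        if len(kids_s) == 1 and len(kids_t) == 1:
            out.append((kids_s[0], kids_t[0]))
    return out

def run_rules(rules, src_parent, tgt_parent, candidates):
    unaligned_src, unaligned_tgt = set(src_parent), set(tgt_parent)
    aligned = {}
    for rule in rules:
        proposals = rule(unaligned_src, unaligned_tgt, candidates,
                         aligned, src_parent, tgt_parent)
        for s, t in proposals:
            if s in unaligned_src and t in unaligned_tgt:
                aligned[s] = t
                # Aligned nodes leave the unaligned sets before the next rule.
                unaligned_src.discard(s)
                unaligned_tgt.discard(t)
    return aligned

# Toy trees, written as child -> parent maps (illustrative data only).
src_parent = {"copy": None, "file": "copy", "the1": "file",
              "folder": "copy", "the2": "folder", "quickly": "copy"}
tgt_parent = {"copiar": None, "archivo": "copiar", "el": "archivo",
              "carpeta": "copiar", "la": "carpeta", "rapidamente": "copiar"}
candidates = {("copy", "copiar"), ("file", "archivo"), ("folder", "carpeta"),
              ("the1", "el"), ("the1", "la"), ("the2", "el"), ("the2", "la")}

result = run_rules([rule_one_to_one, rule_aligned_parents, rule_sole_children],
                   src_parent, tgt_parent, candidates)
```

No rule here walks the tree from the root down or from the leaves up; each one inspects whichever unaligned nodes currently satisfy its condition. In the toy data, "quickly" has no tentative correspondence at all and is aligned only by the third rule.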
41. A computer-implemented method of associating dependency structures from two different languages stored on a tangible computer readable medium, wherein the dependency structures comprise nodes organized in a hierarchical parent/child structure, the computer-implemented method comprising:
- aligning nodes of the dependency structures with correspondences on the tangible medium with a computer as a function of a set of rules comprising at least two different rules where aligned nodes are determined based on the parent/child structure, and wherein aligning does not require beginning with either a top or bottom node of the hierarchical parent/child structure of the dependency structures, and wherein an order of aligning nodes is based on linguistic relevance, beginning with aligning nodes having more linguistic relevance than aligning nodes having less linguistic relevance; and
- providing an output from the computer indicative of the alignment of the dependency structures.
Dependent claims: 42-54.
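The ordering requirement in claim 41 can be illustrated with a short sketch. The claim text shown here does not define how linguistic relevance is measured, so the part-of-speech ranking below is purely an assumption, as are the function names; the sketch also omits the parent/child rules and shows only the ordering step.

```python
# Hypothetical relevance ranking by part of speech; lower rank is aligned first.
RELEVANCE = {"VERB": 0, "NOUN": 1, "ADJ": 2, "DET": 3}

def order_by_relevance(pairs, pos):
    """Sort tentative (source, target) pairs so more relevant source nodes come first."""
    return sorted(pairs, key=lambda p: RELEVANCE.get(pos[p[0]], len(RELEVANCE)))

def align_in_order(pairs, pos):
    """Greedily accept pairs in relevance order, skipping already aligned nodes."""
    aligned, used_targets = {}, set()
    for s, t in order_by_relevance(pairs, pos):
        if s not in aligned and t not in used_targets:
            aligned[s] = t
            used_targets.add(t)
    return aligned

# Illustrative data only.
pos = {"the": "DET", "file": "NOUN", "copy": "VERB"}
pairs = [("the", "el"), ("file", "archivo"), ("copy", "copiar")]
ordered = order_by_relevance(pairs, pos)
alignment = align_in_order(pairs, pos)
```

Under this assumed ranking the verb pair is considered before the noun pair, and the determiner pair last, regardless of where those nodes sit in the tree.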
Specification