System, method, and product for dynamically aligning translations in a translation-memory system
First Claim
1. A computer program product comprising a computer-readable medium and instructions stored thereon, said instructions, when executed by an appropriate digital computer causing the digital computer to operate as an aligner which associates translatable source segments from one or more source files having a first format with corresponding target segments extracted from one or more target files having the first format, such association being based upon commonality of attributes associated with the segments, wherein at least one such attribute is selectable, whereby said commonality of attributes allows an imperfect match between sequences of attributes associated with the segments and at least some times excludes an attribute from a commonality of attributes necessary to achieve a match.
14 Assignments
0 Petitions
Accused Products
Abstract
A computer-implemented system, method, and product associate words, phrases, or other characters (“words”) to be translated with previous translations of such words, if a previous translation exists. The words to be translated, and their identifying attributes (such as, for example, formatting information), are extracted from one or more source files. The previous translations, and their identifying attributes, are extracted from one or more target files. If a previous translation does not exist, the words to be translated, and their attributes, are copied to create a pseudo-previous translation, with attributes. Each word to be translated is associated with a corresponding previous translation based upon commonalities of their attributes. Such words may also be associated based upon their respective locations in their respective files. Each word to be translated, and its attributes, may be stored with a previous translation in a source-target pair record of a source-target pair database. Each source-target pair record may include a propagation flag identifying whether the previous translation stored in the source-target pair record is to be propagated to other occurrences of the associated word to be translated in the source-target pair database. Each source-target pair record may also include a pointer to a page of an occurrence book having pages, wherein each page includes pointers to a common word to be translated in records of the source-target pair database.
134 Citations
68 Claims
- 1. A computer program product comprising a computer-readable medium and instructions stored thereon, said instructions, when executed by an appropriate digital computer causing the digital computer to operate as an aligner which associates translatable source segments from one or more source files having a first format with corresponding target segments extracted from one or more target files having the first format, such association being based upon commonality of attributes associated with the segments, wherein at least one such attribute is selectable, whereby said commonality of attributes allows an imperfect match between sequences of attributes associated with the segments and at least some times excludes an attribute from a commonality of attributes necessary to achieve a match.
-
35. A method for associating translatable source segments extracted from one or more source files having a first format with corresponding target segments extracted from one or more target files having the first format, comprising the steps of:
-
(1) determining identifying attributes of each translatable source segment, wherein at least one such attribute is selectable;
(2) generating a unique attribute identifier for each translatable source segment based upon its identifying attributes;
(3) determining identifying attributes of each corresponding target segment, wherein at least one such attribute is selectable;
(4) generating a unique attribute identifier for each corresponding target segment based upon its identifying attributes;
(5) comparing the unique attribute identifiers of the translatable source segments and corresponding target segments; and
(6) associating a translatable source segment with a corresponding target segment when they have the same unique attribute identifier;
whereby said commonality of attributes allows an imperfect match between sequences of attributes associated with the segments and an attribute may be excluded from a commonality of attributes necessary to achieve a match. - View Dependent Claims (36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46)
(a) identifying a first type of file of the first format; and
(b) searching for identifying attributes based on a syntactical description of the first type of file.
-
-
37. The method of claim 35, wherein step (1) comprises the steps of:
-
(a) identifying a first type of file of the first format;
(b) customizing a syntactical description of the first type of file;
(b) searching for identifying attributes based on the customized syntactical description.
-
-
38. The method of claim 37, wherein step (1)(b) comprises the step of:
(i) tagging a syntactical element with a tag that is an extension to a conventional BNF notation.
-
39. The method of claim 35, wherein step (2) comprises the step of:
(a) further generating the unique attribute identifier of each translatable source segment based upon its locations in a source file.
-
40. The method of claim 35, wherein step (3) comprises the step of:
(a) further generating the unique attribute identifier of each corresponding target segment based upon its locations in a target file.
-
41. The method of claim 35, further comprising the step of:
(7) storing each translatable source segment and its attributes with its corresponding target segment in a source-target pair record of a source-target pair database.
-
42. The method of claim 41, further comprising the step of:
(8) storing in each source-target pair record a propagation flag identifying whether the corresponding target segment stored in the source-target pair record is to be propagated to other occurrences of the associated translatable source segment in the source-target pair database.
-
43. The method of claim 41, further comprising the step of:
(8) storing in each source-target pair record a pointer to a page of an occurrence book comprising pages, each page comprising pointers to a common translatable source-segment in records of the source-target pair database.
-
44. The method of claim 35, wherein the source and target files include files in a file system and further comprising the step of:
(7) selecting the source and target files from the files in the file system.
-
45. The method of claim 35, further comprising the step of:
(7) identifying existing legacy files associated with one or more of the source and target files.
-
46. The method of claim 35, wherein step (4) comprises the steps of:
-
(a) generating hashed representations of identifying attributes of each translatable source segment; and
(b) generating hashed representations of identifying attributes of each corresponding target segment.
-
- 47. A computer system having at least one central processing unit (CPU), an operating system, at least one memory unit, and an aligner configured to cooperate with the at least one CPU and the operating system to associate translatable source segments extracted from one or more source files having a first format with corresponding target segments extracted from one or more target files having the first format, such association being based upon commonality of attributes of the segments, wherein at least one such attribute is selectable, whereby said commonality of attributes allows an imperfect match between sequences of attributes associated with the segments and an attribute may be excluded from a commonality of attributes necessary to achieve a match.
-
52. Storage media that contains software that, when executed on an appropriate computing system having at least one central processing unit (CPU), an operating system, and at least one memory unit, performs a method to associate translatable source segments extracted from one or more source files having a first format with corresponding target segments extracted from one or more target files having the first format, such association being based upon commonality of attributes of the segments, the method comprising the steps of:
-
(1) determining identifying attributes of each translatable source segment, wherein at least one such attribute is selectable;
(2) generating a unique attribute identifier for each translatable source segment based upon its identifying attributes;
(3) determining identifying attributes of each corresponding target segment, wherein at least one such attribute is selectable;
(4) generating a unique attribute identifier for each corresponding target segment based upon its identifying attributes;
(5) comparing the unique attribute identifiers of the translatable source segments and corresponding target segments; and
(6) associating a translatable source segment with a corresponding target segment when they have the same unique attribute identifier;
whereby said commonality of attributes allows an imperfect match between sequences of attributes associated with the segments and an attribute may be excluded from a commonality of attributes necessary to achieve a match. - View Dependent Claims (53, 54, 55)
(7) storing each translatable source segment and its attributes with its corresponding target segment in a source-target pair record of a source-target pair database.
-
-
54. The storage media of claim 52, wherein the method further includes the step of:
(7) storing in each source-target pair record a propagation flag identifying whether the corresponding target segment stored in the source-target pair record is to be propagated to other occurrences of the associated translatable source segment in the source-target pair database.
-
55. The storage media of claim 54, wherein the method further includes the step of:
(8) storing in each source-target pair record a pointer to a page of an occurrence book comprising pages, each page comprising pointers to a common translatable source-segment in records of the source-target pair database.
-
56. A computer program product for use with an appropriate computing system having at least one central processing unit (CPU) and at least one memory unit, the computer program product comprising a computer usable medium having embodied therein computer readable program code method steps to associate translatable source segments extracted from one or more source files having a first format with corresponding target segments extracted from one or more target files having the first format, such association being based upon commonality of attributes of the segments, the method steps comprising:
-
(1) determining identifying attributes of each translatable source segment, wherein at least one such attribute is selectable;
(2) generating a unique attribute identifier for each translatable source segment based upon its identifying attributes;
(3) determining identifying attributes of each corresponding target segment, wherein at least one such attribute is selectable;
(4) generating a unique attribute identifier for each corresponding target segment based upon its identifying attributes;
(5) comparing the unique attribute identifiers of the translatable source segments and corresponding target segments; and
(6) associating a translatable source segment with a corresponding target segment when they have the same unique attribute identifier;
whereby said commonality of attributes allows an imperfect match between sequences of attributes associated with the segments and an attribute may be excluded from a commonality of attributes necessary to achieve a match. - View Dependent Claims (57, 58, 59)
(7) storing in each source-target pair record a propagation flag identifying whether the corresponding target segment stored in the source-target pair record is to be propagated to other occurrences of the associated translatable source segment in the source-target pair database.
-
-
59. The computer program product of claim 58, wherein the method steps further comprise:
(8) storing in each source-target pair record a pointer to a page of an occurrence book comprising pages, each page comprising pointers to a common translatable source-segment in records of the source-target pair database.
- 60. A translation memory system for translating translatable source segments in one or more source files of a source project, the system having an aligner constructed and arranged to associate each of at least one translatable source segment extracted from at least one source file having a first format with a corresponding target segment extracted from at least one target file having the first format, such association being based upon commonality of attributes of the segments, wherein at least one such attribute is selectable, whereby said commonality of attributes allows an imperfect match between sequences of attributes associated with the segments and an attribute may be excluded from a commonality of attributes necessary to achieve a match.
Specification