Method and system for document translation and extraction
First Claim
1. A method for translating an electronic source document having a first format into an electronic target document having a second format, the method comprising the steps of:
- selecting portions from the source document having various constructs and formats;
extracting the selected portions from the source document;
transforming the format of the extracted portions into the second format of the electronic target document;
deducing a translation rule set from the extracted portions and the transformed portions;
applying the translation rule set to the electronic source document;
producing a first draft of the electronic target document as the translation rule set is applied to the electronic source document; and
identifying portions from the electronic source document which were unable to be translated into the target document.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system for translating an electronic document from one format to an electronic document in a second format. Selected portions from a source document are extracted and transformed into the format of a target document. A translation rule set is then deduced from the extracted portions and the transformed portions. The translation rule set is then applied to the source document, producing a first draft. If the translation rule set is unable to translate a portion from the source document, then the user is notified of the untranslatable portion. The user then provides examples of how the untranslatable portion should be translated into the format of the target document. The translation rule set is then modified in accordance with the examples. Next, the modified translation rule set is applied to the source document, producing a second draft. The above steps are repeated until the source document has been completely translated into the format of the target document or until the user is satisfied with the translation.
-
Citations
24 Claims
-
1. A method for translating an electronic source document having a first format into an electronic target document having a second format, the method comprising the steps of:
-
selecting portions from the source document having various constructs and formats; extracting the selected portions from the source document; transforming the format of the extracted portions into the second format of the electronic target document; deducing a translation rule set from the extracted portions and the transformed portions; applying the translation rule set to the electronic source document; producing a first draft of the electronic target document as the translation rule set is applied to the electronic source document; and identifying portions from the electronic source document which were unable to be translated into the target document. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method for translating an electronic source document having a first format into an electronic target document having a second format, the method comprising the steps of:
-
selecting portions from the source document having various constructs and formats; extracting the selected portions from the source document; transforming the format of the extracted portions into the second format of the electronic target document; deducing a translation rule set from the extracted portions and the transformed portions; applying the translation rule set to the electronic source document; producing a first draft of the electronic target document as the translation rule set is applied to the electronic source document; identifying portions from the electronic source document which were unable to be translated into the electronic target document; modifying the translation rule set to account for the identified untranslatable portions, the modified translation rule set being a new rule set; applying the new rule set to the electronic source document; and repeating the steps of producing, identifying, modifying, and applying, until the target document is in a desired format. - View Dependent Claims (11, 12)
-
-
13. A system for translating an electronic source document having a first format into an electronic target document having a second format, the system comprising:
-
means for selecting portions from the source document having various constructs and formats; means for extracting the selected portions from the source document; means for transforming the format of the extracted portions into the second format of the electronic target document; means for deducing a translation rule set from the extracted portions and the transformed portions; first means for applying the translation rule set to the electronic source document; first means for producing a first draft of the electronic target document as the translation rule set is applied to the electronic source document; and first means for identifying portions from the electronic source document which were unable to be translated into the target document. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21)
-
-
22. A system for translating an electronic source document having a first format into an electronic target document having a second format, the system comprising:
-
means for selecting portions from the source document having various constructs and formats; means for extracting the selected portions from the source document; means for transforming the format of the extracted portions into the second format of the electronic target document; means for deducing a translation rule set from the transformed portions; first means for applying the translation rule set to the electronic source document; means for producing a first draft of the electronic target document as the translation rule set is applied to the electronic source document; means for identifying portions from the electronic source document which were unable to be translated into the electronic target document; means for modifying the translation rule set to account for the identified untranslatable portions, the modified translation rule set being a new rule set; second means for applying the new rule set to the electronic source document; and means for repeating the steps of producing, identifying, modifying, and applying, until the target document is in a desired format. - View Dependent Claims (23, 24)
-
Specification