×

Document alignment systems for legacy document conversions

  • US 20070150443A1
  • Filed: 12/22/2005
  • Published: 06/28/2007
  • Est. Priority Date: 12/22/2005
  • Status: Active Grant
First Claim
Patent Images

1. A document alignment method comprising:

  • inputting source leaves of a source document in first tree structured format;

    inputting target leaves of a target document in second tree structured format;

    assigning a cost to each of a plurality of matches, each match comprising a pair of elements selected from the group consisting of a source leaf and a target leaf, an unmatched source leaf, and an unmatched target leaf;

    identifying matches for which a total cost is minimal, wherein each of the leaves is in at least one of the identified matches;

    identifying, from the identified matches, groups of matches wherein each match in the group has a leaf in common;

    identifying, from the groups, probable matches in which more that one target leaf is matched with at least one source leaf and probable matches where more than one source leaf is matched with a target leaf;

    outputting an alignment between leaves of the target document and leaves of the source document which includes the probable matches.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×