×

DOCUMENT PROCESSING DEVICE AND DOCUMENT PROCESSING METHOD

  • US 20090132566A1
  • Filed: 03/28/2007
  • Published: 05/21/2009
  • Est. Priority Date: 03/31/2006
  • Status: Abandoned Application
First Claim
Patent Images

1. A document processing apparatus comprising:

  • a node-pair detection unit operative to detect from a structured file described using a predetermined tag set a tag pair having a predetermined positional relation as a node pair;

    an attribute-value acquisition unit operative to index as an attribute value according to a predetermined rule an appearance mode of a node pair in a structured document file;

    an index creation unit operative to create index information associating a node pair and an attribute value thereof;

    a common-pair detection unit operative to detect as a common pair a node pair that is common in a node pair group detected from a first structured document file and a node pair group detected from a second structured document file; and

    a node-similarity-value calculation unit operative to index as a node similarity value, by referring to the index information of the first structured document file and the index information of the second structured document file, the similarity between the attribute value of the common pair in the first structured document file and the attribute value of the common pair in the second structured document file.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×