Structured-document processing
First Claim
1. A structured document processing device comprising:
- state-transition-diagram information storing means for storing, as state-transition-diagram information, information concerning a state transition diagram in which a plurality of source nodes of a source structured document transit in order of occurrence from the head of the source structured document, with regard to the source structured document which is designed to be able to be partitioned into the plurality of source nodes as a plurality of nodes concerning structured partitions;
selection-pair extracting means for extracting, as a selection pair, a pair consisting of one of a plurality of new nodes in a new structured document and corresponding one of the source nodes in the state transition diagram in accordance with order in which the new nodes occur from the head of the new structured document, and in accordance with order in which the source nodes transit from one to another in the state transition diagram, with regard to the new structured document which is designed to be able to be partitioned into the plurality of new nodes as a plurality of nodes concerning structured partitions;
matchingness check means for determining whether or not there is matchingness between the source node and the new node in each of the selection pairs, on the basis of descriptions respectively of the source node and the new node;
node qualifying means for qualifying the source node as a node to be used for the new node with regard to the source node and the new node in the selection pair, when it is decided that there is matchingness between the source node and the new node in the same selection pair;
transition-history information generating means for generating, as transition-history information, information concerning a transition history which goes on through a plurality of nodes to be used in the state transition diagram in order of transition; and
degree-of-approximateness detecting means for detecting a degree of approximateness between the source structured document and the new structured document, on the basis of the transition-history information.
1 Assignment
0 Petitions
Accused Products
Abstract
Provides structured-document processing device for detecting a relationship of approximateness between structured documents effectively. State-transition-diagram information storing means stores information concerning a state transition diagram where source nodes of a source structured document transit in order of occurrence from the head thereof. Extracting means extracting, a pair consisting of a new node and a corresponding source node in accordance with order of occurrence. Matching means determining whether the source and new nodes in the selection pair have matchingness, on the basis of their descriptions. Node qualifying means qualifying the source node as a node to be used for the new node. Transition-history information generating means generates, as transition-history information, information concerning a transition history regarding the node to be used. Degree-of-approximateness detecting means detects a degree of approximateness between the source structured document and the new structured document on the basis of the transition-history information.
-
Citations
36 Claims
-
1. A structured document processing device comprising:
-
state-transition-diagram information storing means for storing, as state-transition-diagram information, information concerning a state transition diagram in which a plurality of source nodes of a source structured document transit in order of occurrence from the head of the source structured document, with regard to the source structured document which is designed to be able to be partitioned into the plurality of source nodes as a plurality of nodes concerning structured partitions;
selection-pair extracting means for extracting, as a selection pair, a pair consisting of one of a plurality of new nodes in a new structured document and corresponding one of the source nodes in the state transition diagram in accordance with order in which the new nodes occur from the head of the new structured document, and in accordance with order in which the source nodes transit from one to another in the state transition diagram, with regard to the new structured document which is designed to be able to be partitioned into the plurality of new nodes as a plurality of nodes concerning structured partitions;
matchingness check means for determining whether or not there is matchingness between the source node and the new node in each of the selection pairs, on the basis of descriptions respectively of the source node and the new node;
node qualifying means for qualifying the source node as a node to be used for the new node with regard to the source node and the new node in the selection pair, when it is decided that there is matchingness between the source node and the new node in the same selection pair;
transition-history information generating means for generating, as transition-history information, information concerning a transition history which goes on through a plurality of nodes to be used in the state transition diagram in order of transition; and
degree-of-approximateness detecting means for detecting a degree of approximateness between the source structured document and the new structured document, on the basis of the transition-history information. - View Dependent Claims (2, 33)
-
-
3. A structured document processing method comprising:
-
a state-transition-diagram information storing step of storing, as state-transition-diagram information, information concerning a state transition diagram in which a plurality of source nodes of a source structured document transit in order of occurrence from the head of the source structured document, with regard to the source structured document which is designed to be able to be partitioned into the plurality of source nodes as a plurality of nodes concerning structured partitions;
a selection-pair extracting step of extracting, as a selection pair, a pair consisting of one of a plurality of new nodes in a new structured document and corresponding one of the source nodes in the state transition diagram in accordance with order in which the new nodes occur from the head of the new structured document, and in accordance with order in which the source nodes transit from one to another in the state transition diagram, with regard to the new structured document which is designed to be able to be partitioned into the plurality of new nodes as a plurality of nodes concerning structured partitions;
a matchingness check step of determining whether or not there is matchingness between the source node and the new node in each of the selection pairs, on the basis of descriptions respectively of the source node and the new node;
a node qualifying step of qualifying the source node as a node to be used for the new node with regard to the source node and the new node in the selection pair, when it is decided that there is matchingness between the source node and the new node in the same selection pair;
a transition-history information generating step of generating, as transition-history information, information concerning a transition history which goes on through a plurality of nodes to be used in the state transition diagram in order of transition; and
a degree-of-approximateness detecting step of detecting a degree of approximateness between the source structured document and the new structured document, on the basis of the transition-history information. - View Dependent Claims (4, 30)
-
-
5. A structured-document processing device, which parses a structured document in response to a request from an application program to parse the structured document, thereby posting a result of parsing the structured document to the application program, the structured-document processing device comprising:
-
storage means for storing, as source contents, contents of a source structured document as a structured document which has been parsed, and for storing, as a source result of parsing, a result of parsing the same source structured document;
distinction means for comparing contents of a new structured document and the contents of the source structured document by use of their respective contents parts which have a predetermined corresponding relationship therebetween, and for distinguishing each of the contents parts of the new structured document between a first contents part and a second contents parts;
the first contents part being a contents part of the new structured document for which the source result of parsing concerning the source structured document can be used, the second contents part being a contents part of the new structured document for which the result of parsing concerning the source structured document can not be used, with regard to the new structured document as a structured document which the application program has requested to be parsed;
parsing means for parsing the second contents part of the new structured document, and for outputting a new parsed part as the result of the parsing; and
posting means for posting to the application program a part representing the source result of parting concerning the source structured document corresponding to the first contents part as a result of parsing the first content of the new structured document, and for posting to the application program the new parsed part as a result of parsing the second content part of the new structured document. - View Dependent Claims (34)
-
-
6. A structured-document processing device, which parses a structured document in response to a request from an application program to parse the structured document, thereby posting a series of events, as a result of parsing the structured document, to the application program, the structured-document processing device comprising:
-
storage means for storing, as source contents, contents of a source structured document as a structured document which has been parsed, and for storing, as a source parsed partition, a result of parsing each of the structured partitions of the same source structured document;
matching check means for performing a matching check on whether or not source contents of the source structured document and contents of a new structured document as a structured document, which the application program has requested to be parsed, match each other by comparing one of the structured partitions of the source structured document and corresponding one of the structured partitions of the new structured document, one by one in order from the heads respectively of the source and new structured documents;
classification means for classifying each of the structured partitions of the new structured document into two categories, that is a first-class structured partition and a second-class structured partition, on the basis of a result of performing the matching check concerning the structured partition;
parsing means for parsing a structured partition which the classification means has classified as a second-class structured partition, and for generating a new parsed partition as a result of parsing the structured partition; and
posting means for posting to the application program an event concerning each of the structured partitions in order from a structured partition in the head of the new structured document to a structured partition in the tail of the same new structured document, the posting means posting to the application program an event concerning a source parsed partition of a source structured document corresponding to each of the structured partitions of the new structured document as a result of parsing the structured partition, if the structured partition is a first-class structured partition, the posting means posting to the application program an event concerning a new parsed partition concerning a structured partition as a result of parsing the structured partition, if the structured partition is a second-class structured partition. - View Dependent Claims (7, 8, 9, 10, 11, 12, 13, 35)
-
-
14. A structured document processing method, which parses a structured document in response to a request from an application program to parse the structured document, thereby posting a result of parsing the structured document to the application program, the structured-document processing method comprising:
-
a storage step of storing, as source contents, contents of a source structured document as a structured document which has been parsed, and for storing, as a source result of parsing, a result of parsing the same source structured document;
a distinction step of comparing contents of a new structured document and the contents of the source structured document by use of their respective contents parts which have a predetermined corresponding relationship therebetween, and for distinguishing each of the contents parts of the new structured document between a first contents part and a second contents parts;
the first contents part being a contents part of the new structured document for which the source result of parsing concerning the source structured document can be used, the second contents part being a contents part of the new structured document for which the result of parsing concerning the source structured document can not be used, with regard to the new structured document as a structured document which the application program has requested to be parsed;
a parsing step of parsing the second contents part of the new structured document, and for outputting a new parsed part as the result of the parsing; and
a posting step of posting to the application program a part representing the source result of parting concerning the source structured document corresponding to the first contents part as a result of parsing the first content of the new structured document, and for posting to the application program the new parsed part as a result of parsing the second content part of the new structured document.
-
-
15. A structured-document processing method, which parses a structured document in response to a request from an application program to parse the structured document, thereby posting a series of events, as a result of parsing the structured document, to the application program, the structured-document processing method comprising:
-
a storage step of storing, as source contents, contents of a source structured document as a structured document which has been parsed, and for storing, as a source parsed partition, a result of parsing each of the structured partitions of the same source structured document;
a matching check step of performing a matching check on whether or not source contents of the source structured document and contents of a new structured document as a structured document, which the application program has requested to be parsed, match each other by comparing one of the structured partitions of the source structured document and corresponding one of the structured partitions of the new structured document, one by one in order from the heads respectively of the source and new structured documents;
a classification step of classifying each of the structured partitions of the new structured document into two categories, that is a first-class structured partition and a second-class structured partition, on the basis of a result of performing the matching check concerning the structured partition;
a parsing step of parsing a structured partition which the classification means has classified as a second-class structured partition, and for generating a new parsed partition as a result of parsing the structured partition; and
a posting step of posting to the application program an event concerning each of the structured partitions in order from a structured partition in the head of the new structured document to a structured partition in the tail of the same new structured document, the posting step posting to the application program an event concerning a source parsed partition of a source structured document corresponding to each of the structured partitions of the new structured document as a result of parsing the structured partition, if the structured partition is a first-class structured partition, the posting step posting to the application program an event concerning a new parsed partition concerning a structured partition as a result of parsing the structured partition, if the structured partition is a second-class structured partition. - View Dependent Claims (16, 17, 18, 19, 20, 21, 31)
-
-
22. A structured document processing device comprising:
-
state-transition-diagram information storing means for storing, as state-transition-diagram information, information concerning a state transition diagram in which a plurality of source nodes of each of a plurality of source structured documents transit from one to another in order of occurrence from the head of the source structured document with regard to the plurality of source structured documents, each of which is designed to be able to be partitioned into the plurality of source nodes as a plurality of nodes concerning the respective structured partitions, the state-transition-diagram information storing means for storing a state transition, which would occur commonly in some of the plurality of source structured documents, as a single common state transition in the state transition diagram;
selection-pair extracting means for extracting, as a selection pair, a pair consisting of one of a plurality of new nodes of a new structured document and corresponding one of the plurality of source nodes in the state transition diagram, in accordance with order in which the new nodes occur from the head of the new structured document, and in accordance with order in which the source nodes transit from one to anther in the state transition diagram, with regard to the new structured document which is designed to be able to be partitioned into the plurality of new nodes as a plurality of nodes concerning the respective structured partitions;
matchingness check means for determining whether or not there is matchingness between the source node and the new node in each of the selection pairs, on the basis of descriptions respectively of the source node and the new node;
node qualifying means for qualifying the source node in a selection pair as a node to be used for the new node in the same selection pair when it is decided that there is matchingness between the source node and the new node in the selection pair;
transition-history information generating means for generating, as transition-history information, information concerning a transition history which goes on through a plurality of nodes to be used in the state transition diagram in order of transition;
structured-document-to-be-used selecting means for selecting a single source structured document as a structured document to be used for the new structured document on the basis of the transition-history information; and
source-structured-document using means for using, as a result of parsing a new node of the new structured document, a result of parsing a node to be used for the new node in the structured document to be used for the new structured document. - View Dependent Claims (23, 24, 25, 29, 36)
-
-
26. A structured document processing method comprising:
-
a state-transition-diagram information storing step of storing, as state-transition-diagram information, information concerning a state transition diagram in which a plurality of source nodes of each of a plurality of source structured documents transit from one to another in order of occurrence from the head of the source structured document with regard to the plurality of source structured documents, each of which is designed to be able to be partitioned into the plurality of source nodes as a plurality of nodes concerning the respective structured partitions, and storing a state transition, which would occur commonly in some of the plurality of source structured documents, as a single common state transition in the state transition diagram;
a selection-pair extracting step of extracting, as a selection pair, a pair consisting of one of a plurality of new nodes of a new structured document and corresponding one of the plurality of source nodes in the state transition diagram, in accordance with order in which the new nodes occur from the head of the new structured document, and in accordance with order in which the source nodes transit from one to anther in the state transition diagram, with regard to the new structured document which is designed to be able to be partitioned into the plurality of new nodes as a plurality of nodes concerning the respective structured partitions;
a matchingness check step of determining whether or not there is matchingness between the source node and the new node in each of the selection pairs, on the basis of descriptions respectively of the source node and the new node;
a node qualifying step of qualifying the source node in a selection pair as a node to be used for the new node in the same selection pair when it is decided that there is matchingness between the source node and the new node in the selection pair;
a transition-history information generating step of generating, as transition-history information, information concerning a transition history which goes on through a plurality of nodes to be used in the state transition diagram in order of transition;
a structured-document-to-be-used selecting step of selecting a single source structured document as a structured document to be used for the new structured document on the basis of the transition-history information; and
a source-structured-document using step of using, as a result of parsing a new node of the new structured document, a result of parsing a node to be used for the new node in the structured document to be used for the new structured document. - View Dependent Claims (27, 28, 32)
-
Specification