Methods and apparatus for mapping source schemas to a target schema using schema embedding
First Claim
1. A method for mapping one or more source schemas to at least one target schema, comprising:
- obtaining said one or more source schemas; and
mapping one or more edges in said one or more source schemas to one or more paths in said at least one target schema, wherein said mapping does not violate one or more prefix-free conditions and one or more path-type conditions, wherein said one or more prefix-free conditions require that for any two edges (A, B) and (A, B′
) in said one or more source schemas, the path (A,B) in said one or more source schemas is not a prefix of the path (A,B) in said at least one target schema and said one or more path-type conditions requires that each edge and a corresponding path to each edge in said one or more source schemas have a same type.
5 Assignments
0 Petitions
Accused Products
Abstract
Methods and apparatus are provided for mapping XML source documents to target documents using schema embeddings. According to one aspect of the invention, one or more edges in the one or more source schemas are mapped to one or more paths in at least one target schema. The disclosed mapping techniques ensure that (i) one or more source documents that conform to one or more of the source schemas can be recovered from one or more target documents that conform to the at least one target schema, if a mapping exists between the one or more of the source schemas and the at least one target schema; (ii) queries on one or more source documents that conform to one or more of the source schemas in a given query language can be answered on one or more target documents that conform to the at least one target schema; and (iii) the one or more target documents conform to a target schema.
-
Citations
23 Claims
-
1. A method for mapping one or more source schemas to at least one target schema, comprising:
-
obtaining said one or more source schemas; and mapping one or more edges in said one or more source schemas to one or more paths in said at least one target schema, wherein said mapping does not violate one or more prefix-free conditions and one or more path-type conditions, wherein said one or more prefix-free conditions require that for any two edges (A, B) and (A, B′
) in said one or more source schemas, the path (A,B) in said one or more source schemas is not a prefix of the path (A,B) in said at least one target schema and said one or more path-type conditions requires that each edge and a corresponding path to each edge in said one or more source schemas have a same type. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A method for identifying one or more prefix-free paths in a directed acyclic graph (DAG) structure, comprising:
-
performing a depth-first-search on an input graph such that for each edge connecting to a source node (s) one or more paths are identified to target nodes in a set of target vertices, (Ltar); and identifying prefix-free paths among said identified paths, wherein said prefix-free paths require that for any two edges (A, B) and (A, B′
) in one or more source schemas, the path (A,B) in said one or more source schemas is not a prefix of the path (A,B) in at least one target schema. - View Dependent Claims (15)
-
-
16. An apparatus for mapping one or more source schemas to one or more target schemas, the apparatus comprising:
-
a memory; and at least one processor, coupled to the memory, operative to; obtain said one or more source schemas; and map one or more edges in said one or more source schemas to one or more paths in said one or more target schemas, wherein said mapping does not violate one or more prefix-free conditions and one or more path-type conditions, wherein said one or more prefix-free conditions require that for any two edges (A, B) and (A, B′
) in said one or more source schemas, the path (A,B) in said one or more source schemas is not a prefix of the path (A,B) in said at least one target schema and said one or more path-type conditions requires that each edge and a corresponding path to each edge in said one or more source schemas have a same type. - View Dependent Claims (17, 18, 19, 20, 21, 22, 23)
-
Specification