Efficient type annontation of XML schema-validated XML documents without schema validation
First Claim
1. A method for compiling a structured document schema into type annotation records comprising steps of:
- a. building a type hierarchy ordered tree from a structured document schema from type record wherein each of said type records contains typing tuples, b. creating a typing set containing said typing tuples in said type hierarchy ordered tree, c. creating an ambiguity typing sequence for said typing tuples sharing a common first field and having a unique second field, d. arranging said ambiguity typing sequence based on an offset number assigned to a third field of each of said typing tuples in said ambiguity typing sequence, e. extracting a second field from each of said typing tuples accorded to sorted order of said ambiguity typing sequences, and creating a type indexing data structure populated with said extracted second field to map each type name to a type.
1 Assignment
0 Petitions
Accused Products
Abstract
Type annotation record information storage for annotated automaton encoding for high-performance XML schema validation is optimized in a space efficient aspect. Subsequent to type annotation record information organization, type annotation records are used for type annotation of validated XML documents, either by implementing annotation records and type annotation part of an algorithm only, or by skipping one or more validation steps in a full validation implementation. Given a schema context, a type annotation may be performed for a validated XML fragment as opposed to an entire document. In addition, default features such as attribute and type are supported.
-
Citations
26 Claims
-
1. A method for compiling a structured document schema into type annotation records comprising steps of:
-
a. building a type hierarchy ordered tree from a structured document schema from type record wherein each of said type records contains typing tuples, b. creating a typing set containing said typing tuples in said type hierarchy ordered tree, c. creating an ambiguity typing sequence for said typing tuples sharing a common first field and having a unique second field, d. arranging said ambiguity typing sequence based on an offset number assigned to a third field of each of said typing tuples in said ambiguity typing sequence, e. extracting a second field from each of said typing tuples accorded to sorted order of said ambiguity typing sequences, and creating a type indexing data structure populated with said extracted second field to map each type name to a type. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A method for a database engine to perform type annotation of structured documents or structured document fragments in the absence of full schema validation, comprising steps of:
-
a. building a type annotation data structure comprising a structured document type hierarchy, a type indexing data structure, and a type array, b. mapping a type name string to each element type in said structured document type hierarchy, and annotating a structured document or fragment using type annotation records obtained from said type annotation data structure and said type name mapping. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24)
-
-
25. An article of manufacture comprising a computer usable medium having computer readable program code embodied therein which implements the compilation of a structured document schema into type annotation records comprising modules to execute the steps of:
-
a. building a type hierarchy ordered tree from a structured document schema from type record wherein each of said type records contains typing tuples, b. creating a typing set containing said typing tuples in said type hierarchy ordered tree, c. creating an ambiguity typing sequence for said typing tuples sharing a common first field and having a unique second field, d. arranging said ambiguity typing sequence based on an offset number assigned to a third field of each of said typing tuples in said ambiguity typing sequence, e. extracting a second field from each of said typing tuples accorded to sorted order of said ambiguity typing sequences, and creating a type indexing data structure populated with said extracted second field to map each type name to a type.
-
-
26. An article of manufacture comprising a computer usable medium having computer readable program code embodied therein which comprising modules to execute the steps of:
-
a. loading type annotation data structure into a runtime validation engine, b. creating an empty offset stack data structure, c. pushing a record containing a value of zero onto said offset stack, d. using a token from an XML document or document fragment to key a search on a typing index to determine an index for said token, e. incrementing said index by value in topmost record of offset stack if said token is indicated to be of ambiguous type, and indicating element type in a typing array at said index location.
-
Specification