System and method for schema-driven compression of extensible mark-up language (XML) documents
First Claim
Patent Images
1. A method of compressing an extensible markup language (XML) document, comprising:
- compressing an XML document and its associated schema information such that information is a markup portion therein is maintained in a compressed form to allow the document to be reconstructed;
wherein, during said compressing, said markup portion and a non-markup portion of said document are separated, and the non-markup portion is compressed using a first compression method and the markup portion is compressed using a second compression method;
wherein said mark-up portion comprises structured component information; and
wherein said schema information associated with the document is used with compressing the structure component to obtain a predetermined compression rate while simultaneously retaining the structure.
3 Assignments
0 Petitions
Accused Products
Abstract
A method (and system) for compressing an extensible markup language (XML) document, includes compressing an XML document such that information in a markup portion therein is maintained in a compressed form to allow the document to be reconstructed. During the compressing, the markup portion and a non-markup portion of the document are separated, and the non-markup component is compressed using a first compression method and the markup component is compressed using a second compression method.
235 Citations
42 Claims
-
1. A method of compressing an extensible markup language (XML) document, comprising:
-
compressing an XML document and its associated schema information such that information is a markup portion therein is maintained in a compressed form to allow the document to be reconstructed;
wherein, during said compressing, said markup portion and a non-markup portion of said document are separated, and the non-markup portion is compressed using a first compression method and the markup portion is compressed using a second compression method;
wherein said mark-up portion comprises structured component information; and
wherein said schema information associated with the document is used with compressing the structure component to obtain a predetermined compression rate while simultaneously retaining the structure. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40)
-
-
41. A system for compressing an extensible markup language (XML) document, comprising:
-
means for compressing an XML document such that information comprising a markup portion therein is maintained in a compressed form to allow the document to be reconstructed; and
means, during compressing, for separating said markup portion and a non-markup portion of said document, wherein said compressing means compresses the non-markup portion using a first compression method and compresses the markup portion using a second compression method, wherein said mark-up portion comprises structured component information, and wherein schema information associated with the document is used with compressing the structure component to obtain a predetermined compression rate while simultaneously retaining the structure.
-
-
42. A programmable storage medium tangibly embodying a program of machine-readable instructions executable by a digital processing apparatus to perform compressing of an extensible markup language (XML) document, said method comprising:
-
compressing an XML document such that information comprising a markup portion therein is maintained in a compressed form to allow the document to be reconstructed, wherein, during said compressing, said markup portion and a non-markup portion of said document are separated, and the non-markup portion is compressed using a first compression method and the markup portion is compressed using a second compression method, wherein said mark-up portion comprises structured component information, and wherein schema information associated with the document is used with compressing the structure component to obtain a predetermined compression rate while simultaneously retaining the structure.
-
Specification