System and method of processing streaming XML data
First Claim
Patent Images
1. A method of incrementally processing structured documents, the method comprising:
- receiving partial data of one of a plurality of structured documents, wherein each of the documents comprises a plurality of elements that define a hierarchy;
identifying which of the structured documents is associated with the partial data;
determining which of the elements of the document are included in the received partial data;
determining a position of each of the received elements in the hierarchy;
identifying a specified portion of the document, said specified portion comprising at least one of the received elements;
generating an isomorphic digest value based on the specified portion, wherein the isomorphic digest value comprises a cryptographic hash value based on the specified portion of the document;
providing, for processing, the isomorphic digest value, the received elements and the position of each of the received elements in the hierarchy; and
identifying data associated with at least one other document based on the isomorphic digest value.
9 Assignments
0 Petitions
Accused Products
Abstract
Embodiments include methods and systems for processing XML documents. One embodiment is a system that includes a method of efficiently processing XML documents received concurrently from a plurality of network connections in the form of streams of data. Other embodiments include systems configured to perform such processing of streamed XML documents. Other embodiments include systems and methods of efficiently performing document processing using digests for identifying XML document structure.
13 Citations
20 Claims
-
1. A method of incrementally processing structured documents, the method comprising:
-
receiving partial data of one of a plurality of structured documents, wherein each of the documents comprises a plurality of elements that define a hierarchy; identifying which of the structured documents is associated with the partial data; determining which of the elements of the document are included in the received partial data; determining a position of each of the received elements in the hierarchy; identifying a specified portion of the document, said specified portion comprising at least one of the received elements; generating an isomorphic digest value based on the specified portion, wherein the isomorphic digest value comprises a cryptographic hash value based on the specified portion of the document; providing, for processing, the isomorphic digest value, the received elements and the position of each of the received elements in the hierarchy; and identifying data associated with at least one other document based on the isomorphic digest value. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. An apparatus for processing structured documents received over a network, the apparatus comprising:
-
a network interface configured to receive partial data of one of a plurality of structured documents via at least one network, wherein each of the documents comprises a plurality of elements that define a hierarchy; a memory configured to store data associated with each of the documents; a processor configured to; identify which of the structured documents is associated with the partial data; determine which of the elements of the document are included in the received partial data; determine a position of each of the received elements in the hierarchy; store the position and at least a portion of the determined elements in the memory; identify a specified portion of the document, said specified portion comprising at least one of the received elements; generate an isomorphic digest value based on the specified portion, wherein generating the isomorphic digest value comprises generating a cryptographic hash value based on the specified portion of the document; process the partial data of the identified structured document based at least partly on the isomorphic digest value, the received elements and the position of each of the received elements in the hierarchy; and identify data associated with at least one other document based on the isomorphic digest value. - View Dependent Claims (12, 13, 14, 15, 16)
-
-
17. A content processor for incrementally processing structured documents, the processor comprising:
-
means for receiving partial data of one of a plurality of structured documents, wherein each of the documents comprises a plurality of elements that define a hierarchy; means for storing data associated with each of the documents; means for processing the received partial data, wherein the processing means is configured to; identify which of the structured documents is associated with the partial data; determine which of the elements of the document are included in the received partial data; determine a position of each of the received elements in the hierarchy; store the position and at least a portion of the determined elements in the storing means; and process the partial data of the identified structured document based at least partly on the received elements and the position of each of the received elements in the hierarchy; identify a specified portion of the document, said specified portion comprising at least one of the received elements; generate an isomorphic digest value based on the specified portion, wherein the isomorphic digest value comprises a cryptographic hash value based on the specified portion of the document; and process the received portion of the document based at least partly on the isomorphic digest value. - View Dependent Claims (18, 19, 20)
-
Specification