Methods and apparatus for converting markup language data to an intermediate representation
First Claim
1. A method for processing extensible markup language (XML) data, the method comprising:
- receiving a character stream of markup language data;
applying sequences of characters of the character stream to a set of state machines, the set of state machines including a plurality of construct state machines responsible for processing respective XML constructs identified by the sequences of characters;
operating a character provider in a character processor to provide characters from the character stream to the set of state machines;
operating a primary state machine to control to control invocation of the respective construct state machines to process different types of XML constructs encountered during reception of the character stream of XML data; and
producing, from application of the sequences of characters to the set of state machines, an intermediate representation of the markup language constructs identified by the sequence of characters of the character stream of markup language data, the intermediate representation having encoded items including type, length, value representations of the XML constructs and representative of the original stream of XML data.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems, methods and apparatus provide a character process for processing markup language data, such as XML data, by receiving a character stream of markup language data and applying sequences of characters of the character stream to a set of state machines. The set of state machines includes a plurality of construct state machines responsible for processing respective markup language constructs identified by the sequences of characters. The character processor produces, from application of the sequences of characters to the set of state machines, an intermediate representation of the markup language constructs identified by the sequence of characters of the character stream of markup language data. The intermediate representation contains encoded items containing type, length, value representations representative of constructs within the character stream of markup language data.
-
Citations
28 Claims
-
1. A method for processing extensible markup language (XML) data, the method comprising:
-
receiving a character stream of markup language data; applying sequences of characters of the character stream to a set of state machines, the set of state machines including a plurality of construct state machines responsible for processing respective XML constructs identified by the sequences of characters; operating a character provider in a character processor to provide characters from the character stream to the set of state machines; operating a primary state machine to control to control invocation of the respective construct state machines to process different types of XML constructs encountered during reception of the character stream of XML data; and producing, from application of the sequences of characters to the set of state machines, an intermediate representation of the markup language constructs identified by the sequence of characters of the character stream of markup language data, the intermediate representation having encoded items including type, length, value representations of the XML constructs and representative of the original stream of XML data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A character processor device comprising:
-
an input interface for receiving a character stream of extensible markup language (XML) data; logic processing coupled to the input interface and configured to receive and apply sequences of characters of the character stream to a set of state machines encoded within the logic processing, the set of state machines including a plurality of construct state machines responsible for processing respective XML constructs identified by the sequences of characters; the logic processing operating a primary state machine to control to control invocation of the respective construct state machines to process different types of XML constructs encountered during reception of the character stream of XML data; the logic processing operating a character provider in a character processor to provide characters from the character stream to the set of state machines; and the logic processing producing, from an output interface coupled to the logic processing, from application of the sequences of characters to the set of state machines, an intermediate representation of the markup language constructs identified by the sequence of characters of the character stream of XML language including data type, length, value representations of the XML constructs, the intermediate representation containing encoded items representative of the original stream of markup language data. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25)
-
-
26. A computer program product having a computer-readable medium including computer program logic encoded thereon that, when executed on processor within a computerized device, provides a character processor that processes extensible markup language (XML) data by performing the operations of:
-
receiving a character stream of markup language data; applying sequences of characters of the character stream to a set of state machines, the set of state machines including a plurality of construct state machines responsible for processing respective XML constructs identified by the sequences of characters; operating a character provider in a character processor to provide characters from the character stream to the set of state machines; operating a primary state machine to control to control invocation of the respective construct state machines to process different types of XML constructs encountered during reception of the character stream of XML data; and producing, from application of the sequences of characters to the set of state machines, an intermediate representation of the markup language constructs identified by the sequence of characters of the character stream of markup language data, the intermediate representation containing encoded items including type, length, value including type, length, value representations of the XML constructs and representative of the original stream of XML data. - View Dependent Claims (27)
-
-
28. A method for processing extensible markup language (XML) data, the method comprising:
-
receiving a character stream of markup language data; applying sequences of characters of the character stream to a set of state machines, the set of state machines including a plurality of construct state machines responsible for processing respective XML constructs identified by the sequences of characters; operating a character provider in a character processor to provide multiple characters from the character stream per state machine cycle to the set of state machines; operating a primary state machine to control to control invocation of the respective construct state machines to process different types of XML constructs encountered during reception of the character stream of XML data; and producing from application of the sequences of characters to the set of state machines, an intermediate representation of the markup language constructs identified by the sequence of characters of the character stream of markup language data, the intermediate representation having encoded items including type, length, value representations of the XML constructs and representative of the original stream of XML data.
-
Specification