Method and apparatus for processing a parseable document
Abstract
A data processing method comprises receiving an electronically parseable document, scanning the document according to at least one predefined rule to determine if the document is suspicious, and, if the document is determined not to be suspicious, parsing the document with a first parser, and, if the document is determined to be suspicious, parsing the document with a second parser.
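The scan-then-route flow described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the names `route_document`, `too_many_tags`, `fast_parser`, and `guarded_parser` are hypothetical, and the example rule is a placeholder for whatever predefined rules a real scanner would apply.

```python
from typing import Callable, Sequence

Rule = Callable[[str], bool]      # hypothetical: returns True when a rule flags the document
Parser = Callable[[str], object]  # hypothetical: any callable that parses a document string


def route_document(document: str, rules: Sequence[Rule],
                   first_parser: Parser, second_parser: Parser) -> object:
    """Scan with the predefined rules, then hand the document to exactly one parser."""
    suspicious = any(rule(document) for rule in rules)
    # A suspicious document goes straight to the second parser and never
    # reaches the first one.
    parser = second_parser if suspicious else first_parser
    return parser(document)


# Illustrative stand-ins only; none of these come from the patent text.
def too_many_tags(document: str) -> bool:
    return document.count("<") > 1000


def fast_parser(document: str) -> tuple:
    return ("first", len(document))


def guarded_parser(document: str) -> tuple:
    return ("second", len(document))
```

Calling `route_document(doc, [too_many_tags], fast_parser, guarded_parser)` sends an ordinary document to `fast_parser` and a flagged one to `guarded_parser`; in either case only one parser sees the document.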
19 Claims
1. A data processing method, comprising:
- receiving an electronically parseable document; and
- scanning the document according to at least one predefined rule to determine if the document is suspicious,
- wherein if the document is determined not to be suspicious, passing the document from a scanner to a first parser, and parsing the document with the first parser, and
- wherein if the document is determined to be suspicious, passing the document from the scanner to a second parser without passing the document from the scanner to the first parser, and parsing the document with the second parser,
- wherein the document is determined to be suspicious if the document includes a property that may cause a receiving computer system to fail,
- wherein the document is determined to be suspicious if the document comprises a well-formed document within a plurality of rules, but when the document is parsed, the document fails or slows down the receiving computer system to fail, and
- wherein the property includes at least one of a length of an attribute name, a number of an attribute per element, and a number of nested elements that exceeds a predetermined threshold.
Dependent claims: 2, 3, 4, 5, 6
7. A data processing apparatus, comprising:
- a scanner for receiving an electronically parseable document, and for scanning the document according to at least one predefined rule to determine if the document is suspicious;
- a first parser for parsing the document if the document is determined not to be suspicious; and
- a second parser for parsing the document if the document is determined to be suspicious,
- wherein the scanner is adapted to determine if the document is suspicious before passing the document to one of the first parser or the second parser,
- wherein if the document is determined to be suspicious, the document is passed from the scanner to the second parser without first passing the document from the scanner to the first parser,
- wherein the document is determined to be suspicious if the document includes a property that may cause a receiving computer system to fail,
- wherein the document is determined to be suspicious if the document comprises a well-formed document within a plurality of rules, but when the document is parsed, the document fails or slows down the receiving computer system to fail, and
- wherein the property includes at least one of a length of an attribute name, a number of an attribute per element, and a number of nested elements that exceeds a predetermined threshold.
Dependent claims: 8, 9, 10, 11, 12
13. A tangible computer-readable storage non-transitory medium encoded with a computer program which, when loaded in a processor, configures said processor to perform a method, said method comprising:
- receiving an electronically parseable document; and
- scanning the document according to at least one predefined rule to determine if the document is suspicious,
- wherein if the document is determined not to be suspicious, passing the document from a scanner to a first parser, and parsing the document with the first parser,
- wherein if the document is determined to be suspicious, passing the document from the scanner to a second parser, and parsing the document with the second parser without passing the document from the scanner to the first parser,
- wherein the document is determined to be suspicious if the document includes a property that may cause a receiving computer system to fail,
- wherein the document is determined to be suspicious if the document comprises a well-formed document within a plurality of rules, but when the document is parsed, the document fails or slows down the receiving computer system to fail, and
- wherein the property includes at least one of a length of an attribute name, a number of an attribute per element, and a number of nested elements that exceeds a predetermined threshold.
Dependent claims: 14, 15, 16, 17, 18, 19
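The three properties named in the independent claims (attribute name length, number of attributes per element, and nesting depth) lend themselves to a lightweight pre-parse check. The sketch below is one possible reading, assuming an XML-like document held in a string. The `Thresholds` values, the `is_suspicious` name, and the regex-based tag scan are all assumptions made for illustration; the claims specify neither threshold values nor a scanning technique. The scan is deliberately rough: it does not handle `>` inside attribute values, and it does not skip markup inside comments or CDATA sections.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Thresholds:
    # Hypothetical limits; the claims only say "a predetermined threshold".
    max_attr_name_length: int = 64
    max_attrs_per_element: int = 32
    max_nesting_depth: int = 100


# Opening, closing, and self-closing tags. Declarations, comments, and
# processing instructions do not match because of the name-start check.
TAG_RE = re.compile(r"<(/?)([A-Za-z_][\w:.-]*)([^<>]*?)(/?)>")
# An attribute name followed by a quoted value; the value is consumed so that
# text such as "a=b" inside a value is not counted as another attribute.
ATTR_RE = re.compile(r"""([A-Za-z_][\w:.-]*)\s*=\s*(?:"[^"]*"|'[^']*')""")


def is_suspicious(document: str, limits: Thresholds = Thresholds()) -> bool:
    """Return True if any of the three properties exceeds its threshold."""
    depth = 0
    for match in TAG_RE.finditer(document):
        closing, _name, body, self_closing = match.groups()
        if closing:
            depth = max(depth - 1, 0)
            continue
        attr_names = ATTR_RE.findall(body)
        if len(attr_names) > limits.max_attrs_per_element:
            return True
        if any(len(name) > limits.max_attr_name_length for name in attr_names):
            return True
        if not self_closing:
            depth += 1
            if depth > limits.max_nesting_depth:
                return True
    return False
```

A check of this kind flags documents that are well-formed yet expensive to parse, such as one with thousands of nested elements, before either parser is invoked. A production scanner would more likely use a streaming tokenizer than regular expressions.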
Specification