Methods and systems for javascript parsing
First Claim
1. An apparatus for scanning an executable script object comprising:
- a digital dictionary configured to store tokens each comprising a possible piece of a uniform resource locator (URL);
- a script parser configured to:
  receive an executable script object comprising text,
  parse the text of the executable script object to find an instance of one of the tokens in the text,
  continue to parse the text of the executable script object adjacent to the instance of the token to find a syntax element, and
  construct a candidate URL from the instance of the token and the syntax element;
- a URL rules detector configured to:
  store rules for validating URLs, and
  utilize the stored rules to determine whether the candidate URL is a valid URL.
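The three elements of the claim (dictionary, script parser, URL rules detector) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and does not come from the patent: the dictionary contents, the choice of string-literal quote characters as the "syntax element", and the validation rules are all hypothetical.

```python
from urllib.parse import urlsplit

# Hypothetical dictionary contents: pieces that commonly begin a URL.
TOKEN_DICTIONARY = ("http://", "https://")

# Assumed syntax elements: characters that close a JavaScript string literal.
STRING_DELIMITERS = "\"'`"


def find_candidates(script_text):
    """Return candidate URLs: a dictionary token plus the text up to the next delimiter."""
    candidates = []
    for token in TOKEN_DICTIONARY:
        start = script_text.find(token)
        while start != -1:
            end = start + len(token)
            # Keep parsing the adjacent text until a syntax element is found.
            while end < len(script_text) and script_text[end] not in STRING_DELIMITERS:
                end += 1
            if end < len(script_text):  # a closing delimiter was actually found
                candidates.append(script_text[start:end])
            start = script_text.find(token, end)
    return candidates


def is_valid_url(candidate):
    """Hypothetical stored rules: no whitespace, http(s) scheme, dotted host name."""
    if any(ch.isspace() for ch in candidate):
        return False
    try:
        parts = urlsplit(candidate)
    except ValueError:
        return False
    return parts.scheme in ("http", "https") and "." in parts.netloc


script = (
    'var img = new Image(); img.src = "http://example.com/a.png"; '
    'var s = "http:// not a url";'
)
valid = [c for c in find_candidates(script) if is_valid_url(c)]
print(valid)  # ['http://example.com/a.png']
```

The second string literal contains a dictionary token and therefore yields a candidate, but the rules reject it. This shows why the claim separates candidate construction from validation.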
Abstract
The present invention relates to systems, apparatus, and methods of parsing a script within an HTML page. The method includes receiving a script object, where the script object includes text. The method further includes parsing the text of the script object for tokens and comparing the parsed tokens with a valid tokens list to determine locations within the text of the script object that include potential uniform resource locators (URLs). The method includes parsing before and after the determined locations until complete URLs are constructed, and forwarding the complete URLs to a detector. The detector then determines whether the complete URLs are valid. The method then sends object retrieval requests to websites associated with the valid complete URLs.
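The method in the abstract differs from a simple prefix scan in that a matched token may sit in the middle of a URL, so the text is parsed in both directions from the match. The sketch below illustrates that flow under stated assumptions: the valid-token list, the boundary characters, and the validity check are hypothetical, and the final step only builds a list of planned `GET` requests rather than sending anything over the network.

```python
import re
from urllib.parse import urlsplit

# Hypothetical valid-token list: fragments that hint at a URL nearby.
VALID_TOKENS = {"http", "https", "www", "com", "org", "net"}

# Assumed boundary characters for a URL embedded in script text.
BOUNDARIES = set("\"'` \t\n()<>;,")


def locate_tokens(text):
    """Yield the start offset of each alphabetic token found in VALID_TOKENS."""
    for match in re.finditer(r"[A-Za-z]+", text):
        if match.group().lower() in VALID_TOKENS:
            yield match.start()


def expand(text, position):
    """Parse before and after `position` until a boundary is hit on each side."""
    left = position
    while left > 0 and text[left - 1] not in BOUNDARIES:
        left -= 1
    right = position
    while right < len(text) and text[right] not in BOUNDARIES:
        right += 1
    return text[left:right]


def detect_valid(candidate):
    """Hypothetical detector: http(s) scheme and a dotted host name."""
    try:
        parts = urlsplit(candidate)
    except ValueError:
        return False
    return parts.scheme in ("http", "https") and "." in parts.netloc


def plan_requests(script_text):
    """Return (method, url) pairs for each distinct valid URL, in the order found."""
    seen, requests = set(), []
    for position in locate_tokens(script_text):
        url = expand(script_text, position)
        if url not in seen and detect_valid(url):
            seen.add(url)
            requests.append(("GET", url))
    return requests


script = "document.write('<img src=\"http://cdn.example.com/logo.png\">'); var com = 1;"
print(plan_requests(script))  # [('GET', 'http://cdn.example.com/logo.png')]
```

Both `http` and the `com` inside the host name expand to the same URL, so the result is deduplicated. The standalone variable named `com` also matches the token list, but it expands to a string that the detector rejects.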
20 Claims
1. An apparatus for scanning an executable script object comprising:
- a digital dictionary configured to store tokens each comprising a possible piece of a uniform resource locator (URL);
- a script parser configured to:
  receive an executable script object comprising text,
  parse the text of the executable script object to find an instance of one of the tokens in the text,
  continue to parse the text of the executable script object adjacent to the instance of the token to find a syntax element, and
  construct a candidate URL from the instance of the token and the syntax element;
- a URL rules detector configured to:
  store rules for validating URLs, and
  utilize the stored rules to determine whether the candidate URL is a valid URL.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20
Specification