Scalable derivative services
First Claim
1. A method for efficiently identifying dynamic content of a webpage, the method comprising:
(a) accessing, by a virtual browser of a plurality of virtual browsers executing on a device intermediary to a plurality of clients and a plurality of servers, a first stored data file representing a first version of a web page and a first abstract syntax tree corresponding to the first stored data file, the abstract syntax tree comprising at least one static node, the static node including stored content;
(b) identifying, by the virtual browser of the plurality of virtual browsers, non-matching dynamic content between the first stored data file and a second data file representing a second version of the web page without using a second abstract syntax tree corresponding to the second data file; and
(c) replacing, by the virtual browser, the at least one static node corresponding to the non-matching dynamic content in the first abstract syntax tree with a token that identifies the portion of the abstract syntax tree containing the non-matching dynamic content.
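Steps (a) through (c) can be illustrated with a minimal sketch. It assumes a simplified model that the claim does not prescribe: the abstract syntax tree is a flat list of nodes, each static node records the character span of its stored content in the first file, and the comparison is a plain text diff of the two files, so no second tree is built. The names `StaticNode`, `Token`, `changed_ranges`, and `replace_dynamic`, and the `dyn-N` token format, are hypothetical.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import List, Tuple, Union


@dataclass
class StaticNode:
    start: int    # offset of the stored content in the first file
    end: int      # exclusive end offset
    content: str  # stored content


@dataclass
class Token:
    start: int
    end: int
    name: str     # hypothetical label for the dynamic portion


Node = Union[StaticNode, Token]


def changed_ranges(first: str, second: str) -> List[Tuple[int, int]]:
    """Spans of `first` that do not match `second`, found by text diff only."""
    matcher = SequenceMatcher(None, first, second, autojunk=False)
    return [(i1, i2) for tag, i1, i2, _j1, _j2 in matcher.get_opcodes()
            if tag != "equal"]


def overlaps(node: StaticNode, lo: int, hi: int) -> bool:
    # A pure insertion has lo == hi; treat it as touching the character at lo.
    return lo < node.end and node.start < max(hi, lo + 1)


def replace_dynamic(ast: List[Node], first: str, second: str) -> List[Node]:
    """Swap every static node that covers non-matching content for a token."""
    ranges = changed_ranges(first, second)
    result: List[Node] = []
    for node in ast:
        if isinstance(node, StaticNode) and any(
            overlaps(node, lo, hi) for lo, hi in ranges
        ):
            result.append(Token(node.start, node.end, f"dyn-{len(result)}"))
        else:
            result.append(node)
    return result


first = "<h1>News</h1><p>Visits: 10</p>"
second = "<h1>News</h1><p>Visits: 27</p>"


def node_for(text: str) -> StaticNode:
    start = first.index(text)
    return StaticNode(start, start + len(text), text)


ast = [node_for("News"), node_for("Visits: 10")]
result = replace_dynamic(ast, first, second)
print(result)
```

In this example only the visit counter differs between the two versions, so the heading node stays static and the counter node becomes a token.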
Abstract
An efficient method for parsing HTML pages identifies pages containing a mix of static and dynamic content. The pages are parsed to form abstract syntax trees (ASTs), which are then cached along with the pages. When a later version of a page is retrieved, it is compared against the cached version, and only those portions of the AST that contain different content are reparsed.
20 Claims
1. A method for efficiently identifying dynamic content of a webpage, the method comprising:
(a) accessing, by a virtual browser of a plurality of virtual browsers executing on a device intermediary to a plurality of clients and a plurality of servers, a first stored data file representing a first version of a web page and a first abstract syntax tree corresponding to the first stored data file, the abstract syntax tree comprising at least one static node, the static node including stored content;
(b) identifying, by the virtual browser of the plurality of virtual browsers, non-matching dynamic content between the first stored data file and a second data file representing a second version of the web page without using a second abstract syntax tree corresponding to the second data file; and
(c) replacing, by the virtual browser, the at least one static node corresponding to the non-matching dynamic content in the first abstract syntax tree with a token that identifies the portion of the abstract syntax tree containing the non-matching dynamic content.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A system for efficiently identifying dynamic content of a webpage comprising:
a device intermediary to a plurality of clients and a plurality of servers;
a comparison engine of the device;
accessing a first stored data file representing a first version of a web page and first abstract syntax tree corresponding to the first stored data file, the abstract syntax tree comprising at least one static node, the static node including stored content; and
identifying non-matching dynamic content between the first stored data file and a second data file representing a second version of the web page without using a second abstract syntax tree corresponding to the second data file; and
a virtual browser of a plurality of virtual browsers executing on the device and replacing the at least one static node corresponding to the non-matching dynamic content in the first abstract syntax tree with a token that identifies the portion of the abstract syntax tree containing the non-matching dynamic content.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19, 20
Specification