Methods, systems and computer program products for analyzing a hypertext markup language (HTML) document
First Claim
Patent Images
1. A method for generating a hierarchical representation of a hypertext markup language (HTML) document, the method comprising:
- capturing a state of a web page at a point in time;
identifying a plurality of content elements of the captured web page;
organizing the content elements to provide a grouping of the content elements based on an associated type and/or content of respective ones of the content elements to provide the hierarchical representation of the HTML document.
5 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems and computer program products for generating a hierarchical representation of a hypertext markup language (HTML) document. A state of a web page is captures at a point in time. A plurality of content elements of the captured web page are identified. The content elements are organized to provide a grouping of the content elements based on an associated type and/or content of respective ones of the content elements to provide the hierarchical representation of the HTML document.
-
Citations
31 Claims
-
1. A method for generating a hierarchical representation of a hypertext markup language (HTML) document, the method comprising:
-
capturing a state of a web page at a point in time;
identifying a plurality of content elements of the captured web page;
organizing the content elements to provide a grouping of the content elements based on an associated type and/or content of respective ones of the content elements to provide the hierarchical representation of the HTML document. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A system for generating a hierarchical representation of a hypertext markup language (HTML) document, the system comprising:
a representation module configured to capture a state of a web page at a point in time, identify a plurality of content elements of the captured web page and organize the content elements to provide a grouping of the content elements based on an associated type and/or content of respective ones of the content elements to provide the hierarchical representation of the HTML document. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22)
-
23. A computer program product for generating a hierarchical representation of a hypertext markup language (HTML) document, the computer program product comprising:
-
a computer readable medium having computer readable program code embodied therein, the computer readable program code comprising;
computer readable program code configured to capture a state of a web page at a point in time;
computer readable program code configured to identify a plurality of content elements of the captured web page;
computer readable program code configured to organize the content elements to provide a grouping of the content elements based on an associated type and/or content of respective ones of the content elements to provide the hierarchical representation of the HTML document. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30, 31)
-
Specification