Auditing a website with page scanning and rendering techniques
First Claim
1. In a computerized environment comprising a rendering system and a website having one or more web pages that have one or more parent and child relationships, a method of generating a site map of the website using both page scan and page rendering techniques, comprising:
- requesting, from a server, one or more web pages of an identified website pursuant to generating a site map of the website;
receiving a response from the server, the response including page code that defines one or more web pages of the identified website;
scanning the page code of at least one of the web pages without rendering the at least one web page during the scanning to identify scanned web page information;
rendering all of the page code of the at least one web page, wherein the results of the rendered page code comprise rendered web page information; and
displaying a site map of the website that includes a combination of scanned web page information and rendered web page information for the at least one web page, wherein displaying the site map further comprises;
identifying an amount of image data to be rendered in the at least one web page;
providing a value in the site map that indicates a comparison between the amount of image data and a defined average amount of image data found in web pages from other websites; and
indicating whether content of the at least one web page is consistent with HTML meta tags included within the content of the at least one web page.
4 Assignments
0 Petitions
Accused Products
Abstract
An automated website analysis system includes mechanisms for automatically reviewing a website and identifying various features of the website. In one implementation, the analysis system can rely at least in part on the actual downloading and rendering of each page in the website to determine not only aspects of website content and layout, but also whether various content, such as third party objects or source code, execute as intended. Additional features include recording user input that may be required to progress past web pages where automated site mapping would ordinarily stall. The analysis system can identify various items of interest about each web page to the website owner, including web page parent/child relationships, as well as the extent to which the given page content is consistent with its metadata descriptions.
54 Citations
17 Claims
-
1. In a computerized environment comprising a rendering system and a website having one or more web pages that have one or more parent and child relationships, a method of generating a site map of the website using both page scan and page rendering techniques, comprising:
-
requesting, from a server, one or more web pages of an identified website pursuant to generating a site map of the website; receiving a response from the server, the response including page code that defines one or more web pages of the identified website; scanning the page code of at least one of the web pages without rendering the at least one web page during the scanning to identify scanned web page information; rendering all of the page code of the at least one web page, wherein the results of the rendered page code comprise rendered web page information; and displaying a site map of the website that includes a combination of scanned web page information and rendered web page information for the at least one web page, wherein displaying the site map further comprises; identifying an amount of image data to be rendered in the at least one web page; providing a value in the site map that indicates a comparison between the amount of image data and a defined average amount of image data found in web pages from other websites; and indicating whether content of the at least one web page is consistent with HTML meta tags included within the content of the at least one web page. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. In a computerized environment comprising a rendering system and a website having one or more web pages that have one or more parent and child relationships, a method of the rendering system automatically generating a site map that shows one or more of the parent and child relationships using page rendering techniques, comprising:
-
receiving a request to generate a site map, wherein the request comprises one or more end user provided processing parameters, and an identified website; requesting one or more web pages of the identified website; receiving the one or more web pages, each web page comprising page code that defines the web page; processing the one or more web pages corresponding to the identified website in accordance with the user provided processing parameters including; scanning the page code of at least one of the web pages without rendering the at least one web page, during scanning, to identify scanned web page information; rendering all of the page code of the at least one web page, wherein the results of the rendered page code comprise rendered web page information; and generating a site map for display in a user interface, wherein the site map displays the scanned web page information including one or more link relationships between a plurality of web pages in the website, and further displays; the rendered web page information including rendering results for the rendered one or more web pages; an indication of whether content of the at least one web page is consistent with HTML meta tags included within the content of the at least one web page; and an indication of whether the at least one web page requests user information that is consistent with a privacy policy, wherein the privacy policy defines the type of information that an owner of the website states it will and will not requests. - View Dependent Claims (8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A non-transitory computer-readable medium that, when executed cause one or more processors in a computer system to perform a method of automatically generating a site map of the website using page rendering techniques, comprising:
-
requesting, from a server, one or more web pages of an identified website pursuant to generating a site map of the website; receiving a response from the server, the response including page code that defines one or more web pages of the identified website; scanning the page code of at least one of the web pages without rendering the at least one web page to identify scanned web page information; rendering all of the page code of the at least one web page, wherein the results of the rendered page code comprise rendered web page information; and displaying a site map of the website that includes a combination of scanned web page information and rendered web page information for the at least one web page, wherein displaying the site map further comprises; identifying an amount of image data to be rendered in the at least one web page; providing a value in the site map that indicates a comparison between the amount of image data and a defined average amount of image data found in web pages from other websites; and indicating whether the at least one web page requests user information that is consistent with a privacy policy, wherein the privacy policy defines the type of information that an owner of the website states it will and will not requests.
-
Specification