Automated client-based web application URL link extraction tool for use in testing and verification of internet web servers and associated applications executing thereon
First Claim
1. A method for extracting links from an Internet application executing on a web site server and having a plurality of web pages, each of said pages including links for use by an automated web server test and/or strain tool in testing and/or straining said application, comprising:
- generating a get request for a first of at least one of said pages from said application;
automatically extracting substantially all of said links from said first of said pages;
storing said extracted links in a format receivable by said test and/or strain tool;
retrieving said stored extracted links;
sorting said retrieved extracted links; and
transferring said sorted extracted links to said test and/or strain tool.
1 Assignment
0 Petitions
Accused Products
Abstract
A get request employing TCPIP sockets is issued which obtains requested HTML pages associated with the application from the web server. A search is executed extracting all links on the page automatically. The links are optionally sorted as designated in an input file or command line in random order so as to generate a typical access paths through the web server application during testing and verification thereof. The retrieved and extracted data is formatted and output in a common format employable in an input file by multiple test application tools which request, capture, store, verify data returned from, and stress the web servers and associated applications.
In a preferred embodiment, server port, and server name specified by host name or address, may be designated either in the input file or command line.
97 Citations
12 Claims
-
1. A method for extracting links from an Internet application executing on a web site server and having a plurality of web pages, each of said pages including links for use by an automated web server test and/or strain tool in testing and/or straining said application, comprising:
-
generating a get request for a first of at least one of said pages from said application;
automatically extracting substantially all of said links from said first of said pages;
storing said extracted links in a format receivable by said test and/or strain tool;
retrieving said stored extracted links;
sorting said retrieved extracted links; and
transferring said sorted extracted links to said test and/or strain tool. - View Dependent Claims (2, 3, 4, 5, 6)
generating a next get request for a next at least one of said pages from said application;
automatically extracting substantially all of said links from said next at least one of said pages from said application; and
storing said extracted links from said next at least one of said pages.
-
-
4. The method of claim 1 wherein said sorting is for creating non-typical access paths through said application.
-
5. The method of claim 4 comprising:
-
automatically repeating in order said steps of generating a next get request;
automatically extracting substantially all of said links from said next at least one of said pages; and
storing said extracted links from said next at least one of said pages;
for successive next ones of said at least one of said pages until all of said links for said plurality of web pages are extracted and stored.
-
-
6. The method of claim 5 further including:
-
retrieving all of said extracted links;
sorting said all of said extracted links; and
transferring said all of said sorted extracted links to said test and/or strain tool.
-
-
7. An apparatus for extracting links from an Internet application executing on a web site server, and having a plurality of web pages, each of said pages including links for use by an automated web server test and/or strain tool in testing and/or straining said application, comprising:
-
means for generating a get request for a first of at least one of said pages from said application;
means for automatically extracting substantially all of said links from said first of said pages;
means for storing said extracted links in a format receivable by said test and/or strain tool;
means for retrieving said stored extracted links;
means for sorting said retrieved extracted links; and
means for transferring said sorted extracted links to said test and/or strain tool. - View Dependent Claims (8, 9, 10, 11, 12)
means for generating a next get request for a next at least one of said pages said application;
means for automatically extracting substantially all of said links from said next at least one of said pages from said application; and
means for storing said extracted links from said next at least one of said pages.
-
-
10. The apparatus of claim 7 wherein
said means for sorting is for creating non-typical access paths through said application. -
11. The apparatus of claim 10 comprising:
-
means for automatically repeating in order said steps of repeating in order said steps of generating a next get request;
automatically extracting substantially all of said links from said next at least one of said pages; and
for storing said extracted links from said next at least one of said pages;
for successive next ones of said at least one of said pages until all of said links for said plurality of web pages are extracted and stored.
-
-
12. The apparatus of claim 11 further including:
-
means for retrieving all of said extracted links;
means for sorting said all of said extracted links; and
means for transferring said all of said sorted extracted links to said test and/or strain tool.
-
Specification