DETECTING ERROR PAGES BY ANALYZING SERVER REDIRECTS
First Claim
1. A computer-implemented method, comprising:
- analyzing previously stored target addresses;
determining one or more of the previously stored target addresses that result from more than a predetermined number of redirected originating addresses; and
on determining a respective target address, determining that one or more corresponding originating addresses are invalid based on a difference between information previously stored for the one or more corresponding originating addresses and information associated with the respective target address.
2 Assignments
0 Petitions
Accused Products
Abstract
A system and method is disclosed for detecting invalid webpages by analyzing server redirects. A storage comprising a set of previously stored target addresses is queried to determine whether one or more of the set of previously stored target addresses result from a redirect initiated from more than a predetermined number of originating addresses. On determining that a target address resulted from a redirect initiated from more than the predetermined number of originating addresses, the originating addresses are analyzed to determine, for each address, a difference between information previously stored for the originating address and information associated with the respective target address. If the difference satisfies a predetermined threshold, the originating address is marked as not valid or is removed.
-
Citations
20 Claims
-
1. A computer-implemented method, comprising:
-
analyzing previously stored target addresses; determining one or more of the previously stored target addresses that result from more than a predetermined number of redirected originating addresses; and on determining a respective target address, determining that one or more corresponding originating addresses are invalid based on a difference between information previously stored for the one or more corresponding originating addresses and information associated with the respective target address. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A machine-readable media including instructions thereon that, when executed, perform a method, the method comprising:
-
determining one or more target addresses that result from a redirection from one or more originating addresses; and for a target address, storing a plurality of originating addresses, determining that a number of the plurality of originating addresses satisfies a predetermined threshold, and, on determining that the plurality of originating addresses satisfies the predetermined threshold, providing an indication that the plurality of originating addresses is not valid. - View Dependent Claims (15, 16, 17, 18, 19)
-
-
20. A system, comprising:
-
a processor; and a memory, including server instructions that, when executed, cause the processor to; analyze a plurality of internet addresses; store information corresponding to the plurality of internet addresses; from the plurality of internet addresses, determine one or more target addresses redirected from the plurality of internet addresses; store the one or more target addresses in a storage location; and for a target address, store a plurality of originating addresses, determine a number of the plurality of originating addresses, and, on determining that the number satisfies a first predetermined threshold, identify originating addresses associated with resources that include different information than a resource associated with the target address, and providing an indication that the identified originating addresses are not valid.
-
Specification