URL rescue by execution of search using information extracted from invalid URL
First Claim
1. A computer-implemented URL rescue process, said URL rescue process comprising:
- by a server system that hosts a site;
receiving a Uniform Resource Locator (URL) request from a user device, said URL request specifying a URL that includes a domain name and a character string;
detecting that the URL is invalid; and
in response to detecting that the URL is invalid, executing a set of one or more URL rescue strategies to attempt to locate site content that is related to the invalid URL, said set of one or more URL rescue strategies including a search term mining based strategy that comprises;
determining whether the character string of the invalid URL includes at least one term that is suitable for use as a search term for executing a search of content of the site, said determining comprising identifying at least one substring of the character string, and comparing the substring to entries in a selection list, the selection list including at least one of the following;
a whitelist, a blacklist, a list of parameters;
when the determination results in identification of a set of one or more search terms, executing a search of the site content using the set of one or more search terms, and sending a result of the search to the user device in response to the URL request, wherein the search is executed using an index of the site content.
0 Assignments
0 Petitions
Accused Products
Abstract
A server system is disclosed that is capable of providing responsive content to a user when a request for an invalid URL is received. In a preferred embodiment, the server system implements multiple rescue strategies for attempting to rescue the invalid URL. One such rescue strategy involves repairing the URL by correcting for encoding errors, such as errors introduced by some web clients. Another rescue strategy involves determining that the URL contains an obsolete or outdated item identifier (e.g., product identifier) that renders the URL invalid, and returning information regarding one or more related items to the user. Another rescue strategy involves mining the invalid URL for one or more text strings reflective of the type of content desired, and then using the one or more text strings to execute a keyword search to identify content to provide to the user. The various rescue strategies may be attempted in sequence according to a hierarchy.
-
Citations
21 Claims
-
1. A computer-implemented URL rescue process, said URL rescue process comprising:
-
by a server system that hosts a site; receiving a Uniform Resource Locator (URL) request from a user device, said URL request specifying a URL that includes a domain name and a character string; detecting that the URL is invalid; and in response to detecting that the URL is invalid, executing a set of one or more URL rescue strategies to attempt to locate site content that is related to the invalid URL, said set of one or more URL rescue strategies including a search term mining based strategy that comprises; determining whether the character string of the invalid URL includes at least one term that is suitable for use as a search term for executing a search of content of the site, said determining comprising identifying at least one substring of the character string, and comparing the substring to entries in a selection list, the selection list including at least one of the following;
a whitelist, a blacklist, a list of parameters;when the determination results in identification of a set of one or more search terms, executing a search of the site content using the set of one or more search terms, and sending a result of the search to the user device in response to the URL request, wherein the search is executed using an index of the site content. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system that hosts a site in which content is addressable using Uniform Resource Locators (URLs), the system comprising:
-
a data repository that stores content of the site, including page content corresponding to specific URLs; a server system that provides network-based access to the site content in response to URL requests from user devices, said server system programmed to implement a set of one or more URL rescue strategies for processing requests for invalid URLs, said set of one or more URL rescue strategies comprising a rescue strategy in which the server system; makes a determination, using a selection list, of whether an invalid URL includes at least one term that is suitable for use as a search term for executing a keyword search of site content, wherein the selection list includes at least one of the following;
a whitelist, a blacklist, a list of parameters; andwhen the determination is positive, (1) executes a keyword search of the content of the site using one or more search terms extracted from the invalid URL and an index of the site content; and
(2) returns, to a device that requested the invalid URL, a page containing results of the keyword search. - View Dependent Claims (12, 13, 14, 15)
-
-
16. Non-transitory computer storage having stored thereon executable program code that directs a server system that hosts a site to implement a Uniform Resource Locator (URL) rescue process for responding to a URL request received from a requesting device, said URL rescue process comprising:
-
determining that a URL specified in the URL request is invalid; and at least partly in response to determining that the URL is invalid, executing a URL rescue strategy that comprises; mining, from a character string that follows a domain name portion of the invalid URL, a set of one or more search terms for executing a search of content of the site, wherein mining the set of one or more search terms comprises identifying at least one substring of the invalid URL and comparing the substring to entries in a selection list, the selection list including at least one of the following;
a whitelist, a blacklist, a list of parameters;in response to successful mining of said set of one or more search terms, executing a keyword search of the content of the site using the set of one or more search terms, wherein the keyword search is executed using an index of the content of the site; and sending results of the search to the requesting device in response to the URL request. - View Dependent Claims (17, 18, 19, 20, 21)
-
Specification