Updating a search index using reported browser history data
First Claim
1. A computer-implemented method for updating a search index to identify documents with spiking interest, the method comprising:
- periodically receiving, at a first computing device having a processor and a memory, a history file from each of a plurality of second computing devices, the history file including a uniform resource identifier (URI) that has been accessed via a respective one of the plurality of second computing devices and metadata indicating a time of day when the URIs were accessed, wherein the plurality of second computing devices form a group of second computing devices;
determining a number of instances of the URI in the history files for the group of second computing devices, wherein the number of instances occurs in a predetermined time period between a given start and stop time;
calculating, automatically without user interaction, an average number of instances of the URI in the history files for the group of second computing devices, wherein the average number of instances occurs in the predetermined time period over a predetermined number of previous days, and wherein the predetermined time period is between the given start and stop time each day;
determining that the number of instances of the URI in the history file is at least a predetermined amount greater than the average number of instances of the URI in the history files by, automatically without user interaction, comparing the number of instances of the URI for the group of second computing devices to the average number of instances of the URI over the predetermined number of days for said group of second computing devices;
identifying the URI as a spiking URI when the number of instances of the URI is at least a predetermined amount greater than the average number of instances of the URI occurring over the predetermined number of days; and
updating a search index to indicate the spiking URI, the search index being useable by a search engine to identify search results for a search query.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and computer-readable media are provided for updating a search index with new uniform resource locators (URLs) and spiking URLs with increased user interest. History data, provided from browser applications residing on users'"'"' computers that indicate URLs accessed by the users, is parsed to identify new/previously unknown URLs. The history data also indicates URLs in which there is increased interest based on a number of recent hits as compared to an average number of hits determined over time. Author postings of new URLs to social networking sites and a quality rating of the authors may also be used to identify and filter new URLs. Search indexes are updated with the new and spiking URL data. As such, lag time between posting of new URLs and spiking of URL interest and inclusion of this data in a search index is greatly decreased.
-
Citations
13 Claims
-
1. A computer-implemented method for updating a search index to identify documents with spiking interest, the method comprising:
-
periodically receiving, at a first computing device having a processor and a memory, a history file from each of a plurality of second computing devices, the history file including a uniform resource identifier (URI) that has been accessed via a respective one of the plurality of second computing devices and metadata indicating a time of day when the URIs were accessed, wherein the plurality of second computing devices form a group of second computing devices; determining a number of instances of the URI in the history files for the group of second computing devices, wherein the number of instances occurs in a predetermined time period between a given start and stop time; calculating, automatically without user interaction, an average number of instances of the URI in the history files for the group of second computing devices, wherein the average number of instances occurs in the predetermined time period over a predetermined number of previous days, and wherein the predetermined time period is between the given start and stop time each day; determining that the number of instances of the URI in the history file is at least a predetermined amount greater than the average number of instances of the URI in the history files by, automatically without user interaction, comparing the number of instances of the URI for the group of second computing devices to the average number of instances of the URI over the predetermined number of days for said group of second computing devices; identifying the URI as a spiking URI when the number of instances of the URI is at least a predetermined amount greater than the average number of instances of the URI occurring over the predetermined number of days; and updating a search index to indicate the spiking URI, the search index being useable by a search engine to identify search results for a search query. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. One or more computer-readable media having computer-executable instructions embodied thereon that, when executed, perform a method for updating a search index with new and spiking uniform resource identifiers (URIs), the method comprising:
-
periodically receiving, at a first computing device having a processor and a memory, a history file from each of a plurality of second computing devices, the history file including one or more URIs that have been accessed via a respective one of the plurality of second computing devices and metadata indicating a time of day when the URIs were accessed, wherein the history file is a file including one or more URIs, and wherein the plurality of second computing devices form a group of second computing devices; identifying, by the first computing device, a first new URI from one or more of the history files for the group of second computing devices, the first new URI associated with a document that was previously unknown to the first computing device; determining a number of instances of a second URI of the one or more URIs included in the history files for the group of second computing devices which were accessed within a predetermined time period between a given start and stop time; determining an average number of instances of the second URI in the history files for the group of second computing devices accessed in the predetermined time period over a predetermined number of days, wherein the predetermined time period is between the given start and stop time each day; determining that the number of instances of the second of the one or more URI included in the history file is at least a predetermined amount greater than the average number of instances of the second URI in the history files by, automatically without user interaction, comparing the number of instances of the second URI to the average number of instances of the second URI over the predetermined number of days for said group of second computing devices; identifying the second URI as a spiking URI when the number of instances of the second URI is at least a predetermined amount greater than the average number of instances of the second URI over the predetermined number of days; updating the search index with the first new URI; and updating the search index to indicate the spiking URI, the search index being useable by a search engine to identify search results for a search query. - View Dependent Claims (9, 10, 11, 12, 13)
-
Specification