UPDATING A SEARCH INDEX USING REPORTED BROWSER HISTORY DATA
First Claim
1. A computer-implemented method for updating a search index to identify documents with spiking interest, the method comprising:
- periodically receiving, at a first computing device having a processor and a memory, a history file from each of a plurality of second computing devices, the history file including a uniform resource locator (URLs) that has been accessed via a respective second computing device;
determining a number of instances of the URL in the history files occurring in a predetermined time period;
determining an average number of instances of the URL in the history files occurring in the predetermined time period over a predetermined number of days;
comparing the number of instances of the URL to the average number of instances of the URL over the predetermined number of days;
identifying the URL as a spiking URL when the number of instances of the URL is at least a predetermined amount greater than the average number of instances of the URL occurring over the predetermined number of days; and
updating a search index to indicate the spiking URL, the search index being usable by a search engine to identify search results for a search query.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and computer-readable media are provided for updating a search index with new uniform resource locators (URLs) and spiking URLs with increased user interest. History data, provided from browser applications residing on users'"'"' computers that indicate URLs accessed by the users, is parsed to identify new/previously unknown URLs. The history data also indicates URLs in which there is increased interest based on a number of recent hits as compared to an average number of hits determined over time. Author postings of new URLs to social networking sites and a quality rating of the authors may also be used to identify and filter new URLs. Search indexes are updated with the new and spiking URL data. As such, lag time between posting of new URLs and spiking of URL interest and inclusion of this data in a search index is greatly decreased.
-
Citations
20 Claims
-
1. A computer-implemented method for updating a search index to identify documents with spiking interest, the method comprising:
-
periodically receiving, at a first computing device having a processor and a memory, a history file from each of a plurality of second computing devices, the history file including a uniform resource locator (URLs) that has been accessed via a respective second computing device; determining a number of instances of the URL in the history files occurring in a predetermined time period; determining an average number of instances of the URL in the history files occurring in the predetermined time period over a predetermined number of days; comparing the number of instances of the URL to the average number of instances of the URL over the predetermined number of days; identifying the URL as a spiking URL when the number of instances of the URL is at least a predetermined amount greater than the average number of instances of the URL occurring over the predetermined number of days; and updating a search index to indicate the spiking URL, the search index being usable by a search engine to identify search results for a search query. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. One or more computer-readable media having computer-executable instructions embodied thereon that, when executed, perform a method for updating a search index with newly published documents, the method comprising:
-
providing an application to a first computing device having a processor and a memory, the application configured to generate a history file that includes an indication of each of one or more uniform resource locators (URLs) accessed by a browser application executing on the first computing device, wherein the application and the browser application are the same or different; receiving the history file at a second computing device having a processor and a memory; identifying, by the second computing device, a new URL from the one or more URLs indicated in the history file, the new URL being associated with a document that was previously unknown to the second computing device; and updating a search index with the new URL, the search index being usable by a search engine to identify search results for a search query. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15)
-
-
16. One or more computer-readable media having computer-executable instructions embodied thereon that, when executed, perform a method for updating a search index with new and spiking uniform resource locators (URLs), the method comprising:
-
periodically receiving, at a first computing device having a processor and a memory, a history file from each of a plurality of second computing devices, the history file including one or more URLs that have been accessed via a respective second computing device; identifying, by the first computing device, a first new URL from one or more of the history files, the first new URL associated with a document that was previously unknown to the first computing device; determining a number of instances of a second URL of the one or more URLs included in the history files and within a predetermined time period, wherein the first new URL and the second URL are the same or different; determining an average number of instances of the second URL in the predetermined time period over a predetermined number of days; comparing the number of instances of the second URL to the average number of instances of the second URL over the predetermined number of days; identifying the second URL as a spiking URL when the number of instances of the second URL is at least a predetermined amount greater than the average number of instances of the second URL over the predetermined number of days; updating a search index with the first new URL; and updating the search index to indicate the spiking URL, the search index being usable by a search engine to identify search results for a search query. - View Dependent Claims (17, 18, 19, 20)
-
Specification