Updating a search index using reported browser history data
First Claim
1. One or more computer-readable media having computer-executable instructions embodied thereon that, when executed, perform a method for updating a search index with newly published documents, the method comprising:
- providing an application to a user computing device having a processor and a memory, the application configured to generate and store on the user computing device a history file that includes an indication of each of one or more uniform resource identifiers (URIs) accessed by a browser application executing on the user computing device and metadata indicating a time of day when the URIs were accessed by the user computing device, wherein the application and the browser application are the same or different;
receiving the history file at a server having a processor and a memory from the user computer device based on a predetermined schedule;
identifying a new URI by the server from the one or more URIs indicated in the history file, the new URI being associated with a document that was previously unknown to the server; and
updating a search index with the new URI, the search index being useable by a search engine to identify search results for a search query.
1 Assignment
0 Petitions
Accused Products
Abstract
Methods, systems, and computer-readable media are provided for updating a search index with new uniform resource locators (URLs) and spiking URLs with increased user interest. History data, provided from browser applications residing on users'"'"' computers that indicate URLs accessed by the users, is parsed to identify new/previously unknown URLs. The history data also indicates URLs in which there is increased interest based on a number of recent hits as compared to an average number of hits determined over time. Author postings of new URLs to social networking sites and a quality rating of the authors may also be used to identify and filter new URLs. Search indexes are updated with the new and spiking URL data. As such, lag time between posting of new URLs and spiking of URL interest and inclusion of this data in a search index is greatly decreased.
-
Citations
19 Claims
-
1. One or more computer-readable media having computer-executable instructions embodied thereon that, when executed, perform a method for updating a search index with newly published documents, the method comprising:
-
providing an application to a user computing device having a processor and a memory, the application configured to generate and store on the user computing device a history file that includes an indication of each of one or more uniform resource identifiers (URIs) accessed by a browser application executing on the user computing device and metadata indicating a time of day when the URIs were accessed by the user computing device, wherein the application and the browser application are the same or different; receiving the history file at a server having a processor and a memory from the user computer device based on a predetermined schedule; identifying a new URI by the server from the one or more URIs indicated in the history file, the new URI being associated with a document that was previously unknown to the server; and updating a search index with the new URI, the search index being useable by a search engine to identify search results for a search query. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system for updating a search index with newly published documents comprising:
-
a first computing device, having a processor coupled to a memory, configured to receive an application which is configured to generate a history file that includes an indication of each of one or more uniform resource identifiers (URIs) accessed by a browser application executing on the first computing device and metadata indicating a time of day when the URIs were accessed, wherein the application and the browser application are the same or different, wherein the history file includes one or more URIs, and wherein the first computing device is a user computing device; a second computing device, having a processor coupled to a memory, configured to receive the history file; the second computing device configured to determine a new URI from the one or more URIs in the received history file that is previously unknown to the second computing device; and the search index configured to update automatically without user interaction with the new UM, the search index being useable by a search engine to identify search results for a search query. - View Dependent Claims (11, 12, 13, 14, 15)
-
-
16. A system for updating a search index with newly published documents comprising:
-
a plurality of groups of one or more user computing devices, each user device having a processor coupled to a memory and configured to receive an application, the application configured to generate a history file that includes an indication of each of one or more uniform resource identifiers (URIs) accessed by a browser application executing on each of the user computing device and metadata indicating a time of day when the URIs were accessed, wherein the application and the browser application are the same or different, wherein the history files include one or more URIs; a server, having a processor coupled to a memory, configured to sequentially receive the history files from each group of the plurality of groups of one or more user computing devices based on a predetermined schedule for each group of one or more user computing devices, wherein the history files from each group of the plurality of groups of one or more user computing devices forms a group of history files; the server configured to determine upon receipt of each group of history files a new URI that is previously unknown to the server from the one or more URIs in one of the history files in the received group of history files; and the search index configured to update automatically without user interaction with the new URI, the search index being useable by a search engine to identify search results for a search query. - View Dependent Claims (17, 18, 19)
-
Specification