Generating an index for a network search engine
First Claim
1. A computer-implemented method of generating an index for a search engine, the method comprising:
- receiving monitored network traffic information from a plurality of network devices including a first network device and a second network traffic device, the plurality of network devices respectively receiving and forwarding at least a part of the monitored network traffic information, the monitored network traffic information including at least one Uniform Resource Locator (URL) and traffic statistics, the traffic statistics including at least one traffic ranking respectively assigned to the at least one URL;
filtering to identify and remove at least one duplicate access to the at least one URL from the monitored network traffic information to generate filtered monitored network traffic information, the filtering being based on a single source/destination in a single session, the filtering comprising;
identifying a first access to a first URL by identifying a first monitored network traffic information that is received from the first network device as indicating at least one packet as originated from a first network traffic source and destined for the first URL,identifying a second access to the first URL by identifying a second monitored network traffic information that is received from the second network device indicating at least one packet as originated from the first network traffic source and destined for the first URL, andremoving the second access to the first URL responsive to identifying the second access to the first URL as being performed within a predetermined time period after the first access to the first URL;
ranking the filtered monitored network traffic information according to the traffic statistics; and
updating a search engine index based on the filtered monitored network traffic information as ranked according to the traffic statistics.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of generating a TrafficRank index for a network search engine is described. The method includes monitoring network traffic through a network device. The method further uses the monitored traffic information to generate and maintain a TrafficRank index. The invention extends to a method of generating search results, including generating search results from a search engine index and ranking the search results based on the TrafficRank index. The invention also extends to a method of generating and updating a search engine index either in real-time or in batch mode. If the method is adopted, any website or URLs accessed through website monitor/notifier/updater module by a web user, either by clicking on a hypertext link, entering the URL address on a web browser, or selecting a bookmark on a web browser, would be included and updated in the search engine index.
-
Citations
20 Claims
-
1. A computer-implemented method of generating an index for a search engine, the method comprising:
-
receiving monitored network traffic information from a plurality of network devices including a first network device and a second network traffic device, the plurality of network devices respectively receiving and forwarding at least a part of the monitored network traffic information, the monitored network traffic information including at least one Uniform Resource Locator (URL) and traffic statistics, the traffic statistics including at least one traffic ranking respectively assigned to the at least one URL; filtering to identify and remove at least one duplicate access to the at least one URL from the monitored network traffic information to generate filtered monitored network traffic information, the filtering being based on a single source/destination in a single session, the filtering comprising; identifying a first access to a first URL by identifying a first monitored network traffic information that is received from the first network device as indicating at least one packet as originated from a first network traffic source and destined for the first URL, identifying a second access to the first URL by identifying a second monitored network traffic information that is received from the second network device indicating at least one packet as originated from the first network traffic source and destined for the first URL, and removing the second access to the first URL responsive to identifying the second access to the first URL as being performed within a predetermined time period after the first access to the first URL; ranking the filtered monitored network traffic information according to the traffic statistics; and updating a search engine index based on the filtered monitored network traffic information as ranked according to the traffic statistics. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer-implemented method of generating an index for a network search engine, the method comprising:
-
receiving monitored network traffic information from a plurality of network devices including a first network device and a second network device, the plurality of network devices respectively receiving and forwarding at least a part of the monitored network traffic information, the monitored network traffic information including at least one Uniform Resource Locator (URL) and traffic statistics; filtering to identify and remove at least one duplicate access to the at least one URL from the monitored network traffic information to generate filtered monitored network traffic information, the filtering being based on a single source/destination in a single session, the filtering comprising; identifying a first access to a first URL by identifying a first monitored network traffic information that is received from the first network device as indicating at least one packet as originated from a first network traffic source and destined for the first URL, identifying a second access to the first URL by identifying a second monitored network traffic information that is received from the second network device indicating at least one packet as originated from the first network traffic source and destined for the first URL, removing the second access to the first URL responsive to identifying the second access to the first URL as being performed within a predetermined time period after the first access to the first URL; ranking the filtered monitored network traffic information according to the traffic statistics; and updating a search engine index based on the filtered monitored network traffic information as ranked according to the traffic statistics. - View Dependent Claims (8, 9, 10)
-
-
11. A computer-implemented method of providing search results, the method comprising:
-
responsive to a search request, generating search results using information from a search engine index, the search results including information associated with a plurality of websites; ranking the plurality of websites based on traffic statistics received from a plurality of network devices including a first network device and a second network device, the plurality of network devices respectively receiving and forwarding at least a part of monitored network traffic information that includes the traffic statistics, the traffic statistics including accesses to the plurality of websites that have been filtered to identify and remove at least one duplicate access based on a single source/destination in a single session, the filter comprises; an identification of a first access to a first URL by an identification of a first traffic statistic that is received from the first network device that indicates at least one packet as originated from a first network traffic source and destined for the first URL, an identification of a second access to the first URL by an identification of a second traffic statistic that is received from the second network device that indicates at least one packet as originated from the first network traffic source and destined for the first URL, a removal of the second access to the first URL responsive to an identification of the second access to the first URL as being performed within a predetermined time period after the first access to the first UR, and presenting the search results according to said ranking. - View Dependent Claims (12)
-
-
13. A non-transitory computer readable medium embodying instructions which, when executed by one or more processors, cause the one or more processors to perform operations comprising:
-
receiving monitored network traffic data for each of a plurality of websites, the monitored network traffic data being monitored by a plurality of network devices including a first network device and a second network device that respectively receive and forward at least part of the monitored network traffic data; storing the monitored network traffic data for each of the plurality of websites;
filtering to identify and remove at least one duplicate access to the plurality of websites from the monitored network traffic data, the filtering being based on a single source/destination in a single session and comprising;identifying a first access to a first URL by identifying a first monitored network traffic data that is received from the first network device as indicating at least one packet as originated from a first network traffic source and destined for the first URL, identifying a second access to the first URL by identifying a second monitored network traffic data that is received from the second network device indicating at least one packet as originated from the first network traffic source and destined for the first URL, and removing the second access to the first URL responsive to identifying the second access to the first URL as being performed within a predetermined time period after the first access to the first URL; and ranking the plurality of websites in search results at least according to their respective access frequency as determined from the filtered monitored network traffic data.
-
-
14. A system for generating an index for a network search engine, the system comprising:
-
a processor for executing; means for receiving monitored network traffic data monitored by a plurality of network devices including a first network device and a second network device, the plurality of network devices respectively receive and forward at least a part of the monitored network traffic data; means for determining data to be added to a search engine index using the monitored network traffic data; means for keeping track of access frequency of websites associated with Uniform Resource Locators (URLS) included in the monitored network traffic data; means for filtering to identify and remove at least one duplicate access to at least one of the websites from the monitored network traffic data to generate filtered monitored network traffic data, the filtering being based on a single source/destination in a single session and comprising; identifying a first access to a first URL, by identifying a first monitored network traffic data that is received from the first network device that indicates at least one packet as originated from a first network traffic source and destined for the first URL, identifying a second access to the first URL, by identifying a second monitored network traffic data that is received from the second network device that indicates at least one packet as originated from the first network traffic source and destined for the first URL, and removing the second access to the first URL responsive to an identification of the second access to the first URL as being performed within a predetermined time period after the first access to the first URL; means for including the filtered monitored network traffic data in the search engine index in search results responsive to a search request; and means for ranking websites included in the search results at least according to the access frequency of the websites. - View Dependent Claims (15, 16, 17)
-
-
18. A network device comprising:
-
a processor; a search engine index coupled to the processor and configured to store information related to a plurality of websites; a monitoring module coupled to the search engine index and configured to monitor network data received and forwarded by a network device and network data received and forwarded by a plurality of other network devices including a first network device and a second network device, the monitoring module to filter the network data to identify and remove at least one duplicate access to a first website that is included in the plurality of websites based on a single source/destination in a single session, the filter comprises the monitoring module to; identify a first access to a first URL by an identification of a first network data that is received from the first network device that indicates at least one packet as originated from a first network traffic source and destined for the first URL, identify a second access to the first URL by an identification of a second network data that is received from the second network device that indicates at least one packet as originated from a first network traffic source and destined for the first URL, and remove the second access to the first URL responsive to an identification of the second access to the first URL as being performed within a predetermined time period after the first access to the first URL, the monitoring module to determine whether the network data includes information to be added to the search engine index; a search module coupled to the search engine index to perform search against the search engine index responsive to a search request; and a ranking module to rank search results generated by the search module according to a filtered number of accesses of each of the websites included in the search results. - View Dependent Claims (19)
-
-
20. A search server system to provide search results to at least one computer connected to the server via a network, the system comprising:
-
a processor; a search engine index coupled to the processor and configured to store information used to generate search results, the search results includes information associated with a plurality of websites; a traffic rank index coupled to the search engine index and configured to store traffic rank information associated with at least some of the plurality of websites, the traffic rank information generated from network traffic information received and forwarded by a plurality of network devices including a first network device and a second network device, the search results are presented at least according to the traffic rank information, and the traffic rank information associated with a website is determined based on number of times the website is accessed and is filtered to identify and remove at least one duplicate access to the website based on a single source/destination in a single session, the filter comprises; an identification of a first access to a first URL by an identification of a first network traffic information that is received from the first network device that indicates at least one packet as originated from a first network traffic source and destined for the first URL; an identification of a second access to the first URL by an identification of a second network traffic information that is received from the second network device that indicates at least one packet as originated from a first network traffic source and destined for the first URL and a removal of the second access to the first URL responsive to an identification of the second access to the first URL as being performed within a predetermined time period after the first access to the first URL.
-
Specification