Method and apparatus for responding to end-user request for information-collecting
First Claim
1. A method of selectively collecting web pages that may be returned to users in response to search-requests, the method including:
- receiving, from behavior watching modules operating on a multiplicity of users'"'"' computers, information regarding the users'"'"' search engine usage across one or more search engines, including at least keywords (including phrases) submitted by particular users to the search engines, links selected by the particular users from results returned by the search engines, and at least one of dwell time or documents viewed by the particular users when following the selected links;
using the search engine usage information, choosing keywords of interest and the links selected by the users corresponding to the keywords of interest;
following the chosen links to collect at least part of the content of documents addressed by the chosen links; and
associating the collected content with the corresponding keyword of interest.
6 Assignments
0 Petitions
Accused Products
Abstract
The present invention relates to an advanced search engine. The advanced search engine may include a client component for monitoring an end-user'"'"'s browsing activity, a remote server (may comprise one or more computers) for storing and processing data received from the client component, and a module that process web pages and serves search results to end-users. The advanced search engine may collect web pages for keywords of proven interest, fetch web pages requested by end-users, generate snippets or abstracts of the web pages, eliminate duplicate web pages, rank the importance of the web pages, and provide relevant web pages or links to web pages in response to an end-user search request for information regarding one or more keywords, for example. Particular aspects of the present invention are described in the claims, specification and drawings.
-
Citations
29 Claims
-
1. A method of selectively collecting web pages that may be returned to users in response to search-requests, the method including:
-
receiving, from behavior watching modules operating on a multiplicity of users'"'"' computers, information regarding the users'"'"' search engine usage across one or more search engines, including at least keywords (including phrases) submitted by particular users to the search engines, links selected by the particular users from results returned by the search engines, and at least one of dwell time or documents viewed by the particular users when following the selected links;
using the search engine usage information, choosing keywords of interest and the links selected by the users corresponding to the keywords of interest;
following the chosen links to collect at least part of the content of documents addressed by the chosen links; and
associating the collected content with the corresponding keyword of interest. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A method of generating data using a network of behavior watching modules that transmit data to an affiliated system of servers, the method including:
-
using a multiplicity of behavior watching modules operating on a multiplicity of users'"'"' computers, the behavior watching modules collecting information regarding the users'"'"' search engine usage across one or more search engines, including at least keywords (including phrases) submitted by particular users to the search engines, links selected by the particular users from results returned by the search engines, and at least one of dwell time or documents viewed by the particular users when following the selected links; and
the behavior watching modules electronically reporting the search engine usage information to one or more affiliated servers. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
-
21. A computer-implemented system that selectively collects web pages that may be returned to users in response to search requests, the system including:
-
at least one listener module with a network interface that receives from behavior watching modules operating on a multiplicity of users'"'"' computers connected by a network, information regarding the users'"'"' search engine usage across one or more search engines, including at least keywords (including phrases) submitted by particular users to the search engines, links selected by the particular users from results returned by the search engines, and at least one of dwell time or documents viewed by the particular users when following the selected links; and
at least one computer or cluster of computers operating one or more modules in communication with the listener module that includes logic and resources adapted to process the search engine usage information and choose keywords of interest and the links selected by the users corresponding to the keywords of interest, follow the chosen links to collect at least part of the content of documents addressed by the chosen links, and associate the collected content with the corresponding keyword of interest. - View Dependent Claims (22)
-
-
23. A method of selecting snippets from content of a document that may be returned to users in response to search requests, the snippet selecting including:
-
applying a window of predetermined length to the document, repeatedly shifting the window through the document;
dividing the keyword of interest (a phrase) into words;
repeatedly calculating one or more window scores for the words in the window, including a count of instances of the words in the window, and a measure of adjacency of the words in the window (when the keyword is a phrase);
choosing as snippets a plurality of non-overlapping window positions using the window scores; and
storing the chosen snippet(s) or references to the chosen snippets in a machine readable memory. - View Dependent Claims (24, 25, 26, 27, 28, 29)
-
Specification