USER QUERY GENERATE SEARCH RESULTS THAT RANK SET OF SERVERS WHERE RANKING IS BASED ON COMPARING CONTENT ON EACH SERVER WITH USER QUERY, FREQUENCY AT WHICH CONTENT ON EACH SERVER IS ALTERED USING WEB CRAWLER IN A SEARCH ENGINE
First Claim
Patent Images
1. System for monitoring the World Wide Web (WWW), comprising:
- a user interface coupled to the WWW operable to obtain user information, wherein the user information includes a query;
a ranking component operable to rank a set of servers wherein each one of the set of servers is coupled to the WWW and wherein the ranking is based on at least one of;
1) a comparison of content on each server with the query; and
2) a frequency at which content on each server is altered; and
a search engine coupled to the WWW including a Web crawler operable to search at least one of the ranked servers in order of rank based on the query and generate search results wherein the search results refer to content on ranked servers that satisfy the query.
7 Assignments
0 Petitions
Accused Products
Abstract
A system, computer readable medium and method for searching for recently altered documents on the World Wide Web is provided. The method selects a server to be searched or crawled by a Web crawler based on a user selected ranking. Servers are ranked by a filter program which compares a user query with the content of a server and the frequency in which content is altered. A top percentage of ranked servers are crawled and the recently altered information, such as hyperlinks, are then provided to the user.
-
Citations
24 Claims
-
1. System for monitoring the World Wide Web (WWW), comprising:
-
a user interface coupled to the WWW operable to obtain user information, wherein the user information includes a query;
a ranking component operable to rank a set of servers wherein each one of the set of servers is coupled to the WWW and wherein the ranking is based on at least one of;
1) a comparison of content on each server with the query; and
2) a frequency at which content on each server is altered; and
a search engine coupled to the WWW including a Web crawler operable to search at least one of the ranked servers in order of rank based on the query and generate search results wherein the search results refer to content on ranked servers that satisfy the query. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method adapted for obtaining information from the World Wide Web (WWW) comprising the steps of:
-
obtaining a query;
calculating a content score of a first document having a first address on the WWW wherein the content score is based on comparing a content vector for the first document with the query;
ranking the first document in a set of documents based on at least one of;
1) the content score; and
2) a frequency at which document content is altered;
selecting a highest ranked document from the set of documents; and
crawling a first processing device on which the highest ranked document is stored to obtain a first altered document. - View Dependent Claims (12, 13, 14, 15, 16, 17)
providing a hyperlink of the first altered document to a user.
-
-
13. The method of claim 11, further comprising the steps of:
-
obtaining a search interval from a user; and
crawling the first processing device periodically, using the search interval.
-
-
14. The method of claim 11, further comprising the steps of:
notifying a user that the content of the first document has changed.
-
15. The method of claim 11, wherein the step of calculating further includes:
obtaining the content vector of the first document.
-
16. The method of claim 11, wherein the query includes a key-word.
-
17. The method of claim 11, wherein the frequency based on a last modified field in the first document.
-
18. A machine readable medium having instructions stored thereon that when executed by a processor cause a system to:
-
obtain a query;
calculate a content score of a first document having a first address on the World Wide Web (WWW) wherein the content score is based on comparing a content vector for the first document with the query;
rank the first document in a set of documents based on at least one of;
1) the content score; and
2) a frequency at which content on the document is altered;
select the highest ranked document from the set of documents; and
crawl a first processing device on which the highest ranked document is stored to obtain a first altered document. - View Dependent Claims (19, 20, 21, 22, 23, 24)
provide a hyperlink of the first altered document to a user.
-
-
20. The machine readable medium of claim 18, further comprising instructions that when executed cause a processor to:
-
obtain a search interval from a user; and
crawl the first processing device periodically, using the search interval.
-
-
21. The machine readable medium of claim 18, further comprising instructions that when executed cause a processor to:
notify a user that the content of the first document has changed.
-
22. The machine readable medium of claim 18, further comprising instructions that when executed cause a processor to:
obtain the content vector of the first document.
-
23. The machine readable medium of claim 18 wherein:
the query includes a keyword.
-
24. The machine readable medium of claim 18 wherein:
the frequency is based on a last modified field in the document.
Specification