Mobile SiteMaps
Abstract
A method of analyzing documents or relationships between documents includes receiving a notification of an available metadata document containing information about one or more network-accessible documents, obtaining a document format indicator associated with the metadata document, selecting a document crawler using the document format indicator, and crawling at least some of the network-accessible documents using the selected document crawler.
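The four steps in the abstract can be pictured as a small dispatch routine. The following is only a minimal sketch under stated assumptions: the `Notification` type, the format values `"html"` and `"wml"`, the two crawler functions, and the fallback to the HTML crawler are all hypothetical and are not taken from the patent text. No network access is performed; each "crawler" simply tags the URLs it would visit.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass
class Notification:
    """Hypothetical notification that a metadata document is available."""
    metadata_url: str      # location of the metadata document (e.g. a sitemap)
    format_indicator: str  # assumed values for illustration: "html", "wml"


def crawl_html(urls: List[str]) -> List[Tuple[str, str]]:
    # Stand-in for a desktop-oriented crawler; records what it would fetch.
    return [("html-crawler", u) for u in urls]


def crawl_wml(urls: List[str]) -> List[Tuple[str, str]]:
    # Stand-in for a mobile-oriented crawler; records what it would fetch.
    return [("wml-crawler", u) for u in urls]


# Registry mapping a format indicator to a crawler (names are assumptions).
CRAWLERS: Dict[str, Callable[[List[str]], List[Tuple[str, str]]]] = {
    "html": crawl_html,
    "wml": crawl_wml,
}


def handle_notification(note: Notification, listed_urls: List[str]):
    # Step 2: obtain the format indicator associated with the metadata document.
    indicator = note.format_indicator
    # Step 3: select a crawler using the indicator (the fallback is an assumption).
    crawler = CRAWLERS.get(indicator, crawl_html)
    # Step 4: crawl at least some of the listed documents with that crawler.
    return crawler(listed_urls)


result = handle_notification(
    Notification("http://example.com/sitemap.xml", "wml"),
    ["http://example.com/m/index.wml"],
)
```

A lookup table keeps the selection step separate from the crawling step, so supporting another format would only mean registering another entry.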
26 Claims
1. A method of analyzing documents or relationships between documents, comprising:
receiving a notification of an available metadata document containing information about one or more network-accessible documents;
obtaining a document format indicator associated with the metadata document;
selecting a document crawler using the document format indicator; and
crawling at least some of the network-accessible documents using the selected document crawler.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
13. A method of listing network-accessible documents, comprising:
generating a mapping document that represents an organization of related network-accessible documents; and
transmitting to a remote computer a notification that includes an indication that the mapping document is available for access and an indication of the format of the documents.
Dependent claims: 14, 15, 16, 17
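As a rough illustration of the publisher side described in claim 13, the sketch below builds a mapping document and composes a notification that carries both its location and a format indication. It assumes the mapping document is a Sitemaps-protocol XML file and that the notification is a URL with query parameters; the endpoint `https://search.example.com/ping` and the parameter names `sitemap` and `format` are hypothetical, and nothing is actually transmitted.

```python
import xml.etree.ElementTree as ET
from typing import List
from urllib.parse import urlencode

# Namespace defined by the public Sitemaps protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_mapping_document(urls: List[str]) -> str:
    """Return a minimal sitemap-style XML document listing the given URLs."""
    root = ET.Element("urlset", xmlns=SITEMAP_NS)
    for u in urls:
        url_el = ET.SubElement(root, "url")
        ET.SubElement(url_el, "loc").text = u
    return ET.tostring(root, encoding="unicode")


def build_notification_url(ping_endpoint: str, sitemap_url: str, doc_format: str) -> str:
    """Compose the notification as a URL (endpoint and parameter names are assumed)."""
    query = urlencode({"sitemap": sitemap_url, "format": doc_format})
    return ping_endpoint + "?" + query


mapping_doc = build_mapping_document(["http://example.com/m/index.wml"])
ping_url = build_notification_url(
    "https://search.example.com/ping",  # hypothetical endpoint
    "http://example.com/sitemap.xml",
    "wml",
)
```

In a real deployment the publisher would write `mapping_doc` to the sitemap URL and then issue an HTTP request to `ping_url`; both steps are left out here so the sketch stays free of side effects.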
18. A system for crawling network-accessible documents, comprising:
a memory storing organizational information about network-accessible documents at one or more websites, and format information for the documents;
a crawler configured to access the network-accessible documents using the organizational information; and
a format selector associated with the crawler to cause the crawler to assume a persona compatible with formats indicated by the format information.
Dependent claims: 19, 20
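One plausible reading of the "persona" in claim 18 is the set of HTTP request headers a crawler presents, so that a server returns the mobile or desktop variant of a page. The sketch below assumes that reading; the claim text itself does not define the persona this way. The `PERSONAS` table, the `ExampleCrawler` user-agent strings, and the fallback to the HTML persona are hypothetical. The request object is only constructed, never sent.

```python
from urllib.request import Request

# Hypothetical personas keyed by format indicator. text/vnd.wap.wml is the
# registered media type for WML; the User-Agent strings are invented.
PERSONAS = {
    "html": {
        "User-Agent": "ExampleCrawler/1.0 (desktop)",
        "Accept": "text/html",
    },
    "wml": {
        "User-Agent": "ExampleCrawler/1.0 (mobile; WAP)",
        "Accept": "text/vnd.wap.wml",
    },
}


def request_with_persona(url: str, doc_format: str) -> Request:
    """Build a request whose headers match the stored format information."""
    # Falling back to the HTML persona for unknown formats is an assumption.
    headers = PERSONAS.get(doc_format, PERSONAS["html"])
    return Request(url, headers=headers)


req = request_with_persona("http://example.com/m/index.wml", "wml")
```

Keeping personas in a table separates the format selector from the crawler itself, which mirrors the claim's split between the two components.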
21. A system for crawling network-accessible documents, comprising:
a memory storing organizational information about network-accessible documents at one or more websites, and format information for the documents;
a crawler configured to access the network-accessible documents using the organizational information; and
means for selecting a crawler persona to present in accessing the network-accessible documents.
22. A computer program product for use in conjunction with a computer system, the computer program product comprising a computer readable storage medium and a computer program mechanism embedded therein, the computer program mechanism comprising instructions for:
generating a mapping document that represents an organization of related network-accessible documents; and
transmitting to a remote computer a notification that includes an indication that the list is available for access and an indication of the format of the documents.
Dependent claims: 23, 24, 25, 26
Specification