Mobile sitemaps
Abstract
A method of analyzing documents or relationships between documents includes receiving a notification of an available metadata document containing information about one or more network-accessible documents, obtaining a document format indicator associated with the metadata document, selecting a document crawler using the document format indicator, and crawling at least some of the network-accessible documents using the selected document crawler.
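The steps in the abstract can be sketched in Python as follows. This is only an illustrative sketch under stated assumptions, not the patented implementation: the names `MetadataDocument`, `CrawlerMode`, `CRAWLER_MODES`, `select_crawler`, and `on_notification`, the format labels, and the injected `fetch` callable are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class MetadataDocument:
    """Hypothetical stand-in for the metadata document (for example, a sitemap)."""
    urls: List[str]          # the network-accessible documents it describes
    format_indicator: str    # format in which the listed content is stored


@dataclass(frozen=True)
class CrawlerMode:
    """A crawler operating mode, defined by the content formats it can access."""
    name: str
    formats: frozenset


# Illustrative registry; the mode names and format labels are assumptions.
CRAWLER_MODES = [
    CrawlerMode("desktop", frozenset({"html"})),
    CrawlerMode("mobile", frozenset({"xhtml-mp", "wml", "chtml"})),
]


def select_crawler(format_indicator: str, modes=CRAWLER_MODES) -> CrawlerMode:
    """Return the first mode whose formats include the indicated format."""
    for mode in modes:
        if format_indicator in mode.formats:
            return mode
    raise LookupError(f"no crawler mode supports format {format_indicator!r}")


def on_notification(doc: MetadataDocument,
                    fetch: Callable[[str, CrawlerMode], str]) -> Dict[str, str]:
    """Handle a notification that `doc` is available: select a mode, then crawl."""
    mode = select_crawler(doc.format_indicator)
    # `fetch` is injected so the sketch stays free of real network access.
    return {url: fetch(url, mode) for url in doc.urls}
```

Passing `fetch` in as a parameter keeps the selection logic separate from network access, so a stub can stand in for a real fetcher.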
16 Claims
1. A computer-implemented method of analyzing documents or relationships between documents, comprising:
receiving a notification of an available metadata document containing information about one or more network-accessible documents;
obtaining a document format indicator associated with the metadata document, the document format indicator specifying a format in which content of at least one of the network-accessible documents is stored;
selecting, using the document format indicator, a document crawler having an operating mode that defines one or more content formats that the document crawler is capable of accessing, including the format specified by the document format indicator; and
crawling with a computer at least some of the network-accessible documents using the selected document crawler and operating mode.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
13. A system for crawling network-accessible documents, comprising:
a memory storing organizational information about network-accessible documents at one or more websites, and format information for the documents;
a crawler configured to access the network-accessible documents using the organizational information; and
a format selector associated with the crawler to cause the crawler to assume a persona compatible with formats indicated by the format information.
Dependent claims: 14, 15
16. A system for crawling network-accessible documents, comprising:
a memory storing organizational information about network-accessible documents at one or more websites, and format information for the documents;
a crawler configured to access the network-accessible documents using the organizational information; and
means for selecting a crawler persona to present in accessing the network-accessible documents.
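The system claims above describe a format selector that causes the crawler to assume a persona. One way to picture this, purely as an assumption, is to model a persona as the set of HTTP request headers the crawler presents. The claims shown here do not define a persona that way, and the names `PERSONAS`, `FormatSelector`, and `plan_crawl`, along with the user-agent strings, are hypothetical.

```python
from typing import Dict, List, Tuple

# Assumption: a "persona" is modelled as HTTP request headers.
# The user-agent strings are made up for illustration.
PERSONAS: Dict[str, Dict[str, str]] = {
    "html": {"User-Agent": "ExampleCrawler/1.0", "Accept": "text/html"},
    "wml": {"User-Agent": "ExampleCrawler-Mobile/1.0", "Accept": "text/vnd.wap.wml"},
}


class FormatSelector:
    """Maps each document's stored format information to a crawler persona."""

    def __init__(self, format_info: Dict[str, str]):
        # format_info: URL -> format label (the stored "format information")
        self.format_info = format_info

    def persona_for(self, url: str) -> Dict[str, str]:
        return PERSONAS[self.format_info[url]]


def plan_crawl(organizational_info: List[str],
               selector: FormatSelector) -> List[Tuple[str, Dict[str, str]]]:
    """Pair each URL from the organizational information with its persona.

    Returns (url, headers) pairs instead of issuing requests, so the
    sketch can run without network access.
    """
    return [(url, selector.persona_for(url)) for url in organizational_info]
```

A real crawler would pass each returned header set to its HTTP client when requesting the paired URL.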
Specification