Mapping the structure of a collection of computer resources
First Claim
Patent Images
1. A computer-implemented method of gathering information about a site of resources in a computer system, the method comprising:
- retrieving a first one of the resources from the site;
extracting from the first resource information about embedded hyperlinks to other resources in the site;
extracting from the first resource meta-data describing aspects of the first resource and the other resources; and
storing in a self-contained persistent data store the information about the hyperlinks and the meta-data.
3 Assignments
0 Petitions
Accused Products
Abstract
A computer-implemented method of gathering information about a site of resources in a computer system includes the steps of retrieving a first one of the resources from the site; extracting from the first resource information about embedded hyperlinks to other resources in the site; extracting from the first resource meta-data describing aspects of the first resource and the other resources; and storing in a self-contained persistent data store the information about the hyperlinks and the meta-data.
-
Citations
37 Claims
-
1. A computer-implemented method of gathering information about a site of resources in a computer system, the method comprising:
-
retrieving a first one of the resources from the site; extracting from the first resource information about embedded hyperlinks to other resources in the site; extracting from the first resource meta-data describing aspects of the first resource and the other resources; and storing in a self-contained persistent data store the information about the hyperlinks and the meta-data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 26, 36)
-
-
23. A computer program, residing on a computer-readable storage medium, comprising instructions for causing a computer in a computer system to:
-
retrieve a first resource in a collection of resources in the computer system; extract from the first resource information about embedded hyperlinks to other resources in the collection; extract from the first resource meta-data describing aspects of the first resource and the other resources; and store in a self-contained persistent data store the information about the hyperlinks and the meta-data. - View Dependent Claims (24, 28)
-
-
25. A method of obtaining information about the hyperlink structure of a collection of resources in a computer system, the method comprising:
-
first searching the computer system for a resource containing information about the hyperlink structure of the resources, obtaining the information from the resource if such a resource is found, and otherwise obtaining the information directly from the resources in the collection.
-
-
27. A method of preventing a spider from directly accessing resources in a computer system to gather information about the hyperlink structure of the resources, the method comprising:
-
detecting when the spider is attempting to access one of the resources, and instructing the spider instead to access another resource containing information about the hyperlink structure of the resources.
-
-
29. A method of discovering differences between two collections of resources in a computer system, the method comprising:
-
generating for each collection a self-contained database containing information about the hyperlink structure of the resources and meta-data describing aspects of the resources, and comparing the databases to discover differences between the collections of resources.
-
-
30. A method of discovering orphaned resources in a collection of resources, the method comprising:
-
generating a self-contained database containing information about the hyperlink structure of the collection of resources and meta-data describing aspects of the resources, acquiring from a user a definition of resources intended to be in the collection, and comparing the contents of the database to the contents of the list to determine which of the resources intended to be in the collection of resources is not included in the hyperlink structure of the resources. - View Dependent Claims (31)
-
-
32. A method of presenting to a user information only about resources of a particular type in a larger collection of resources, the method comprising:
-
gathering a database of information about the hyperlink structure of the resources in the collection and meta-data describing aspects of the resources, including information indicating the type of each resource, filtering the database to identify resources of the particular type, and presenting to the user only information about the hyperlink structure of the resources of the particular type and meta-data describing aspects of these resources.
-
-
33. A method of enabling a user to navigate to a particular resource in a collection of resources without accessing other resources in the collection, the method comprising:
-
building a resource map containing information about the hyperlink structure of the collection of resources and meta-data describing aspects of the resources, presenting the resource map to the user, and allowing the user to select the particular resource by selecting meta-data corresponding to the particular resource and retrieving the resource for the user. - View Dependent Claims (34, 35, 37)
-
Specification