Web scan process
Abstract
An information locator system providing for the expedient acquisition, validation and updating of information locators in a heterogeneous network protocol environment. The locator system includes an information location discrimination engine coupleable to a network operating in the heterogeneous network protocol environment, a validation engine coupled to the information location discrimination engine to receive information locators and a database providing for the storage of information locators as discrete searchable resource locators. The validation engine is also connected to the database for retrieving and storing resource locators. The validation engine provides for the autonomous interrogation of the heterogeneous network protocol environment to validate a predetermined information locator as a corresponding resource locator that is unique to the discrete searchable resource locators then stored by the database. Where a valid and inferred unique information locator is found, the validation engine provides a corresponding resource locator to the database for subsequently searchable storage.
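The uniqueness test mentioned in the abstract can be illustrated with a small sketch. The abstract does not say how uniqueness is inferred, so the normalization rules below (lowercased host, default port dropped, fragment dropped) are assumptions, and the names normalize_locator and LocatorStore are hypothetical.

```python
from urllib.parse import urlsplit, urlunsplit

# Assumed default ports; a locator naming its default port is treated
# as equal to one that omits the port.
DEFAULT_PORTS = {"http": 80, "https": 443, "ftp": 21}


def normalize_locator(url: str) -> str:
    """Reduce a URL to a comparison key (illustrative rules only).

    Simplifications: user info is discarded and IPv6 literals are not
    re-bracketed, so this is not a general-purpose canonicalizer.
    """
    parts = urlsplit(url.strip())
    scheme = parts.scheme.lower()
    host = (parts.hostname or "").lower()
    port = parts.port
    if port is None or DEFAULT_PORTS.get(scheme) == port:
        netloc = host
    else:
        netloc = f"{host}:{port}"
    path = parts.path or "/"
    # The fragment is dropped because it does not identify a separate resource.
    return urlunsplit((scheme, netloc, path, parts.query, ""))


class LocatorStore:
    """In-memory stand-in for the database of stored resource locators."""

    def __init__(self) -> None:
        self._known = set()

    def add_if_unique(self, url: str) -> bool:
        """Store the locator and return True only if no equivalent one is stored."""
        key = normalize_locator(url)
        if key in self._known:
            return False
        self._known.add(key)
        return True
```

Under these assumed rules, "http://example.com" and "HTTP://EXAMPLE.com:80/#top" map to the same key, so only the first would be stored.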
10 Claims
1. A system of autonomously maintaining a searchable database of information accessible over the Internet, said system comprising:
a) a discrimination system coupleable to the Internet to receive messages including electronic mail messages and network news messages, said discrimination processing said electronic mail and network news messages to identify embedded URLs; and
b) a validation system coupleable to the Internet, said validation system coupled to said discrimination system to receive a predetermined embedded URL, said validation system enabling an access of the Internet to retrieve Web page information associated with said predetermined embedded URL; and
c) a database for searchably storing said predetermined embedded URL in association with the Web page information associated with said predetermined embedded URL.
Dependent claims: 2, 3, 4, 5
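The three elements of claim 1 can be pictured with a minimal sketch: URLs are found in message text, each URL is fetched, and the URL is stored with its page text for later search. Everything here is an assumption for illustration rather than the patented implementation: the regular expression, the SQLite table layout, and the function names find_embedded_urls, fetch_page, open_database, process_message and search are all hypothetical.

```python
import re
import sqlite3
from http.client import HTTPException
from typing import Callable, List, Optional
from urllib.request import urlopen

# Deliberately loose pattern; real messages need more careful parsing.
URL_PATTERN = re.compile(r"https?://[^\s<>]+")


def find_embedded_urls(message_text: str) -> List[str]:
    """Discrimination step: collect URL-like strings, first occurrence only."""
    seen, urls = set(), []
    for match in URL_PATTERN.findall(message_text):
        url = match.rstrip(".,;:!?)\"'")  # drop trailing sentence punctuation
        if url not in seen:
            seen.add(url)
            urls.append(url)
    return urls


def fetch_page(url: str, timeout: float = 10.0) -> Optional[str]:
    """Validation step: return the page text, or None if retrieval fails."""
    try:
        with urlopen(url, timeout=timeout) as response:
            return response.read().decode("utf-8", errors="replace")
    except (OSError, ValueError, HTTPException):
        return None


def open_database(path: str = ":memory:") -> sqlite3.Connection:
    """Database step: one row per URL, holding the retrieved page text."""
    db = sqlite3.connect(path)
    db.execute(
        "CREATE TABLE IF NOT EXISTS pages (url TEXT PRIMARY KEY, content TEXT NOT NULL)"
    )
    return db


def process_message(
    db: sqlite3.Connection,
    message_text: str,
    fetch: Callable[[str], Optional[str]] = fetch_page,
) -> int:
    """Run all three steps on one message; return how many URLs were stored."""
    stored = 0
    for url in find_embedded_urls(message_text):
        content = fetch(url)
        if content is None:
            continue  # URL could not be validated, so it is not stored
        db.execute(
            "INSERT OR REPLACE INTO pages (url, content) VALUES (?, ?)",
            (url, content),
        )
        stored += 1
    db.commit()
    return stored


def search(db: sqlite3.Connection, term: str) -> List[str]:
    """Return stored URLs whose page text contains the term."""
    rows = db.execute(
        "SELECT url FROM pages WHERE content LIKE ? ORDER BY url", (f"%{term}%",)
    )
    return [row[0] for row in rows]
```

The fetch parameter is injectable so the flow can be exercised without network access, for example by passing a function that looks URLs up in a dictionary.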
6. A method of maintaining the currentness of a listing of URLs used to establish a searchable state of at least a portion of the Internet, said method comprising the steps of:
a) associating predetermined volatility data with a predetermined URL within a list of URLs;
b) associating predetermined contextual data with said predetermined URL; and
c) periodically validating the currentness of said predetermined URL, including
  i) determining whether said predetermined volatility data corresponds to predetermined validation criteria;
  ii) determining the validity of said predetermined URL;
  iii) determining the currentness of said predetermined contextual data associated with said predetermined URL where said predetermined URL is determined to be valid;
  iv) accessing at least a portion of the Internet to update said predetermined contextual data associated with said predetermined URL where said predetermined contextual data is determined to be not current; and
  v) updating said predetermined volatility data to reflect the validity, currentness, and frequency that said predetermined contextual data is updated.
Dependent claims: 7, 8, 9, 10
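Steps c) i) through v) of claim 6 can be read as a revalidation loop body, sketched below. The claim does not define the volatility data or the validation criteria, so the fields of UrlEntry, the "due when the interval has elapsed" test, and the halve-or-double interval policy are all assumptions, as are the names UrlEntry and revalidate. For brevity a single fetch serves both the validity check and the currentness comparison.

```python
from dataclasses import dataclass
from typing import Callable, Optional

MIN_INTERVAL = 3600.0          # assumed lower bound: one hour
MAX_INTERVAL = 30 * 86400.0    # assumed upper bound: thirty days


@dataclass
class UrlEntry:
    url: str
    context: str = ""               # contextual data (here: the page text)
    valid: bool = True
    # Assumed volatility data: when to check next and how often content changed.
    last_checked: float = 0.0
    check_interval: float = 86400.0
    checks: int = 0
    changes: int = 0


def revalidate(
    entry: UrlEntry,
    fetch: Callable[[str], Optional[str]],
    now: float,
) -> bool:
    """Revalidate one entry; return True if a check was actually performed."""
    # i) Do the volatility data meet the (assumed) criterion that a check is due?
    if now - entry.last_checked < entry.check_interval:
        return False

    # ii) Validity: the URL is treated as valid if the fetch returns content.
    page = fetch(entry.url)
    entry.valid = page is not None

    # iii) Currentness: stored contextual data is current if it matches the page.
    changed = entry.valid and page != entry.context
    if changed:
        # iv) Update the stored contextual data from the retrieved page.
        entry.context = page

    # v) Update volatility data: check sooner after a change, later otherwise.
    entry.checks += 1
    entry.changes += int(changed)
    factor = 0.5 if changed else 2.0
    entry.check_interval = min(
        MAX_INTERVAL, max(MIN_INTERVAL, entry.check_interval * factor)
    )
    entry.last_checked = now
    return True
```

Passing the clock value and the fetch function as arguments keeps the sketch deterministic; a scheduler would call revalidate for each entry in the list with the current time.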
Specification