Information archival and retrieval system for internetworked computers
First Claim
Patent Images
1. A computerized system for archiving and retrieving content collected from electronic addresses over time, comprising:
- data stored in a computer-accessible organized structure, including a content storage structure having data related to a plurality of archived content files stored therein, and a provider storage structure having data related to a plurality of content providers stored therein;
a content storage module in communication with the content storage structure and a source electronic address, the source electronic address referencing a source content file having source data therein, the content storage module;
deriving an archived data file having archived content therein from the source content file referenced by the source electronic address, andstoring the archived content file in the content storage structure with a content timestamp;
a mechanism in communication with the content storage structure and the provider storage structure, the mechanism determining a source content provider from the provider storage structure responsible for the source content at the source electronic address, the mechanism associating the archived content file within the content storage structure with the source content provider within the provider storage structure based on the content timestamp;
an indexer in communication with the content storage structure, the indexer calculating a searchable electronic index of the archived content of the archived content file;
a user interface module in communication with a user, the user interface module soliciting a query parameter from the user for a desired content, the desired content having a match within the archived content of the archived content file;
a query engine in communication with the user interface module and the searchable electronic index, the query engine identifying the archived content file based on the query parameter and the searchable electronic index; and
a query result presented to the user by the user interface module in response to the identified archived content file, the query result including a representation of the archived content file and the associated source content provider.
0 Assignments
0 Petitions
Accused Products
Abstract
A computing system can archive information from internetworked computers, such as Internet content, for later retrieval. A server system processes content providers, such as DNS registries and web sites, to extract and store content, including text, image, audio, and video content. For web sites, HTML source code is stored along with a browser-rendered display file. The content is perpetually archived to create a historical record of information for each content provider. An interface is used to retrieve the archived content in response to queries.
4 Citations
42 Claims
-
1. A computerized system for archiving and retrieving content collected from electronic addresses over time, comprising:
-
data stored in a computer-accessible organized structure, including a content storage structure having data related to a plurality of archived content files stored therein, and a provider storage structure having data related to a plurality of content providers stored therein; a content storage module in communication with the content storage structure and a source electronic address, the source electronic address referencing a source content file having source data therein, the content storage module; deriving an archived data file having archived content therein from the source content file referenced by the source electronic address, and storing the archived content file in the content storage structure with a content timestamp; a mechanism in communication with the content storage structure and the provider storage structure, the mechanism determining a source content provider from the provider storage structure responsible for the source content at the source electronic address, the mechanism associating the archived content file within the content storage structure with the source content provider within the provider storage structure based on the content timestamp; an indexer in communication with the content storage structure, the indexer calculating a searchable electronic index of the archived content of the archived content file; a user interface module in communication with a user, the user interface module soliciting a query parameter from the user for a desired content, the desired content having a match within the archived content of the archived content file; a query engine in communication with the user interface module and the searchable electronic index, the query engine identifying the archived content file based on the query parameter and the searchable electronic index; and a query result presented to the user by the user interface module in response to the identified archived content file, the query result including a representation of the archived content file and the associated source content provider. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A computerized method for archiving and retrieving content collected from electronic addresses, comprising:
-
storing data in a computer-accessible organized structure, including a content storage structure having data related to a plurality of archived content files stored therein, and a provider storage structure having data related to a plurality of content providers stored therein; from a content storage module in communication with the content storage structure and a source electronic address, the source electronic address referencing a source content file having source data therein; deriving an archived data file having archived content therein from the source content file referenced by the source electronic address, and storing the archived content file in the content storage structure with a content timestamp; from a mechanism in communication with the content storage structure and the provider storage structure; determining a source content provider from the provider storage structure responsible for the source content at the source electronic address, and associating the archived content file within the content storage structure with the source content provider within the provider storage structure based on the content timestamp; in an indexer in communication with the content storage structure, calculating a searchable electronic index of the archived content of the archived content file; in a user interface module communicating with a user, soliciting a query parameter from the user for a desired content, the desired content having a match within the archived content of the archived content file; in a query engine communicating with the user interface module and the searchable electronic index, identifying the archived content file based on the query parameter and the searchable electronic index; and from the user interface module, presenting a query result to the user in response to the identified archived content file, the query result including a representation of the archived content file and the associated source content provider. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26)
-
-
27. A computerized system for archiving content collected from electronic addresses over time, comprising:
-
data stored in a computer-accessible organized structure, including a content storage structure having data related to a plurality of archived content files stored therein, and a provider storage structure having data related to a plurality of content providers stored therein; a content storage module in communication with the content storage structure and a source electronic addresses, the source electronic address referencing a source content file having source data therein, the content storage module; deriving an archived content file having archived content therein from the source content file referenced by a source electronic address, storing the archived content file in the content storage structure with a content timestamp, and wherein the content storage structure stores a plurality of archived content files associated with the source electronic address, each of the archived content files having a different archived content and a different content timestamp; and a mechanism in communication with the content storage structure and the provider storage structure, the mechanism determining a source content provider from the provider storage structure responsible for the source content at the source electronic address, the mechanism associating the archived content file within the content storage structure with the source content provider within the provider storage structure based on the content timestamp. - View Dependent Claims (28, 29, 30, 31, 32, 33, 34)
-
-
35. A computerized method for archiving and retrieving content collected from electronic addresses, comprising:
-
storing data in a computer-accessible organized structure, including a content storage structure having data related to a plurality of archived content files stored therein, and a provider storage structure having data related to a plurality of content providers stored therein; from a content storage module in communication with the content storage structure and a source electronic address, the source electronic address referencing a source content file having source data therein; deriving an archived data file having archived content therein from the source content file referenced by the source electronic address, storing the archived content file in the content storage structure with a content timestamp, and wherein the content storage structure stores a plurality of archived content files associated with the source electronic address, each of the archived content files having a different content timestamp; and from a mechanism in communication with the content storage structure and the provider storage structure; determining a source content provider from the provider storage structure responsible for the source content at the source electronic address, and associating the archived content file within the content storage structure with the source content provider within the provider storage structure based on the content timestamp. - View Dependent Claims (36, 37, 38, 39, 40, 41, 42)
-
Specification