Systems and/or methods for distributed data archiving amongst a plurality of networked computing devices
First Claim
1. An archival system, comprising:
- a plurality of computers connected to a network;
at least one source data store and at least one target data store connected to the network; and
at least one archive service configured to coordinate a plurality of extract operations with a plurality of accumulate operations, each said extract operation being executed on one said computer in the plurality of computers to read data from one said source data store and each said accumulate operation being executed on one said computer in the plurality of computers to write data to one said target data store,wherein each said extract operation is configured to run on a first computer in the plurality of computers and is paired with one said accumulate operation that is configured to run on a second separately located computer, different from the first computer, in the plurality of computers,wherein the at least one archive service is further configured to implement extract rules identifying data to be read from the at least one source data store and identifying whether a schema for the at least one source data store is to be attached to the data to be read, andwherein the at least one archive service is further configured to coordinate at least one validation operation to repeatedly verify integrity of data stored in the at least one target data store, and when the validation operation(s) determine(s) that data stored in the at least one target data store lost integrity, to perform an operation on the data with lost integrity based on rules associated with the data.
1 Assignment
0 Petitions
Accused Products
Abstract
Certain example embodiments of this invention relate to system and/or methods that pair a data extractor with a data accumulator, wherein these components may be located on any one or more computers in a network system. This distributed peer extract-accumulate approach is advantageous in that it reduces (and sometimes completely eliminates) the need for a “funnel” approach to data archiving, wherein all data is moved or backed up through a central computer or central computer system. In certain example embodiments, recall-accumulate, search, verify, and/or other archive-related activities may be performed in a similar peer-based and/or distributed manner. Certain example embodiments may in addition or in the alternative incorporate techniques for verifying the integrity of data in an archive system, and/or techniques for restoring/importing data from a non-consumable form.
-
Citations
37 Claims
-
1. An archival system, comprising:
-
a plurality of computers connected to a network; at least one source data store and at least one target data store connected to the network; and at least one archive service configured to coordinate a plurality of extract operations with a plurality of accumulate operations, each said extract operation being executed on one said computer in the plurality of computers to read data from one said source data store and each said accumulate operation being executed on one said computer in the plurality of computers to write data to one said target data store, wherein each said extract operation is configured to run on a first computer in the plurality of computers and is paired with one said accumulate operation that is configured to run on a second separately located computer, different from the first computer, in the plurality of computers, wherein the at least one archive service is further configured to implement extract rules identifying data to be read from the at least one source data store and identifying whether a schema for the at least one source data store is to be attached to the data to be read, and wherein the at least one archive service is further configured to coordinate at least one validation operation to repeatedly verify integrity of data stored in the at least one target data store, and when the validation operation(s) determine(s) that data stored in the at least one target data store lost integrity, to perform an operation on the data with lost integrity based on rules associated with the data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 28, 29, 30, 31, 36, 37)
-
-
18. A method implemented using a plurality of computers connected to a network for storing data in an archival system, the method comprising:
-
enabling at least one source data store and at least one target data store connectivity to the network; enabling at least one archive service on one or more of said computers, the at least one archive service being configured to interface a plurality of extract operations with a plurality of accumulate operations, each said extract operation being configured to run on a first computer in the plurality of computers, and to coordinate at least one validation operation to repeatedly verify integrity of data stored in the at least one target data store; pairing each said extract operation with one said accumulate operation, each said accumulate operation being configured to run on a second separately located computer, different from the first computer, in the plurality of computers; implementing defined extract rules (a) identifying data to be read from the at least one source data store and (b) identifying whether a schema for the at least one source data store is to be attached to the data to be read, wherein each said extract operation is executable on one said computer in the plurality of computers to read data from one said source data store and each said accumulate operation is executable on one said computer in the plurality of computers to write data to one said target data store; and when the validation operation(s) determine(s) that data stored in the at least one target data store lost integrity, performing an operation on the data with lost integrity based on rules associated with the data. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27, 32, 33, 34)
-
-
35. A non-transitory computer readable storage medium storing instructions that, when executed, cause a computer including at least one processor to perform features comprising:
-
enabling at least one source data store and at least one target data store connected to a network; enabling at least one archive service on one or more computers connected to the network, the at least one archive service being configured to interface a plurality of extract operations with a plurality of accumulate operations, each said extract operation being configured to run on a first computer in the plurality of computers, and to coordinate at least one validation operation to repeatedly verify integrity of data stored in the at least one target data store; pairing each said extract operation with one said accumulate operation, each said accumulate operation being configured to run on a second separately located computer, different from the first computer, in the plurality of computers; and implementing defined extract rules (a) identifying data to be read from the at least one source data store and (b) identifying whether a schema for the at least one source data store is to be attached to the data to be read, wherein each said extract operation is executable on one said computer in the plurality of computers to read data from one said source data store and each said accumulate operation is executable on one said computer in the plurality of computers to write data to one said target data store; and when the validation operation(s) determine(s) that data stored in the at least one target data store lost integrity, performing an operation on the data with lost integrity based on rules associated with the data.
-
Specification