DATA PROCESSING APPARATUS AND METHOD OF PROCESSING DATA
First Claim
1. Data processing apparatus comprising:
- a chunk store containing specimen data chunks,a manifest store containing a plurality of manifests, each of which represents at least a part of previously processed data and comprises at least one reference to at least one of said specimen data chunks,a sparse chunk index containing information on only some specimen data chunks,the processing apparatus being operable to;
process input data into a plurality of input data segments, each composed of input data chunks;
identify a first set of manifests, where each manifest of the first set has at least one reference to one of said specimen data chunks that corresponds to one of the input data chunks of a first input data segment, and on which there is information contained in the sparse chunk index;
identify specimen data chunks corresponding to other input data chunks of the first input data segment by using the identified first set of manifests and at least one manifest identified when processing previous data.
2 Assignments
0 Petitions
Accused Products
Abstract
One embodiment is a data processing apparatus that has a chunk store containing specimen data chunks, a manifest store containing a plurality of manifests, each of which represents at least a part of previously processed data and includes at least one reference to at least one of the specimen data chunks, and a sparse chunk index containing information on only some specimen data chunks. Input data is processed into a plurality of input data segments. Each manifest of the first set has at least one reference to one of said specimen data chunks that corresponds to one of the input data chunks of a first input data segment. Specimen data chunks corresponding to other input data chunks of the first input data segment are identified by using the identified first set of manifests and at least one manifest identified when processing previous data.
-
Citations
15 Claims
-
1. Data processing apparatus comprising:
-
a chunk store containing specimen data chunks, a manifest store containing a plurality of manifests, each of which represents at least a part of previously processed data and comprises at least one reference to at least one of said specimen data chunks, a sparse chunk index containing information on only some specimen data chunks, the processing apparatus being operable to; process input data into a plurality of input data segments, each composed of input data chunks; identify a first set of manifests, where each manifest of the first set has at least one reference to one of said specimen data chunks that corresponds to one of the input data chunks of a first input data segment, and on which there is information contained in the sparse chunk index; identify specimen data chunks corresponding to other input data chunks of the first input data segment by using the identified first set of manifests and at least one manifest identified when processing previous data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. Data processing apparatus comprising:
-
a chunk store containing specimen data chunks, a manifest store containing a plurality of manifests, each of which represents at least a part of previously processed data and comprises at least one reference to at least one of said specimen data chunks, a sparse chunk index containing information on only some specimen data chunks, the processing apparatus being operable to; process input data into a plurality of input data segments, each composed of input data chunks; identify a first set of manifests, where each manifest of the first set has references to specimen data chunks which correspond to input data chunks of a first input data segment and on which there is information contained in the sparse chunk index; retrieve manifests having references to specimen data chunks which correspond to at least one input data chunk of previously processed data, use the identified and retrieved manifests to identify specimen data chunks corresponding to other input data chunks of the first input data segment - View Dependent Claims (11, 12)
-
-
13. A data processor, operable to:
-
process input data into input data chunks, the input data chunks being arranged into input data segments; for a given input data segment, select at least some of the input data chunks of the input data segment having a predetermined characteristic, compile a list of candidate manifests from a manifest store, the list comprising; at least one manifest having a reference to a specimen data chunk corresponding to at least one of the selected input data chunks; and at least one manifest identified in processing at least one other segment of input data; and prioritise and process the candidate manifests to identify specimen data chunks corresponding to other input data chunks of the input data segment being processed.
-
-
14. A method of processing data, using:
-
a chunk store containing specimen data chunks, a manifest store containing a plurality of manifests, each of which represents at least a part of previously processed data and comprises at least one reference to at least one of said specimen data chunks; and a sparse chunk index containing information on only some specimen data chunks, the method comprising; processing input data into a plurality of input data segments, each composed of input data chunks; identifying a first set of manifests, where each manifest of the first set has at least one reference to one of said specimen data chunks that corresponds to one of the input data chunks of a first input data segment and on which there is information contained in the sparse chunk index; and using the identified first set of manifests and at least one manifest identified when processing previous input data to identify specimen data chunks corresponding to other input data chunks of the first input data segment - View Dependent Claims (15)
-
Specification