PRODUCING CHUNKS FROM INPUT DATA USING A PLURALITY OF PROCESSING ELEMENTS
Abstract
Input data is divided into multiple segments that are processed by processing elements of a computer. The processing of the segments produces a plurality of tentative sets of chunks. The plurality of tentative sets of chunks are stitched together to produce an output set of chunks.
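The divide, process, and stitch flow described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the boundary test (a sum over a small sliding window), the constants `WINDOW` and `DIVISOR`, the helper names, and the choice to let each worker look back a few bytes across its segment edge. Threads are used only to show the structure; in CPython they would not give a real speedup for this kind of work.

```python
from concurrent.futures import ThreadPoolExecutor

WINDOW = 4    # hypothetical sliding-window size, in bytes
DIVISOR = 16  # hypothetical: roughly one boundary every 16 bytes

def is_boundary(data, i):
    """True if a chunk should end after byte i (a purely local test)."""
    if i + 1 < WINDOW:
        return False
    return sum(data[i + 1 - WINDOW:i + 1]) % DIVISOR == 0

def chunk_segment(data, start, end):
    """Tentative chunks, as (begin, end) offsets, for data[start:end]."""
    chunks, begin = [], start
    for i in range(start, end):
        if is_boundary(data, i):
            chunks.append((begin, i + 1))
            begin = i + 1
    if begin < end:
        # Trailing fragment: cut by the segment edge, not by a real boundary.
        chunks.append((begin, end))
    return chunks

def stitch(data, tentative_sets):
    """Join tentative sets, merging fragments split by segment edges."""
    out = []
    for chunks in tentative_sets:
        for begin, end in chunks:
            if out and not is_boundary(data, out[-1][1] - 1):
                # Previous chunk ended at a segment edge: extend it.
                out[-1] = (out[-1][0], end)
            else:
                out.append((begin, end))
    return out

def parallel_chunks(data, n_workers=4):
    """Divide data into segments, chunk them concurrently, then stitch."""
    size = max(1, -(-len(data) // n_workers))  # ceiling division
    ranges = [(s, min(s + size, len(data))) for s in range(0, len(data), size)]
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        tentative = list(pool.map(lambda r: chunk_segment(data, *r), ranges))
    return stitch(data, tentative)
```

Because the boundary test in this sketch depends only on nearby bytes, the stitched result matches what a single sequential pass over the whole input would produce. A chunker whose decisions depend on earlier boundaries (for example, one that enforces a minimum chunk size) would need a more careful stitching step.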
20 Claims
1. A method executed by a computer including a plurality of processing elements, comprising:
dividing input data into a plurality of segments;
processing the plurality of segments, in parallel, by the processing elements, wherein processing the plurality of segments produces a plurality of tentative sets of chunks; and
stitching the plurality of tentative sets of chunks together to produce an output set of chunks.
(Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
13. A computer comprising:
a storage device to store input data; and
a plurality of processing elements to:
process segments of the input data to identify tentative chunk boundaries, wherein the processing produces at least a first tentative set of chunks that is based on a first one of the segments, and a second tentative set of chunks that is based on a second one of the segments;
extend the first tentative set until synchronization occurs; and
select a set of the chunking boundaries identified by the extended first tentative set or the second tentative set to provide as part of an output set of chunking boundaries.
(Dependent claims: 14, 15, 16, 17)
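One possible reading of "extend the first tentative set until synchronization occurs" can be sketched as follows. This is an illustrative interpretation under stated assumptions, not the patent's method: the boundary rule, the constants `WINDOW`, `DIVISOR`, and `MIN_CHUNK`, and the function names are all hypothetical. The minimum chunk size makes each decision depend on the previous boundary, so a chunker started at a segment edge may disagree with one that arrived there from the previous segment. In this sketch, synchronization means the extended first pass produces a cut point that the second pass also produced; from there on the two passes agree, so the remaining boundaries are taken from the second set.

```python
WINDOW = 4     # hypothetical sliding-window size, in bytes
DIVISOR = 16   # hypothetical boundary frequency
MIN_CHUNK = 8  # hypothetical minimum chunk length, in bytes

def is_candidate(data, i):
    """Local test: byte i may end a chunk."""
    if i + 1 < WINDOW:
        return False
    return sum(data[i + 1 - WINDOW:i + 1]) % DIVISOR == 0

def boundaries_from(data, start, stop, known=frozenset()):
    """Cut points found by chunking from `start` up to `stop`.

    Stops early at the first cut point that is also in `known`.
    """
    cuts, begin = [], start
    for i in range(start, stop):
        if i + 1 - begin >= MIN_CHUNK and is_candidate(data, i):
            cut = i + 1
            cuts.append(cut)
            begin = cut
            if cut in known:
                break  # synchronized with the other tentative set
    return cuts

def extend_and_select(data, mid):
    """Chunk two segments independently, then reconcile their boundaries."""
    # The two tentative passes are independent and could run on
    # separate processing elements; they run in sequence here for brevity.
    first = boundaries_from(data, 0, mid)
    second = boundaries_from(data, mid, len(data))

    # Extend the first set into the second segment until it meets a
    # cut point that the second pass also found.
    resume = first[-1] if first else 0
    extension = boundaries_from(data, resume, len(data), known=set(second))
    extended = first + extension

    if extension and extension[-1] in second:
        sync = extension[-1]
        tail = [c for c in second if c > sync]  # reuse the second set
    else:
        tail = []  # never synchronized: the extension covers the rest
    return extended + tail
```

In this sketch the selected boundaries equal those of a single sequential pass, since both passes carry the same state (the position of the last cut) once they share a cut point.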
18. An article comprising at least one computer-readable storage medium containing instructions that when executed cause a computer to:
process segments, by a plurality of processing elements in parallel, of the input data to identify chunk boundaries, wherein the processing produces at least a first tentative set of chunks that is based on at least first and second ones of the segments, and a second tentative set of chunks that is based on at least the second segment; and
perform harmonization to resolve inconsistencies in the chunk boundaries identified by the first and second tentative sets.
(Dependent claims: 19, 20)
Specification