System and Method for Summarizing Data
Abstract
Described are computer-based methods and apparatuses, including computer program products, for removing redundant data from a storage system. In one example, a data delineation process delineates data targeted for de-duplication into regions using a plurality of markers. The de-duplication system determines which of these regions should be subject to further de-duplication processing by comparing metadata representing the regions to metadata representing regions of a reference data set. The de-duplication system identifies an area of data that incorporates the regions that should be subject to further de-duplication processing and de-duplicates this area with reference to a corresponding area within the reference data set.
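The flow in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the marker condition (a small window whose byte sum has its low bits clear), the use of a SHA-256 digest as the region metadata, and the hypothetical names `find_markers`, `delineate`, `region_digests`, and `candidate_area`.

```python
import hashlib

WINDOW = 4    # assumed window size used to detect a marker
MASK = 0x07   # assumed condition: low three bits of the window's byte sum are zero


def find_markers(data):
    """Return offsets just past each window that satisfies the assumed marker condition."""
    markers = []
    for end in range(WINDOW, len(data) + 1):
        if sum(data[end - WINDOW:end]) & MASK == 0:
            markers.append(end)
    return markers


def delineate(data):
    """Split data into contiguous (start, end) regions bounded by marker offsets."""
    inner = [m for m in find_markers(data) if 0 < m < len(data)]
    bounds = [0] + inner + [len(data)]
    return [(a, b) for a, b in zip(bounds, bounds[1:]) if a < b]


def region_digests(data):
    """Pair each region with a digest standing in for the region's metadata."""
    return [(s, e, hashlib.sha256(data[s:e]).hexdigest()) for s, e in delineate(data)]


def candidate_area(target, reference):
    """Return one (start, end) area covering every target region whose digest
    also appears in the reference data set, or None if no region matches."""
    ref = {digest for _, _, digest in region_digests(reference)}
    hits = [(s, e) for s, e, digest in region_digests(target) if digest in ref]
    if not hits:
        return None
    return (hits[0][0], hits[-1][1])
```

In this sketch, the area returned by `candidate_area` is what a later, finer-grained de-duplication pass would compare against the corresponding area of the reference data; that pass is not shown.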
20 Claims
1. A computer implemented method of characterizing data being associated with a plurality of location identifiers, each location identifier of the plurality of location identifiers identifying a location within the data where a particular pattern of data is stored, the method comprising:
- identifying a first portion of the data based on a location of the first portion relative to a location identified by at least one of the plurality of location identifiers; and
- determining a first plurality of summaries associated with the at least one of the plurality of location identifiers, at least one summary of the first plurality of summaries indicating a pattern of stored data included in the first portion.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. A system configured to characterize data, the system comprising:
- data storage storing the data and a plurality of location identifiers, each of the plurality of location identifiers identifying a location within the data where a particular pattern of data is stored; and
- a processor coupled to the data storage and configured to:
  - identify a first portion of the data based on a location of the first portion relative to a location identified by at least one of the plurality of location identifiers; and
  - determine a first plurality of summaries associated with the at least one of the plurality of location identifiers, at least one summary of the first plurality of summaries indicating a pattern of stored data included in the first portion.
Dependent claims: 10, 11, 12, 13, 14, 15, 16
17. A non-transitory computer readable medium storing computer readable instructions that, when executed by at least one processor, instruct the at least one processor to perform a method of characterizing data being associated with a plurality of location identifiers, each location identifier of the plurality of location identifiers identifying a location within the data where a particular pattern of data is stored, the method comprising:
- identifying a first portion of the data based on a location of the first portion relative to a location identified by at least one of the plurality of location identifiers; and
- determining a first plurality of summaries associated with the at least one of the plurality of location identifiers, at least one summary of the first plurality of summaries indicating a pattern of stored data included in the first portion.
Dependent claims: 18, 19, 20
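The two steps recited in claim 1 can be sketched as follows. This is an illustrative reading under stated assumptions, not the claimed implementation: location identifiers are modeled as byte offsets, each portion is a window at a fixed offset relative to the identified location, and a summary is a truncated SHA-256 digest. The names `summarize`, `characterize`, and `rel_windows`, and the window sizes, are hypothetical.

```python
import hashlib


def summarize(portion):
    """Assumed summary: a truncated digest of the bytes stored in the portion."""
    return hashlib.sha256(portion).hexdigest()[:16]


def characterize(data, location_ids, rel_windows=((-8, 0), (0, 8))):
    """For each location identifier, identify portions of the data positioned
    relative to the identified location and determine one summary per portion.

    rel_windows holds (start, end) offsets relative to the location; the
    defaults (8 bytes before and 8 bytes after) are assumptions.
    """
    summaries_by_location = {}
    for loc in location_ids:
        summaries = []
        for rel_start, rel_end in rel_windows:
            # Clamp the window so it stays inside the data.
            start = max(0, loc + rel_start)
            end = min(len(data), loc + rel_end)
            if start < end:
                summaries.append(summarize(data[start:end]))
        summaries_by_location[loc] = summaries
    return summaries_by_location
```

Calling `characterize(data, [8])` on data of at least 16 bytes yields two summaries for the identifier at offset 8, one for the eight bytes before it and one for the eight bytes after it. An identifier at offset 0 has no bytes before it, so it yields only one summary.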
Specification