Detecting Duplicative Hierarchical Sets Of Files
Abstract
To detect duplicative hierarchically arranged sets of files in a storage system, a method includes generating, for hierarchically arranged plural sets of files, respective collections of values computed based on files in corresponding sets of files. For a further set of files that is an ancestor of at least one of the plural sets of files, the method generates a respective collection of values based on the collection of values computed for the at least one set. Duplicative sets are then identified according to comparisons of the collections of values.
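The bottom-up generation described in the abstract can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the names `file_value` and `build_collections`, the use of SHA-256 content hashes as the "values", and the choice to keep only the `k` smallest values per directory. The point it shows is that a parent directory's collection can be derived from its children's collections without re-reading the files beneath them.

```python
import hashlib
import os


def file_value(path):
    """Hypothetical per-file value: first 8 bytes of the SHA-256 of the contents."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(65536), b""):
            h.update(chunk)
    return int.from_bytes(h.digest()[:8], "big")


def build_collections(root, k=64):
    """Return {directory path: sorted list of at most k smallest values}.

    Assumption: a directory's collection is the k smallest values among its
    own files and the collections already computed for its subdirectories.
    """
    collections = {}
    # topdown=False yields every subdirectory before its parent.
    for dirpath, dirnames, filenames in os.walk(root, topdown=False):
        values = set()
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.path.isfile(path):  # skip broken symlinks and the like
                values.add(file_value(path))
        for name in dirnames:
            # Reuse the child's collection; symlinked directories are not
            # walked by default, so they may be absent from the dict.
            values.update(collections.get(os.path.join(dirpath, name), ()))
        collections[dirpath] = sorted(values)[:k]
    return collections
```

Under these assumptions, two directories that hold the same file contents receive identical collections regardless of file names, and truncating to the `k` smallest values keeps every collection bounded in size while still letting a parent be computed purely from its children.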
56 Citations
15 Claims
1. A method of detecting duplicative hierarchically arranged sets of files in a storage system, comprising:
- generating, for hierarchically arranged plural sets of files, respective collections of values computed based on files in corresponding sets of files;
- generating, for a further set of files that is an ancestor of at least one of the plural sets of files, a respective collection of values that is based on the collection of values computed for the at least one set; and
- identifying duplicative sets according to comparisons of the collections of values.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13.
14. A method of detecting duplicative directories, comprising:
- computing sketches for corresponding plural directories, wherein the sketches contain values based on files in the directories;
- computing a further sketch for a further directory that is an ancestor of at least one of the plural directories, wherein the further sketch contains one or more values from the sketch of the at least one of the plural directories; and
- identifying duplicative directories based on comparing the sketches.
15. An article comprising at least one computer-readable storage medium containing instructions that when executed cause a computer to:
- generate, for hierarchically arranged plural sets of files, respective collections of values computed based on files in corresponding sets of files;
- generate, for a further set of files that is an ancestor of at least one of the plural sets of files, a respective collection of values that is based on the collection of values computed for the at least one set; and
- identify duplicative sets according to comparisons of the collections of values.
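The final step of each independent claim, identifying duplicates by comparing the collections (or sketches), might look like the following Python sketch. The claims do not specify a comparison measure or a threshold; the Jaccard overlap, the `threshold` default, and the names `jaccard` and `find_duplicates` are assumptions chosen only to make the idea concrete.

```python
from itertools import combinations


def jaccard(a, b):
    """Overlap of two collections of values: |A & B| / |A | B|."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0  # assumption: two empty collections are not reported as duplicates
    return len(a & b) / len(a | b)


def find_duplicates(sketches, threshold=0.9):
    """Return pairs of names whose collections overlap by at least `threshold`.

    `sketches` maps a directory name to its collection of values.
    """
    names = sorted(sketches)
    return [
        (x, y)
        for x, y in combinations(names, 2)
        if jaccard(sketches[x], sketches[y]) >= threshold
    ]


if __name__ == "__main__":
    example = {"/a": [1, 2, 3], "/b": [1, 2, 3], "/c": [7, 8, 9]}
    print(find_duplicates(example))  # [('/a', '/b')]
```

This compares every pair, which is quadratic in the number of directories, so it is suitable only as an illustration. Note also that a directory whose entire content is a single subdirectory has the same collection as that subdirectory, so a practical tool would likely filter out ancestor/descendant pairs before reporting.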
Specification