Systems and methods for searching of storage data with reduced bandwidth requirements
First Claim
Patent Images
1. A method, comprising:
- at a first location, determining a set of distinguishing characteristics associated with the first data;
transmitting the determined set of first data distinguishing characteristics from the first location to a remote location;
at a remote location, comparing the determined set of first data distinguishing characteristics to one or more sets of remote data distinguishing characteristics to identify remote data stored at the remote location that is similar to the first data, wherein similarity is a function of a similarity threshold; and
determining one or more differences between the first data and the identified similar remote data, wherein, once similar remote data has been identified, any differences between the first data and the identified similar remote data are determined without transmitting all of the first data to the remote location and without transmitting all of the identified similar remote data to the first location.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods enabling search of a repository for the location of data that is similar to input data, using a defined measure of similarity, in a time that is independent of the size of the repository and linear in a size of the input data, and a space that is proportional to a small fraction of the size of the repository. Additionally, remote operations are accomplished with significantly reduced system bandwidth by implementing remote differencing operations.
261 Citations
46 Claims
-
1. A method, comprising:
-
at a first location, determining a set of distinguishing characteristics associated with the first data;
transmitting the determined set of first data distinguishing characteristics from the first location to a remote location;
at a remote location, comparing the determined set of first data distinguishing characteristics to one or more sets of remote data distinguishing characteristics to identify remote data stored at the remote location that is similar to the first data, wherein similarity is a function of a similarity threshold; and
determining one or more differences between the first data and the identified similar remote data, wherein, once similar remote data has been identified, any differences between the first data and the identified similar remote data are determined without transmitting all of the first data to the remote location and without transmitting all of the identified similar remote data to the first location. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A method, comprising:
-
receiving, at a remote location, a set of first data distinguishing characteristics from a first location, the set of first data distinguishing characteristics associated with first data as determined at the first location;
at the remote location, comparing the set of first data distinguishing characteristics to one or more sets of remote data distinguishing characteristics to identify remote data stored at the remote location that is similar to the first data, wherein similarity is a function of a similarity threshold; and
via communication between the first location and the remote location, determining one or more differences between the first data and the identified similar remote data, wherein, once similar remote data has been identified, the differences between the first data and the identified similar remote data are determined without all of the first data being received at the remote location. - View Dependent Claims (14, 15, 16)
-
-
17. A method, comprising:
-
receiving, at a remote location, a set of first data distinguishing characteristics from a first location, the set of first data distinguishing characteristics associated with first data as determined at the first location;
at the remote location, comparing the set of first data distinguishing characteristics to one or more sets of remote data distinguishing characteristics to identify remote data stored at the remote location that is similar to the first data, wherein similarity is a function of a similarity threshold; and
via communication between the first location and the remote location, determining one or more differences between the first data and the identified similar remote data. - View Dependent Claims (18, 19)
-
-
20. A system, comprising:
-
means for determining, at a first location, a set of distinguishing characteristics associated with the first data;
means for transmitting the determined set of first data distinguishing characteristics from the first location to a remote location;
means for comparing, at a remote location, the determined set of first data distinguishing characteristics to one or more sets of remote data distinguishing characteristics to identify remote data stored at the remote location that is similar to the first data, wherein similarity is a function of a similarity threshold; and
means for determining one or more differences between the first data and the identified similar remote data, wherein, once similar remote data has been identified, any differences between the first data and the identified similar remote data are determined without transmitting all of the first data to the remote location and without transmitting all of the identified similar remote data to the first location. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31)
-
-
32. A system, comprising:
-
means for receiving, at a remote location, a set of first data distinguishing characteristics from a first location, the set of first data distinguishing characteristics associated with first data as determined at the first location;
means for comparing, at the remote location, the set of first data distinguishing characteristics to one or more sets of remote data distinguishing characteristics to identify remote data stored at the remote location that is similar to the first data, wherein similarity is a function of a similarity threshold; and
means for determining, via communication between the first location and the remote location, one or more differences between the first data and the identified similar remote data, wherein, once similar remote data has been identified, the differences between the first data and the identified similar remote data are determined without all of the first data being received at the remote location. - View Dependent Claims (33, 34, 35)
-
-
36. A system, comprising:
-
means for receiving, at a remote location, a set of first data distinguishing characteristics from a first location, the set of first data distinguishing characteristics associated with first data as determined at the first location;
means for comparing, at the remote location, the set of first data distinguishing characteristics to one or more sets of remote data distinguishing characteristics to identify remote data stored at the remote location that is similar to the first data, wherein similarity is a function of a similarity threshold; and
means for determining, via communication between the first location and the remote location, one or more differences between the first data and the identified similar remote data. - View Dependent Claims (37, 38)
-
-
39. A system, comprising:
-
a processor;
a memory coupled to the processor; and
a local data repository coupled to the processor, wherein the processor and the memory are configured to perform a method comprising;
receiving a set of first data distinguishing characteristics from a first location, the set of first data distinguishing characteristics associated with first data as determined at the first location;
comparing the set of first data distinguishing characteristics to one or more sets of local data distinguishing characteristics to identify local data stored in the local data repository that is similar to the first data, wherein similarity is a function of a similarity threshold; and
via communication with the first location, determining one or more differences between the first data and the identified similar local data, wherein, once similar local data has been identified, the differences between the first data and the identified similar local data are determined without receiving all of the first data at the local data repository. - View Dependent Claims (40, 41, 42)
-
-
43. A computer-readable medium encoded with a computer program that causes a computer to perform a method comprising:
-
receiving, at a local location, a set of first data distinguishing characteristics from a first location, the set of first data distinguishing characteristics associated with first data as determined at the first location;
comparing the set of first data distinguishing characteristics to one or more sets of local data distinguishing characteristics to identify local data stored at a local repository that is similar to the first data, wherein similarity is a function of a similarity threshold; and
determining, via communication with the first location, one or more differences between the first data and the identified similar local data, wherein, once similar local data has been identified, the differences between the first data and the identified similar local data are determined without all of the first data being received at the local location. - View Dependent Claims (44, 45, 46)
-
Specification