Creating secondary copies of data based on searches for content
First Claim
Patent Images
1. A computer-implemented method for use in a data storage environment, the method comprising:
- searching, with a computing device, backup copies of data within a data store to identify a first set of data that satisfies first search criteria,wherein the backup copies of data within the data store are copies of production data generated by at least one client device,wherein the data store is a secondary storage device that stores backup copies of production data generated by client devices,wherein the secondary storage device is remotely located from the at least one client device;
searching, with the computing device, the backup copies of data within the data store to identify a second set of data that satisfies second search criteria,wherein the second search criteria is different than the first search criteria and is associated with the first search criteria based on a business entity relationship that includes information about subsidiary business entities, with respect to one or more business entities identified in the first set of data that satisfies the first search criteria;
creating a search-based secondary copy of at least two different sets of data,wherein the at least two different sets of data include the identified first set of data that satisfies the first search criteria and the identified second set of data that satisfies the second search criteria, andwherein the search-based secondary copy is created from the backup copies of production data stored on the secondary storage device;
transferring the search-based secondary copy to the secondary storage device;
receiving a request for restoring data;
identifying requested data in the secondary storage device that satisfies the request;
determining whether the identified requested data includes certain data that is also part of the search-based secondary copy; and
when the identified requested data includes the certain data, restoring the search-based secondary copy that includes the certain data in addition to restoring the identified requested data,wherein, by creating the search-based secondary copy and then later restoring the search-based secondary copy along with the identified requested data, the method assists in reducing restoration requests because the method anticipates data that may be associated with a later restoration request,wherein the method restores the identified requested data along with data deemed to be associated with the identified requested data based on a similarity of content determined by a previous search for content within the data store,wherein the request to restore data is a request to restore Microsoft Exchange email data from a certain day via a Microsoft Exchange data agent,wherein the method identifies other data in the search-based secondary copy whose content is similar to content within the email data for the certain day, and restores the other data from the search-based secondary copy, andwherein the similarity is based on sender names or keywords from subject lines in the email data,so that the method recovers data from the search-based secondary copy that may supplement or augment the identified requested data from the request for restoring data.
4 Assignments
0 Petitions
Accused Products
Abstract
A method and system for creating secondary copies of data whose contents satisfy searches within data stores is described. In some cases, the system searches for data within a data store, identifies a set of data that satisfies the search, copies the identified set of data, and transfers the copy to secondary or other storage. In some cases, the system utilizes search-based secondary copies of days during restoration processes in order to restore data similar to and/or associated with data requested to be restored.
312 Citations
20 Claims
-
1. A computer-implemented method for use in a data storage environment, the method comprising:
-
searching, with a computing device, backup copies of data within a data store to identify a first set of data that satisfies first search criteria, wherein the backup copies of data within the data store are copies of production data generated by at least one client device, wherein the data store is a secondary storage device that stores backup copies of production data generated by client devices, wherein the secondary storage device is remotely located from the at least one client device; searching, with the computing device, the backup copies of data within the data store to identify a second set of data that satisfies second search criteria, wherein the second search criteria is different than the first search criteria and is associated with the first search criteria based on a business entity relationship that includes information about subsidiary business entities, with respect to one or more business entities identified in the first set of data that satisfies the first search criteria; creating a search-based secondary copy of at least two different sets of data, wherein the at least two different sets of data include the identified first set of data that satisfies the first search criteria and the identified second set of data that satisfies the second search criteria, and wherein the search-based secondary copy is created from the backup copies of production data stored on the secondary storage device; transferring the search-based secondary copy to the secondary storage device; receiving a request for restoring data; identifying requested data in the secondary storage device that satisfies the request; determining whether the identified requested data includes certain data that is also part of the search-based secondary copy; and when the identified requested data includes the certain data, restoring the search-based secondary copy that includes the certain data in addition to restoring the identified requested data, wherein, by creating the search-based secondary copy and then later restoring the search-based secondary copy along with the identified requested data, the method assists in reducing restoration requests because the method anticipates data that may be associated with a later restoration request, wherein the method restores the identified requested data along with data deemed to be associated with the identified requested data based on a similarity of content determined by a previous search for content within the data store, wherein the request to restore data is a request to restore Microsoft Exchange email data from a certain day via a Microsoft Exchange data agent, wherein the method identifies other data in the search-based secondary copy whose content is similar to content within the email data for the certain day, and restores the other data from the search-based secondary copy, and wherein the similarity is based on sender names or keywords from subject lines in the email data, so that the method recovers data from the search-based secondary copy that may supplement or augment the identified requested data from the request for restoring data. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A non-transitory computer-readable medium having instructions which, when executed by a computing device, cause the computing device to execute a computer-implemented method for creating a copy of data stored in secondary storage media, the method comprising:
-
receiving, by the computing device, information associated with a search for data stored in the secondary storage media and not stored in a primary storage device of the computing device, wherein the data stored in the secondary storage media is one or more backup copies of production data generated by at least one client device that is remotely located from the secondary storage media, and wherein the secondary storage media is distinct from the computing device; identifying data within the secondary storage media that satisfies one or more criteria of the search; determining additional criteria that is associated with and different than the one or more criteria of the search, wherein the additional criteria is based on a business entity relationship between the additional criteria and the one or more search criteria; identifying additional data that satisfies the determined additional criteria, wherein the additional criteria include information about subsidiary business entities that are in a subsidiary relationship with respect to one or more business entities identified in the set of data; generating a search-based secondary copy of at least two different copies of the data, including; a copy of the data that satisfies the criteria of the search; and a copy of the additional data that satisfies the additional criteria, wherein the search-based secondary copy is created from backup copies of production data stored on the secondary storage media; transferring the generated search-based secondary copy to the secondary storage media; receiving a request for restoring data; identifying requested data in the secondary storage media that satisfies the request; determining whether the identified requested data includes certain data that is also part of the search-based secondary copy; and when the identified requested data includes the certain data, restoring the search-based secondary copy that includes the certain data in addition to restoring the identified requested data, wherein, by creating the search-based secondary copy and then later restoring the search-based secondary copy along with the identified requested data, the method assists in reducing restoration requests because the method anticipates data that may be associated with a later restoration request, wherein the method restores the identified requested data along with data deemed to be associated with the identified requested data based on a similarity of content determined by a previous search for content within the data store, wherein the request to restore data is a request to restore Microsoft Exchange email data from a certain day via a Microsoft Exchange data agent, wherein the method identifies other data in the search-based secondary copy whose content is similar to content within the email data for the certain day, and restores the other data from the search-based secondary copy, and wherein the similarity is based on sender names or keywords from subject lines in the email data, so that the method recovers data from the search-based secondary copy that may supplement or augment the identified requested data from the request for restoring data. - View Dependent Claims (7, 8, 9, 10, 11, 12, 13)
-
-
14. A computer-implemented method for creating a copy of data stored in secondary storage media, the method comprising:
-
receiving, by the computing device, information associated with a search for data stored in the secondary storage media and not stored in a primary storage device of the computing device, wherein the data stored in the secondary storage media is one or more backup copies of production data generated by at least one client device that is remotely located from the secondary storage media; identifying a first set of data within the secondary storage media that satisfies one or more criteria of the search; determining additional criteria that is associated with and different than the one or more criteria of the search, wherein the additional criteria is based on a business entity relationship between the additional criteria and the one or more search criteria; identifying a second set of data that satisfies a search with the determined additional criteria, wherein the additional criteria include information about subsidiary business entities that are subsidiary with respect to one or more business entities identified in the set of data; generating a search-based secondary copy of at least two different data sets, including the first set of data that satisfies the one or more criteria of the search and the second set of data that satisfies the search with the determined additional criteria, wherein the search-based secondary copy is created from backup copies of production data stored on the secondary storage media; transferring the generated secondary copy to the secondary storage media; receiving a request for restoring data; identifying requested data in the secondary storage media that satisfies the request; determining whether the identified requested data includes certain data that is also part of the search-based secondary copy; and when the identified requested data includes the certain data, restoring the search-based secondary copy that includes the certain data in addition to restoring the identified requested data, wherein, by creating the search-based secondary copy and then later restoring the search-based secondary copy along with the identified requested data, the method assists in reducing restoration requests because the method anticipates data that may be associated with a later restoration request, wherein the method restores the identified requested data along with data deemed to be associated with the identified requested data based on a similarity of content determined by a previous search for content within the data store, wherein the request to restore data is a request to restore Microsoft Exchange email data from a certain day via a Microsoft Exchange data agent, wherein the method identifies other data in the search-based secondary copy whose content is similar to content within the email data for the certain day, and restores the other data from the search-based secondary copy, and wherein the similarity is based on sender names or keywords from subject lines in the email data, so that the method recovers data from the search-based secondary copy that may supplement or augment the identified requested data from the request for restoring data. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
Specification