System and method for detecting incongruous or incorrect media in a data recovery process
First Claim
1. A computer-implemented method for detecting incorrect media in a data recovery operation, utilizing a data processing system, comprising a processor and a storage device, the method comprising the steps of:
- compiling a master profile for identifying certain members of a set of media associated with the data recovery operation, wherein the step of compiling a master profile further comprises the steps of;
reading an item from a member of the set of media;
extracting one or more terms from a file path of the item;
determining whether the one or more terms are on a first list;
adding new terms that are not on the first list to the first list;
counting repeated terms that are on the first list;
determining whether corresponding counters of the repeated terms meet or exceed a threshold;
determining whether terms having corresponding counters that meet or exceed the threshold are on a second list; and
updating the second list accordingly;
generating individual profiles for certain potential members of the set of media;
updating a term total counter according to an amount of the one or more terms extracted from the file path;
determining whether the term total meets a predetermined goal; and
moving terms having corresponding counters that meet or exceed the threshold to a third list which defines the master profile;
comparing certain of said individual profiles with said master profile by utilizing said processor, to execute a) a first test for determining physical characteristics for said certain potential members of said set of media, b) a second test for a unique identifier accorded certain of said potential members of said set of media, and c) a third test for determining unique content indication associated with certain potential members of said set of media; and
based on said comparing step, validating or invalidating said certain potential members of said set of media.
20 Assignments
0 Petitions
Accused Products
Abstract
Disclosed are embodiments of a system and method for detecting wrong storage media in data recovery processes. One embodiment can detect incorrect media via environmental factors such as source data type and format. One embodiment can detect incongruous media by pool membership. One embodiment can detect inappropriate media by comparing fingerprints. As each item is read, its fingerprint is calculated and compared to the overall fingerprint of a job. If the new fingerprint is different by some preset margin, a human inspector may be asked to validate the newly inserted storage medium. If valid, the overall signature for the job is revised to integrate the new signature. If not, the wrong medium is ejected and incorrectly loaded data is removed. Embodiments disclosed herein can detect the presence of a wrong medium quickly and early in a data recovery process, avoiding mistakes and saving time and costs.
-
Citations
15 Claims
-
1. A computer-implemented method for detecting incorrect media in a data recovery operation, utilizing a data processing system, comprising a processor and a storage device, the method comprising the steps of:
-
compiling a master profile for identifying certain members of a set of media associated with the data recovery operation, wherein the step of compiling a master profile further comprises the steps of; reading an item from a member of the set of media; extracting one or more terms from a file path of the item; determining whether the one or more terms are on a first list; adding new terms that are not on the first list to the first list; counting repeated terms that are on the first list; determining whether corresponding counters of the repeated terms meet or exceed a threshold; determining whether terms having corresponding counters that meet or exceed the threshold are on a second list; and updating the second list accordingly; generating individual profiles for certain potential members of the set of media; updating a term total counter according to an amount of the one or more terms extracted from the file path; determining whether the term total meets a predetermined goal; and moving terms having corresponding counters that meet or exceed the threshold to a third list which defines the master profile; comparing certain of said individual profiles with said master profile by utilizing said processor, to execute a) a first test for determining physical characteristics for said certain potential members of said set of media, b) a second test for a unique identifier accorded certain of said potential members of said set of media, and c) a third test for determining unique content indication associated with certain potential members of said set of media; and based on said comparing step, validating or invalidating said certain potential members of said set of media. - View Dependent Claims (2, 3)
-
-
4. A non-transitory computer readable medium with software instructions for executing operations via a processor associated with the non-transitory computer readable medium, the operations for detecting incorrect media in a data recovery process, the operations comprising:
-
determining whether a first media identification and a second media identification match, wherein said first media identification is electronically stored and associated with a physical label identifying a storage medium and wherein said second media identification is associated with an electronic header of said storage medium; if said first media identification and said second media identification match, determining whether said storage medium is a member of a set of media associated with said data recovery operation; reading an item from said storage medium and scanning a file path of said item and extracting one or more terms from said file path; and compiling a master profile for identifying certain members of the set of media associated with the data recovery operation, wherein the step of compiling a master profile further comprises the steps of; reading an item from a member of the set of media; extracting one or more terms from a file path of the item; determining whether the one or more terms are on a first list; adding new terms that are not on the first list to the first list; counting repeated terms that are on the first list; determining whether corresponding counters of the repeated terms meet or exceed a threshold; determining whether terms having corresponding counters that meet or exceed the threshold are on a second list; and updating the second list accordingly; generating individual profiles for certain potential members of the set of media; updating a term total counter according to an amount of the one or more terms extracted from the file path; determining whether the term total meets a predetermined goal; and moving terms having corresponding counters that meet or exceed the threshold to a third list which defines the master profile. - View Dependent Claims (5, 6, 7, 8, 9)
-
-
10. A computer system configured to detect incorrect media in a data recovery operation in a native and non-native environment, comprising:
-
a) a plurality of storage media; b) a processor comprising a means for performing a first validation process, whereby the processor determines a computing environment for processing the plurality of storage media and performing an environmental check on each of the plurality of storage media, the processor further for validating or invalidating each of the plurality of storage media based on a result of the environmental check; c) the processor for each storage medium that passes the first validation process, comprising a means for performing a second validation process, whereby the means determines whether a first media identification and a second media identification match, wherein the first media identification is associated with a physical label identifying a storage medium and wherein the second media identification is associated with an electronic header of the storage medium and wherein, if the first media identification and the second media identification match, the processor determines whether the storage medium is a potential member of a set of media; d) the processor for each potential member of the set of media that passes the second validation process, comprising instructions for performing the following steps; compiling a master profile for identifying certain members of a set of media associated with the data operation, wherein the step of compiling a master profile further comprises the steps of; reading an item from a member of the set of media; extracting one or more terms from a file path of the item; determining whether the one or more terms are on a first list; adding new terms that are not on the first list to the first list; counting repeated terms that are on the first list; determining whether corresponding counters of the repeated terms meet or exceed a threshold; determining whether terms having corresponding counters that meet or exceed the threshold are on a second list; and updating the second list accordingly; generating individual profiles for certain potential members of the set of media; updating a term total counter according to an amount of the one or more terms extracted from the file path; determining whether the term total meets a predetermined goal; and moving terms having corresponding counters that meet or exceed the threshold to a third list, which defines the master profile; comparing certain of the individual profiles with the master profile to execute a) a first test for determining physical characteristics for the certain potential members of the set of media, b) a second test for a unique identifier accorded certain of the potential members of the set of media, and c) a third test for determining unique content indication associated with certain potential members of the set of media; and validating or invalidating the certain potential members of the set of media based on a comparison of the individual profiles with the master profile; and e) the processor configured to eliminate the incorrect data, after the third validation process, in the event it is determined based on application of additional factors. - View Dependent Claims (11, 12, 13, 14, 15)
-
Specification