Apparatus, system and method for data collection, import and modeling
First Claim
Patent Images
1. A method of data analysis for a backup system, the method comprising:
- generating a dump file for each of a plurality of backup servers, each dump file comprising configuration and state information about each of the plurality of backup servers in a native format used by each of the plurality of backup servers on which data is stored, wherein the backup servers backup the data from a primary storage layer to a common media layer;
extracting a first predetermined configuration and state information from the respective dump files of the plurality of backup servers, the first predetermined configuration and state information being in different formats based on the dump file from which it was extracted;
translating the first predetermined configuration and state information from the format used by each of the plurality of backup servers into a normalized format, wherein the translated first configuration and state information comprises configuration and state information irrespective of which of the plurality of backup servers from which it was generated;
storing the translated first configuration and state information in a single database;
generating a dump file for each of a plurality of different computer systems of the primary storage layer, each dump file comprising configuration and state information about each of the plurality of computer systems in a format used by each of the plurality of computer systems on which the data is stored, wherein the plurality of computer systems include server computers, desktop computers, and laptop computers which are physically located across various sites and use hardware and software from different vendors;
extracting a second predetermined configuration and state information from the respective dump files of the plurality of different computer systems, the second predetermined configuration and state information being in different formats based on the dump file from which it was extracted;
translating the second predetermined configuration and state information from the format used by each of the plurality of computer systems into a normalized format, wherein the translated second configuration and state information comprises configuration and state information irrespective of which of the plurality of computer systems from which it was generated; and
storing the translated second configuration and state information in the single database;
determining what components are in the backup system, how the backup system works, how data is stored in the backup system, how efficiently data is stored in the backup system, a total capacity of the backup system, a remaining capacity of the backup system, and an operating cost of the backup system by analyzing the normalized first and second configuration and state information stored in the single database.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of and system for data analysis, including interrogating a plurality of computer systems to generate for each computer system at least one dump file, each dump file including configuration and state information; extracting predetermined configuration and state information from the respective dump files; and storing the extracted configuration and state information in a database in a normalized format.
-
Citations
19 Claims
-
1. A method of data analysis for a backup system, the method comprising:
-
generating a dump file for each of a plurality of backup servers, each dump file comprising configuration and state information about each of the plurality of backup servers in a native format used by each of the plurality of backup servers on which data is stored, wherein the backup servers backup the data from a primary storage layer to a common media layer; extracting a first predetermined configuration and state information from the respective dump files of the plurality of backup servers, the first predetermined configuration and state information being in different formats based on the dump file from which it was extracted; translating the first predetermined configuration and state information from the format used by each of the plurality of backup servers into a normalized format, wherein the translated first configuration and state information comprises configuration and state information irrespective of which of the plurality of backup servers from which it was generated; storing the translated first configuration and state information in a single database; generating a dump file for each of a plurality of different computer systems of the primary storage layer, each dump file comprising configuration and state information about each of the plurality of computer systems in a format used by each of the plurality of computer systems on which the data is stored, wherein the plurality of computer systems include server computers, desktop computers, and laptop computers which are physically located across various sites and use hardware and software from different vendors; extracting a second predetermined configuration and state information from the respective dump files of the plurality of different computer systems, the second predetermined configuration and state information being in different formats based on the dump file from which it was extracted; translating the second predetermined configuration and state information from the format used by each of the plurality of computer systems into a normalized format, wherein the translated second configuration and state information comprises configuration and state information irrespective of which of the plurality of computer systems from which it was generated; and storing the translated second configuration and state information in the single database; determining what components are in the backup system, how the backup system works, how data is stored in the backup system, how efficiently data is stored in the backup system, a total capacity of the backup system, a remaining capacity of the backup system, and an operating cost of the backup system by analyzing the normalized first and second configuration and state information stored in the single database. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method of data analysis, the method comprising:
-
generating a dump file for each of a plurality of different computer systems, each dump file comprising product specific configuration and state information about each of the plurality of computer systems in a native format used by each of the plurality of computer systems on which data is stored; extracting predetermined configuration and state information from the respective dump files of the plurality of different computer systems, the predetermined configuration and state information being in different native formats, wherein the predetermined configuration and state information comprises metadata which describes a physical storage location of each object and includes customer data, site data, source data, node data, object data, and fragment data, wherein the fragment data describes a physical location of each the object; translating the predetermined configuration and state information from the native format used by each of the plurality of different computer systems into a normalized format, wherein the normalized configuration and state information comprises a format irrespective of which of the plurality of different computer systems from which it was generated; and determining what components are in the backup system, how the backup system works, how data is stored in the backup system, how efficiently data is stored in the backup system, a total capacity of the backup system, a remaining capacity of the backup system, and an operating cost of the backup system by analyzing the normalized predetermined configuration and state information. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A method of data analysis, the method comprising:
-
generating a dump file for each of a plurality of different computer systems, each dump file comprising configuration and state information about each of the plurality of different computer systems in a native format used by each of the plurality of different computer systems on which data is stored; extracting predetermined configuration and state information from the respective dump files of the plurality of different computer systems, the predetermined configuration and state information being in different native formats based on the dump file from which it was extracted; translating the predetermined configuration and state information from the native format used by each of the plurality of different computer systems into a normalized format, wherein the translated configuration and state information comprises configuration and state information irrespective of which of the plurality of different computer systems from which it was generated; and determining what components are in the backup system, how the backup system works, how data is stored in the backup system, how efficiently data is stored in the backup system, a total capacity of the backup system, a remaining capacity of the backup system, and an operating cost of the backup system by analyzing the normalized predetermined configuration and state information. - View Dependent Claims (14, 15, 16, 17, 18, 19)
-
Specification