Detection and quantifying of data redundancy in column-oriented in-memory databases
First Claim
1. A computer-implemented method for quantifying a redundancy of data stored in tables of a column-oriented in-memory database, the method being executed using one or more processors and comprising:
- determining, by the one or more processors, data structures for a plurality of data objects stored in the column-oriented in-memory database through an integration of distributed enterprise systems, each data structure defining one or more attributes, one attribute being a primary key of a respective data object;
for each primary key and table pair in a set of primary key and table pairs, determining, by the one or more processors, an aggregate severity sub-score based on one or more values of the primary key in the table, the primary key being included in a set of primary keys and the table being included in a set of tables;
determining, by the one or more processors, an aggregate severity score for each primary key in the set of primary keys based on aggregate severity sub-scores associated with the primary key to provide a plurality of aggregate severity scores, each aggregate severity score indicating a relative redundancy of values of the primary key across all tables in the set of tables and each severity sub-score being determined based on a number of occurrences of the values of the primary key across all the tables and an attribute weight that is inversely proportional to an association grade for the primary key relative to the primary table;
providing, by the one or more processors, a list of aggregate severity scores and corresponding primary keys for display to a user; and
performing, by the one or more processors, at least one operation to reduce the relative redundancy associated with the primary key of the set of primary keys based on the list of aggregate severity scores.
1 Assignment
0 Petitions
Accused Products
Abstract
Methods, systems, and computer-readable storage media for quantifying a redundancy of data stored in tables of a database. In some implementations, actions include, for each primary key and table pair in a set of primary key and table pairs, determining an aggregate severity sub-score based on one or more values of the primary key in the table, the primary key being included in a set of primary keys and the table being included in a set of tables, determining an aggregate severity score for each primary key in the set of primary keys based on aggregate severity sub-scores associated with the primary key to provide a plurality of aggregate severity scores, each aggregate severity score indicating a relative redundancy of values of the primary key across all tables in the set of tables, and providing a list of aggregate severity scores and corresponding primary keys for display to a user.
36 Citations
20 Claims
-
1. A computer-implemented method for quantifying a redundancy of data stored in tables of a column-oriented in-memory database, the method being executed using one or more processors and comprising:
-
determining, by the one or more processors, data structures for a plurality of data objects stored in the column-oriented in-memory database through an integration of distributed enterprise systems, each data structure defining one or more attributes, one attribute being a primary key of a respective data object; for each primary key and table pair in a set of primary key and table pairs, determining, by the one or more processors, an aggregate severity sub-score based on one or more values of the primary key in the table, the primary key being included in a set of primary keys and the table being included in a set of tables; determining, by the one or more processors, an aggregate severity score for each primary key in the set of primary keys based on aggregate severity sub-scores associated with the primary key to provide a plurality of aggregate severity scores, each aggregate severity score indicating a relative redundancy of values of the primary key across all tables in the set of tables and each severity sub-score being determined based on a number of occurrences of the values of the primary key across all the tables and an attribute weight that is inversely proportional to an association grade for the primary key relative to the primary table; providing, by the one or more processors, a list of aggregate severity scores and corresponding primary keys for display to a user; and performing, by the one or more processors, at least one operation to reduce the relative redundancy associated with the primary key of the set of primary keys based on the list of aggregate severity scores. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A non-transitory computer-readable storage medium coupled to one or more processors and having instructions stored thereon which, when executed by the one or more processors, cause the one or more processors to perform operations for quantifying a redundancy of data stored in tables of a column-oriented in-memory database, the operations comprising:
-
determining data structures for a plurality of data objects stored in the column-oriented in-memory database through an integration of distributed enterprise systems, each data structure defining one or more attributes, one attribute being a primary key of a respective data object; for each primary key and table pair in a set of primary key and table pairs, determining an aggregate severity sub-score based on one or more values of the primary key in the table, the primary key being included in a set of primary keys and the table being included in a set of tables; determining an aggregate severity score for each primary key in the set of primary keys based on aggregate severity sub-scores associated with the primary key to provide a plurality of aggregate severity scores, each aggregate severity score indicating a relative redundancy of values of the primary key across all tables in the set of tables and each severity sub-score being determined based on a number of occurrences of the values of the primary key across all the tables and an attribute weight that is inversely proportional to an association grade for the primary key relative to the primary table; providing a list of aggregate severity scores and corresponding primary keys for display to a user; and performing at least one operation to reduce the relative redundancy associated with the primary key of the set of primary keys based on the list of aggregate severity scores. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A system, comprising:
-
a computing device; and a computer-readable storage device coupled to the computing device and having instructions stored thereon which, when executed by the computing device, cause the computing device to perform operations for quantifying a redundancy of data stored in tables of a column-oriented in-memory database, the operations comprising; determining data structures for a plurality of data objects stored in the column-oriented in-memory database through an integration of distributed enterprise systems, each data structure defining one or more attributes, one attribute being a primary key of a respective data object; for each primary key and table pair in a set of primary key and table pairs, determining an aggregate severity sub-score based on one or more values of the primary key in the table, the primary key being included in a set of primary keys and the table being included in a set of tables; determining an aggregate severity score for each primary key in the set of primary keys based on aggregate severity sub-scores associated with the primary key to provide a plurality of aggregate severity scores, each aggregate severity score indicating a relative redundancy of values of the primary key across all tables in the set of tables and each severity sub-score being determined based on a number of occurrences of the values of the primary key across all the tables and an attribute weight that is inversely proportional to an association grade for the primary key relative to the primary table; providing a list of aggregate severity scores and corresponding primary keys for display to a user; and performing at least one operation to reduce the relative redundancy associated with the primary key of the set of primary keys based on the list of aggregate severity scores. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification