Systems and methods for general aggregation of characteristics and key figures
First Claim
1. A computer-implemented method for automated generic and parallel aggregation of characteristics and key figures of data associated with financial institutions and with financial affairs in banking practice, the method comprising:
- receiving, at a data processing system, mass data from a single database of a single data source or from different databases of different data sources, the mass data comprising a plurality of records, the records being associated with granularity characteristics and key figures;
selecting, according to a customer-defined aggregation, granularity characteristics of the received mass data, key figures of the received mass data, and aggregation operations associated with the key figures;
generating a plurality of data packages from the received mass data, the data packages comprising a plurality of records, the plurality of records of the data packages being smaller than the plurality of records of the received mass data;
processing, using a processor of the data processing system, the data packages to reduce a number of records in the data packages according to the customer-defined aggregation, wherein the processing comprises;
identifying a granularity level associated with the selected granularity characteristics, the identified granularity levels defining an order of the selected granularity characteristics;
sorting the records of the data packages according to the defined order of granularity characteristics;
aggregating the sorted records of the data packages for the selected key figures using the selected aggregation operations, the aggregation reducing the records of the data packages; and
identifying adjacent data packages by comparing, for the aggregated data packages, a key of a first record of the aggregated data packages with a key of a first record and a key of a last record of the other aggregated data packages, the identifying comprising;
computing termination criteria for pairs of the aggregated data packages, the termination criteria having the form;
keypos1,xε
(keypos1,y;
keyposmax;
y),
wherein pos1 represents a first position of a data package, posmax represents a last position of a data package, and x and y represent numbers of data packages; and
identifying adjacent packages based on a violation of corresponding ones of the termination criteria, the adjacent data packages having first record keys that are closest together; and
saving, to a memory of the data processing system, the aggregated records of the data packages, wherein the stored records comprise fewer records than the received mass data at the customer-defined granularity.
1 Assignment
0 Petitions
Accused Products
Abstract
Computer-implemented methods, computer systems, and computer programs product are provided for automated generic and parallel aggregation of characteristics and key figures of unsorted mass data being of specific economic interest, particularly associated with financial institutions, and with financial affairs in banking practice. The parallel aggregation may reduce the amount of data for a customer defined granularity for the purpose of facilitating the handling of raw data related to all areas of credit risk management in banking practice. Moreover, the computing power of software and the software performance run time, respectively, may be improved in the case of mass data.
4 Citations
20 Claims
-
1. A computer-implemented method for automated generic and parallel aggregation of characteristics and key figures of data associated with financial institutions and with financial affairs in banking practice, the method comprising:
-
receiving, at a data processing system, mass data from a single database of a single data source or from different databases of different data sources, the mass data comprising a plurality of records, the records being associated with granularity characteristics and key figures; selecting, according to a customer-defined aggregation, granularity characteristics of the received mass data, key figures of the received mass data, and aggregation operations associated with the key figures; generating a plurality of data packages from the received mass data, the data packages comprising a plurality of records, the plurality of records of the data packages being smaller than the plurality of records of the received mass data; processing, using a processor of the data processing system, the data packages to reduce a number of records in the data packages according to the customer-defined aggregation, wherein the processing comprises; identifying a granularity level associated with the selected granularity characteristics, the identified granularity levels defining an order of the selected granularity characteristics; sorting the records of the data packages according to the defined order of granularity characteristics; aggregating the sorted records of the data packages for the selected key figures using the selected aggregation operations, the aggregation reducing the records of the data packages; and identifying adjacent data packages by comparing, for the aggregated data packages, a key of a first record of the aggregated data packages with a key of a first record and a key of a last record of the other aggregated data packages, the identifying comprising; computing termination criteria for pairs of the aggregated data packages, the termination criteria having the form;
keypos1,xε
(keypos1,y;
keyposmax;
y),
wherein pos1 represents a first position of a data package, posmax represents a last position of a data package, and x and y represent numbers of data packages; andidentifying adjacent packages based on a violation of corresponding ones of the termination criteria, the adjacent data packages having first record keys that are closest together; and saving, to a memory of the data processing system, the aggregated records of the data packages, wherein the stored records comprise fewer records than the received mass data at the customer-defined granularity. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer system configured to perform automated generic and parallel aggregation of characteristics and key figures of data associated with financial institutions and with financial affairs in banking practice, comprising:
-
a module configured to receive mass data from a single database of a single data source or from different databases of different data sources, the mass data comprising a plurality of records, the records being associated with granularity characteristics and key figures; a module configured to select, according to a customer-defined aggregation, granularity characteristics of the received mass data, key figures of the received mass data, and aggregation operations associated with the key figures; a module configured to generate a plurality of data packages from the received mass data, the data packages comprising a plurality of records, the plurality of records of the data packages being smaller than the plurality of records of the received mass data; a processor configured to process the data packages to reduce a number of records in the data packages according to the customer-defined aggregation, wherein the processor is further configured to; identify a granularity level associated with the selected granularity characteristics, and the identified granularity levels defining an order of the selected granularity characteristics; sort the records of the data packages according to the defined order of granularity characteristics; aggregate the sorted records of the data packages for the selected key figures using the selected aggregation operations, the aggregation reducing the records of the data packages; and identify adjacent data packages by comparing, for the aggregated data packages, a key of a first record of the aggregated data packages with a key of a first record and a key of a last record of the other aggregated data packages, the identifying comprising; computing termination criteria for pairs of the aggregated data packages, the termination criteria having the form;
keypos1,xε
(keypos1,y;
keyposmax;
y),
wherein pos1 represents a first position of a data package, posmax represents a last position of a data package, and x and y represent numbers of data packages; andidentifying adjacent packages based on a violation of corresponding ones of the termination criteria, the adjacent data packages having first record keys that are closest together; and a memory configured to store the aggregated records of the data packages, wherein the stored records comprise fewer records than the received mass data at the customer-defined granularity. - View Dependent Claims (12, 13, 14, 15)
-
-
16. A computer-readable storage medium comprising a plurality of instructions that, when executed by a processor, perform a method for automated generic and parallel aggregation of characteristics and key figures of data associated with financial institutions and with financial affairs in banking practice, the method comprising:
-
receiving mass data from a single database of a single data source or from different databases of different data sources, the mass data comprising a plurality of records, the records being associated with granularity characteristics and key figures; selecting, according to a customer-defined aggregation, granularity characteristics of the received mass data, key figures of the received mass data, and aggregation operations associated with the key figures; generating a plurality of data packages from the received mass data, the data packages comprising a plurality of records, the plurality of records of the data packages being smaller than the plurality of records of the received mass data; processing the data packages to reduce a number of records in the data packages according to the customer-defined aggregation, wherein the processing comprises; identifying a granularity level associated with the selected granularity characteristics, and the identified granularity levels defining an order of the selected granularity characteristics; sorting the records of the data packages according to the defined order of granularity characteristics; aggregating the sorted records of the data packages for the selected key figures using the selected aggregation operations, the aggregation reducing the records of the data packages; and identifying adjacent data packages by comparing, for the aggregated data packages, a key of a first record of the aggregated data packages with a key of a first record and a key of a last record of the other aggregated data packages, the identifying comprising; computing termination criteria for pairs of the aggregated data packages, the termination criteria having the form;
keypos1,xε
keypos1,y;
keyposmax;
y),
wherein pos1 represents a first position of a data package, posmax represents a last position of a data package, and x and y represent numbers of data packages; andidentifying adjacent packages based on a violation of corresponding ones of the termination criteria, the adjacent data packages having first record keys that are closest together; and saving the aggregated records of the data packages, wherein the stored records comprise fewer records than the received mass data at the customer-defined granularity. - View Dependent Claims (17, 18, 19, 20)
-
Specification