Systems and methods for general aggregation of characteristics and key figures

US 7,596,520 B2
Filed: 09/30/2005
Issued: 09/29/2009
Est. Priority Date: 09/30/2004
Status: Active Grant

First Claim

Patent Images

1. A computer-implemented method for automated generic and parallel aggregation of characteristics and key figures of data associated with financial institutions and with financial affairs in banking practice, the method comprising:

receiving, at a data processing system, mass data from a single database of a single data source or from different databases of different data sources, the mass data comprising a plurality of records, each record being associated with one or more granularity characteristics and one or more key figures;

selecting, according to a customer-defined aggregation, one or more of the granularity characteristics of the received mass data, one or more of the key figures of the received mass data, and an aggregation operation associated with each of the one or more key figures;

generating a plurality of data packages from the received mass data, each data package comprising a plurality of records, the plurality of records of each data package being smaller than the plurality of records of the received mass data;

processing, using one or more processors of the data processing system, the data packages to reduce a number of records in each of the data packages according to the customer-defined aggregation, wherein the processing comprises;

identifying one or more granularity levels, each of the granularity levels being associated with one of the selected granularity characteristics, and the identified granularity levels defining an order of the selected granularity characteristics;

sorting the records of each data package according to the defined order of granularity characteristics; and

aggregating the sorted records of each data package for each of the selected key figures using the selected aggregation operations, the aggregation reducing the plurality of records of each data package;

splitting each of the aggregated data packages into one or more sub data packages, wherein each sub data package of an aggregated data package comprises fewer records than the aggregated data package; and

identifying adjacent sub data packages by comparing, for each sub data package, a key of a first record of the sub data package with a key of a first record and a key of a last record of each of the other sub data packages, the identifying comprising computing a termination criterion;

key _pos1,x∈

(key _pos1,y;

key _{pos max;

y}),wherein pos1 represents a first position of a sub data package, posmax represents a last position of a sub data package, and x and y represent numbers of sub data packages, wherein adjacent sub data packages are sub data packages having keys of the first record that are closest together and have violated the termination criterion; and

saving, to a memory of the data processing system, the processed records, wherein the stored records comprise fewer records than the received mass data at the customer-defined granularity.

View all claims

2 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

Computer-implemented methods, computer systems, and computer programs product are provided for automated generic and parallel aggregation of characteristics and key figures of unsorted mass data being of specific economic interest, particularly associated with financial institutions, and with financial affairs in banking practice. The parallel aggregation may reduce the amount of data for a customer defined granularity for the purpose of facilitating the handling of raw data related to all areas of credit risk management in banking practice. Moreover, the computing power of software and the software performance run time, respectively, may be improved in the case of mass data.

6 Citations

View as Search Results

23 Claims

1. A computer-implemented method for automated generic and parallel aggregation of characteristics and key figures of data associated with financial institutions and with financial affairs in banking practice, the method comprising:
- receiving, at a data processing system, mass data from a single database of a single data source or from different databases of different data sources, the mass data comprising a plurality of records, each record being associated with one or more granularity characteristics and one or more key figures;
  
  selecting, according to a customer-defined aggregation, one or more of the granularity characteristics of the received mass data, one or more of the key figures of the received mass data, and an aggregation operation associated with each of the one or more key figures;
  
  generating a plurality of data packages from the received mass data, each data package comprising a plurality of records, the plurality of records of each data package being smaller than the plurality of records of the received mass data;
  
  processing, using one or more processors of the data processing system, the data packages to reduce a number of records in each of the data packages according to the customer-defined aggregation, wherein the processing comprises;
  
  identifying one or more granularity levels, each of the granularity levels being associated with one of the selected granularity characteristics, and the identified granularity levels defining an order of the selected granularity characteristics;
  
  sorting the records of each data package according to the defined order of granularity characteristics; and
  
  aggregating the sorted records of each data package for each of the selected key figures using the selected aggregation operations, the aggregation reducing the plurality of records of each data package;
  
  splitting each of the aggregated data packages into one or more sub data packages, wherein each sub data package of an aggregated data package comprises fewer records than the aggregated data package; and
  
  identifying adjacent sub data packages by comparing, for each sub data package, a key of a first record of the sub data package with a key of a first record and a key of a last record of each of the other sub data packages, the identifying comprising computing a termination criterion;
  
  key _pos1,x∈
  
  (key _pos1,y;
  
  key _{pos max;
  
  y}),wherein pos1 represents a first position of a sub data package, posmax represents a last position of a sub data package, and x and y represent numbers of sub data packages, wherein adjacent sub data packages are sub data packages having keys of the first record that are closest together and have violated the termination criterion; and
  
  saving, to a memory of the data processing system, the processed records, wherein the stored records comprise fewer records than the received mass data at the customer-defined granularity.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 18, 19)
- - 2. The method of claim 1, wherein selecting comprises:
    - selecting the granularity characteristics from at least one of;
      
      (i) one or more predetermined granularity characteristics of the received mass data and (ii) one or more customer-defined granularity characteristics; and
      
      selecting the key figures from at least one of;
      
      (i) one or more predetermined key figures associated with the received mass data and (ii) one or more customer-defined key figures.
  - 3. The method of claim 1, wherein selecting comprises:
    - selecting the aggregation operation from one or more predetermined aggregation operations of a function pool and one or more costumer defined aggregation operations.
  - 4. The method of claim 1, further comprising:
    - enriching the generated data packages through parallel pre-processing using one or more secondary data sources.
  - 5. The method of claim 1, further comprising:
    - enriching the aggregated data packages through parallel post-processing using one or more secondary data sources.
  - 6. The method of claim 1, wherein processing further comprises processing the data packages in one or more jobs, each of the jobs comprising a plurality of the data packages.
  - 7. The method of claim 6, wherein one job or a plurality of jobs are processed in a parallel processing mode using a single processor.
  - 8. The method of claim 6, wherein one job or a plurality of jobs are processed in a parallel processing mode using a network of processors.
  - 9. The method of claim 8, wherein the network of data processors is a Local Area Network (LAN), Wide Area Network (WAN), intranet or internet.
  - 10. The method of claim 7, wherein processing further comprises aggregating the data packages of the job or jobs sequentially.
  - 11. The method of claim 8, wherein processing further comprises aggregating the data packages of the job or jobs sequentially.
  - 18. The method of claim 1, wherein processing further comprises, when adjacent sub data packages are identified:
    - merging the identified adjacent sub data packages to generate one or more merged data packages; and
      
      processing each of the merged data packages to reduce a number of records in each of the merged data packages according to the customer-defined aggregation.
  - 19. The method of claim 1, wherein the identifying, sorting, and aggregating are performed in parallel across the one or more processors of the data processing system.

12. A computer system configured to perform automated generic and parallel aggregation of characteristics and key figures of data associated with financial institutions and with financial affairs in banking practice, comprising:
- a module configured to receive mass data from a single database of a single data source or from different databases of different data sources, the mass data comprising a plurality of records, each record being associated with one or more granularity characteristics and one or more key figures;
  
  a module configured to select, according to a customer-defined aggregation, one or more of the granularity characteristics of the received mass data, one or more of the key figures of the received mass data, and an aggregation operation associated with each of the one or more key figures;
  
  a module configured to generate a plurality of data packages from the received mass data, each data package comprising a plurality of records, the plurality of records of each data packages being smaller than the plurality of records of the received mass data;
  
  one or more processors configured to process the data packages to reduce a number of records in each of the data packages according to the customer-defined aggregation, wherein the one or more processors are further configured to;
  
  identify one or more granularity levels, each of the granularity level being associated with one of the selected granularity characteristics, and the identified granularity levels defining an order of the selected granularity characteristics;
  
  sort the records of each data package according to the defined order of granularity characteristics;
  
  aggregate the sorted records of each data package for each of the selected key figures using the selected aggregation operations, the aggregation reducing the plurality of records of each data package;
  
  split each of the aggregated data packages into one or more sub data Packages, wherein each sub data package of an aggregated data package comprises fewer records than the aggregated data package; and
  
  identify adjacent sub data packages by comparing, for each sub data package, a key of a first record of the sub data package with a key of a first record and a key of a last record of each of the other sub data packages, the identifying comprising computing a termination criterion;
  
  key _{pos1, x}∈
  
  (key _posi,y;
  
  key _{pos max;
  
  y}),wherein pos1 represents a first position of a sub data package, posmax represents a last position of a sub data package, and x and y represent numbers of sub data packages, wherein adjacent sub data packages are sub data packages having keys of the first record that are closest together and have violated the termination criterion; and
  
  a memory configured to store the processed records, Wherein the stored records comprise fewer records than the received mass data at the customer-defined granularity.
- View Dependent Claims (13, 14, 20, 21)
- - 13. The computer system of claim 12, wherein the one or more processors are further configured to enrich the generated data packages through parallel pre-processing using one or more secondary data.
  - 14. The computer system of claim 12, wherein the one or more processors are further configured to enrich the aggregated data packages through parallel post-processing using one or more secondary data sources.
  - 20. The computer system of claim 12, wherein when adjacent sub data packages are identified, the one or more processors are further configured to:
    - merge the identified adjacent sub data packages to generate one or more merged data packages; and
      
      process each of the merged data packages to reduce a number of records in each of the merged data packages according to the customer-defined aggregation.
  - 21. The computer system of claim 12, wherein the one or more processors are configured to identify, store, and aggregate in parallel.

15. A computer readable medium comprising a plurality of instructions that, when executed by a processor, perform a method for automated generic and parallel aggregation of characteristics and key figures of data associated with financial institutions and with financial affairs in banking practice, the method comprising:
- receiving mass data from a single database of a single data source or from different databases of different data sources, the mass data comprising a plurality of records, each record being associated with one or more granularity characteristics and one or more key figures;
  
  selecting, according to a customer-defined aggregation, one or more of the granularity characteristics of the received mass data, one or more of the key figures of the received mass data, and an aggregation operation associated with each of the one or more key figures;
  
  generating a plurality of data packages from the received mass data, each data package comprising a plurality of records, the plurality of records of each data packages being smaller than the plurality of records of the received mass data;
  
  processing the data packages to reduce a number of records in each of the data packages according to the customer-defined aggregation, wherein the processing comprises;
  
  identifying one or more granularity levels, each of the granularity levels being associated with one of the selected granularity characteristics, and the identified granularity levels defining an order of the selected granularity characteristics;
  
  sorting the records of each data package according to the defined order of granularity characteristics; and
  
  aggregating the sorted records of each data package for each of the selected key figures using the selected aggregation operations, the aggregation reducing the plurality of records of each data package;
  
  splitting each of the aggregated data packages into one or more sub data packages, wherein each sub data package of an aggregated data package comprises fewer records than the aggregated data package; and
  
  identifying adjacent sub data packages by comparing, for each sub data package. a key of a first record of the sub data package with a key of a first record and a key of a last record of each of the other sub data packages, theidentifying comprising computing a termination criterion;
  
  key pos1 ,x∈
  
  (key _posi,y;
  
  key _{pos max;
  
  y}),wherein pos1 represents a first position of a sub data package, posmax represents a last position of a sub data package. and x and y represent numbers of sub data packages, wherein adjacent sub data packages are sub data packages having keys of the first record that are closest together and have violated the termination criterion; and
  
  saving the processed records, wherein the stored records comprise fewer records than the received mass data at the customer-defined granularity.
- View Dependent Claims (16, 17, 22, 23)
- - 16. The computer readable medium of claim 15, the method further comprising:
    - enriching the generated data packages through parallel pre-processing using one or more secondary data sources.
  - 17. The computer readable medium of claim 15 the method further comprising:
    - enriching the aggregated data packages through parallel post-processing using one or more secondary data sources.
  - 22. The computer readable medium of claim 15, wherein processing further comprises, when adjacent sub data packages are identified:
    - merging the identified adjacent sub data packages to generate one or more merged data packages; and
      
      processing each of the merged data packages to reduce a number of records in each of the merged data packages according to the customer-defined aggregation.
  - 23. The method of claim 15, wherein the identifying, sorting, and aggregating are performed in parallel.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
SAP SE
Original Assignee
SAP AG (SAP SE)
Inventors
Baumann, Marcus, Kahn, Markus
Primary Examiner(s)
Trammell; James P
Assistant Examiner(s)
Badii; Behrang

Application Number

US11/239,139
Publication Number

US 20060069632A1
Time in Patent Office

1,460 Days
Field of Search

725/112, 725/87, 707/102, 707/5, 707/10, 707/E17.014, 705/35, 705/30, 705/37, 709/223
US Class Current

705/35
CPC Class Codes

G06Q 20/108   Remote banking, e.g. home b...

G06Q 40/00   Finance; Insurance; Tax str...

G06Q 40/02   Banking, e.g. interest calc...

G06Q 40/03   Credit; Loans; Processing t...

G06Q 40/06   Asset management; Financial...

Systems and methods for general aggregation of characteristics and key figures

First Claim

2 Assignments

0 Petitions

Accused Products

Abstract

6 Citations

23 Claims

Specification

Use Cases

Quick Links

Others

Systems and methods for general aggregation of characteristics and key figures

First Claim

2 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

6 Citations

23 Claims

Specification

Subscription Required

Use Cases

Quick Links

Others