×

Star and snowflake schemas in extract, transform, load processes

  • US 9,298,787 B2
  • Filed: 11/09/2011
  • Issued: 03/29/2016
  • Est. Priority Date: 11/09/2011
  • Status: Expired due to Fees
First Claim
Patent Images

1. A computer program product for supporting star and snowflake data schemas for use with an Extract, Transform, Load (ETL) process, comprising:

  • a computer readable storage device having computer readable program code embodied therewith, the computer readable program code comprising computer readable program code configured to;

    select a data source comprising dimensional data, wherein the dimensional data comprises a plurality of dimensions and a plurality of source tables each comprising at least one source column;

    import a data model for the dimensional data into a data integration system, wherein the data model includes information pertaining to dimension tables and fact tables;

    analyze the imported data model to determine a target data schema for individual dimensions of the dimensional data, wherein the target data schema is a star data schema or a snowflake data schema, and wherein the target data schema comprises target dimensions and target facts, and wherein analyzing the imported data model includes;

    analyzing the imported data model and determining a quantity of dimension tables for the individual dimensions of the dimensional data;

    determining the star data schema as the target data schema and mapping a dimension of the dimensional data to the star data schema in response to the analyzing indicating the dimension contains a single dimension table; and

    determining the snowflake data schema as the target data schema and mapping a dimension of the dimensional data to the snowflake data schema in response to the analyzing indicating the dimension contains a plurality of dimension tables;

    wherein at least one dimension of the dimensional data is mapped to the star data schema and at least one other dimension of the dimensional data is mapped to the snowflake data schema;

    generate a meta-model representation by, for each target fact, mapping at least one source table or source column to the target fact, and for each target dimension, mapping at least one source table or source column to the target dimension;

    automatically convert the meta-model representation into one or more ETL jobs; and

    execute the one or more ETL jobs to extract the dimensional data from the data source and load the dimensional data into the determined target data schema for corresponding dimensions in a target data system.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×