×

Input gathering system and method for defining, refining or validating star schema for a source database

  • US 10,599,678 B2
  • Filed: 10/23/2015
  • Issued: 03/24/2020
  • Est. Priority Date: 10/23/2015
  • Status: Active Grant
First Claim
Patent Images

1. One or more non-transitory computer readable storage mediums storing one or more sequences of instructions, which when executed by one or more processors, causes automatic gathering of inputs for defining, refining or validating star schema for a source database, by performing the steps of:

  • linking the star schema to the source database through a communication network;

    automatically analyzing queries on the source database, for defining, refining or validating the star schema, based on a usage analysis technique to gather any of (i) entities and columns, (ii) entity keys, (iii) relationships between and within entities, (iv) measures, (v) workflow attributes, or (vi) an update frequency associated with the entities and the columns of the source database;

    automatically analyzing data in the source database, for defining, refining or validating the star schema, based on a data profiling technique to gather any of (i) entity keys, (ii) relationships between and within entities, (iii) measures, (iv) workflow attributes, and (v) specialized entities;

    automatically determining a first ranking of any of (i) columns, (ii) entities, (iii) measures, (iv) relationships between and within entities, (v) workflow attributes, (vi) entity keys of the source database based on any of (a) the usage analysis technique, and (b) the data profiling technique;

    automatically determining a second ranking for any of (i) columns, (ii) entities, (iii) measures, (iv) relationships between and within entities, (v) workflow attributes, (vi) entity keys of the source database by automatically gathering a ranking of any of (i) columns, (ii) entities, (iii) measures, (iv) relationships between and within entities, (v) workflow attributes, (vi) entity keys that are determined based on any of (a) the usage analysis technique, and (b) the data profiling technique; and

    compiling ranked inputs of the usage analysis and data profiling to define, refine, or validate the star schema for the source database,wherein the one or more sequences of instructions, which when executed by the one or more processors further causes (a) automatic determination of measures comprising quantifiable metrics by analyzing queries on the source database for expressions on numeric attributes in a projection list, and (b) automatic ranking of expressions in an ordered list of candidate measures for expressions that occur across distinct queries,wherein the automatic gathering of inputs for defining, refining or validating the star schema for the source database increases an analytics efficiency to identify changes in a source warehouse schema containing data gathered from a plurality of databases in order to output a computer-generated dimensional model containing the star schema, andwherein all automatic processes are performed using a computer structured query programming method.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×