Scalable analysis platform for semi-structured data

  • US 9,613,068 B2
  • Filed: 03/14/2014
  • Issued: 04/04/2017
  • Est. Priority Date: 03/15/2013
  • Status: Active Grant
First Claim

1. A data transformation system comprising:

  • one or more computing devices comprising one or more hardware processors and memory and configured to implement:

    a schema inference module configured to dynamically create a cumulative schema for objects retrieved from a first data source, wherein:

    each of the retrieved objects includes (i) data and (ii) metadata describing the data; and

    dynamically creating the cumulative schema includes, for each object of the retrieved objects, (i) inferring a schema from the object and (ii) selectively updating the cumulative schema to describe the object according to the inferred schema;

    collect statistics on the data types of the retrieved objects; and

    based on the statistics on the data types, determine whether the data of the retrieved objects is typed correctly; and

    an export module configured to output the data of the retrieved objects to a data destination system according to the cumulative schema.
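
The steps recited in the claim (per-object schema inference, selective updating of a cumulative schema, type statistics, a typing check, and export) can be illustrated with a minimal Python sketch. This is not the patented implementation. All function names, the dotted-path representation of nested fields, and the "numeric string" heuristic for deciding whether data is typed correctly are assumptions made purely for illustration; the claim itself does not specify them.

```python
from collections import Counter, defaultdict


def type_name(value):
    """Map a Python value to a JSON-style type name."""
    if value is None:
        return "null"
    if isinstance(value, bool):  # bool must be tested before int
        return "boolean"
    if isinstance(value, (int, float)):
        return "number"
    if isinstance(value, str):
        return "string"
    if isinstance(value, list):
        return "array"
    return "object"


def flatten(obj, prefix=""):
    """Yield (path, value) pairs, descending into nested objects."""
    for key, value in obj.items():
        path = f"{prefix}{key}"
        if isinstance(value, dict):
            yield from flatten(value, path + ".")
        else:
            yield path, value


def infer_schema(obj):
    """Infer a schema from a single object: field path -> type name."""
    return {path: type_name(value) for path, value in flatten(obj)}


def update_cumulative(cumulative, inferred):
    """Selectively update: add only (path, type) pairs not yet described."""
    changed = False
    for path, tname in inferred.items():
        types = cumulative.setdefault(path, set())
        if tname not in types:
            types.add(tname)
            changed = True
    return changed


def looks_numeric(text):
    """Hypothetical heuristic: does this string parse as a number?"""
    try:
        float(text)
        return True
    except ValueError:
        return False


def process(objects):
    """Build the cumulative schema and type statistics over all objects."""
    cumulative = {}
    stats = defaultdict(Counter)
    for obj in objects:
        update_cumulative(cumulative, infer_schema(obj))
        for path, value in flatten(obj):
            tname = type_name(value)
            stats[path][tname] += 1
            if tname == "string" and looks_numeric(value):
                stats[path]["numeric_string"] += 1
    # Assumed rule: a field is suspect when every string seen parses as a number.
    suspect = sorted(
        path for path, counts in stats.items()
        if counts["string"] > 0 and counts["numeric_string"] == counts["string"]
    )
    return cumulative, stats, suspect


def export_rows(objects, cumulative):
    """Output each object as a row with one column per cumulative-schema path."""
    columns = sorted(cumulative)
    rows = []
    for obj in objects:
        flat = dict(flatten(obj))
        rows.append({col: flat.get(col) for col in columns})
    return rows
```

In this sketch, `process` returns the cumulative schema, the per-field type counts, and a list of fields whose string values all look numeric, which is one possible reading of "typed correctly". `export_rows` then emits every object against the full column set, filling absent fields with `None`.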
