×

Scalable analysis platform for semi-structured data

  • US 10,275,475 B2
  • Filed: 03/14/2014
  • Issued: 04/30/2019
  • Est. Priority Date: 03/15/2013
  • Status: Active Grant
First Claim
Patent Images

1. A method of operating a data analysis system, the method comprising:

  • retrieving objects from a data source, wherein each of the retrieved objects includes (i) data and (ii) metadata describing the data;

    dynamically updating a cumulative schema, wherein said dynamically updating comprises, for each object of the retrieved objects;

    (i) inferring a schema from the object based on the metadata of the object and inferred data types of elements of the data of the object, wherein, for at least one object of the objects, a structure of an inferred schema is different from another structure of another inferred schema for another object of the objects,(ii) creating a unified schema based at least on a portion of the inferred schema for the object, wherein the unified schema describes both (a) the object described by the inferred schema and (b) a cumulative set of objects described by the cumulative schema, and(iii) storing the unified schema as the cumulative schema;

    converting the cumulative schema into a relational schema; and

    exporting, according to the relational schema, the data of each of the retrieved objects to a data warehouse.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×