
Scalable analysis platform for semi-structured data

  • US 10,095,732 B2
  • Filed: 02/26/2014
  • Issued: 10/09/2018
  • Est. Priority Date: 12/23/2011
  • Status: Active Grant
First Claim

1. A method of operating a data analysis system, the method comprising:

  • retrieving objects from a semi-structured data source, wherein each of the retrieved objects includes (i) data and (ii) metadata describing the data;

    dynamically creating a cumulative schema wherein said dynamically creating comprises, for each object of the retrieved objects:

    (i) inferring a schema from the object based on the metadata of the object and inferred data types of elements of the data of the object,

    (ii) performing a union of a set of fields of the inferred schema and another set of fields of the cumulative schema to create a unified schema that describes both (a) the object described by the inferred schema and (b) a cumulative set of objects described by the cumulative schema, and

    (iii) storing the unified schema as the cumulative schema;

    converting fields in the cumulative schema into respective columns in a relational schema; and

    storing the data of each of the retrieved objects in a relational database according to the relational schema.
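
The steps recited in the claim can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes flat JSON-like objects with scalar values, treats the key names as the metadata, and uses Python type names as the inferred data types. The function names (`infer_schema`, `union_schemas`, `to_columns`, `load`), the one-column-per-field-and-type naming, and the use of SQLite as the relational database are all hypothetical choices made for illustration.

```python
import sqlite3

# Assumed mapping from inferred Python type names to SQLite column types.
SQL_TYPES = {"int": "INTEGER", "float": "REAL", "str": "TEXT", "bool": "INTEGER"}


def infer_schema(obj):
    """Infer a schema for one object: field name -> inferred type name."""
    return {key: type(value).__name__ for key, value in obj.items()}


def union_schemas(cumulative, inferred):
    """Union the inferred fields into the cumulative schema.

    The result maps each field to the set of every type seen for it so far,
    so it describes both the new object and all earlier objects.
    """
    unified = {field: set(types) for field, types in cumulative.items()}
    for field, type_name in inferred.items():
        unified.setdefault(field, set()).add(type_name)
    return unified


def to_columns(cumulative):
    """Convert each (field, type) pair into one relational column."""
    columns = {}
    for field in sorted(cumulative):
        for type_name in sorted(cumulative[field]):
            columns[f"{field}_{type_name}"] = SQL_TYPES.get(type_name, "TEXT")
    return columns


def load(objects):
    """Build the cumulative schema, create the table, and store every object."""
    cumulative = {}
    for obj in objects:
        # Steps (i)-(iii): infer, union, store the result as the new cumulative schema.
        cumulative = union_schemas(cumulative, infer_schema(obj))

    columns = to_columns(cumulative)
    conn = sqlite3.connect(":memory:")
    col_defs = ", ".join(f'"{name}" {sql_type}' for name, sql_type in columns.items())
    conn.execute(f"CREATE TABLE objects ({col_defs})")

    for obj in objects:
        if not obj:
            continue  # an empty object contributes no column values in this sketch
        row = {f"{key}_{type(value).__name__}": value for key, value in obj.items()}
        names = ", ".join(f'"{name}"' for name in row)
        marks = ", ".join("?" for _ in row)
        conn.execute(
            f"INSERT INTO objects ({names}) VALUES ({marks})", list(row.values())
        )
    return cumulative, columns, conn
```

For example, loading `{"id": 1, "name": "a"}` followed by `{"id": "x2", "score": 1.5}` yields a cumulative schema in which `id` carries both `int` and `str`, and the sketch maps that to two columns, `id_int` and `id_str`. Columns that do not apply to a given object are left NULL. Nested objects and arrays, and field names containing quote characters, are outside the scope of this sketch.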

  • 1 Assignment