Implementation of semi-structured data as a first-class database element

  • US 10,108,686 B2
  • Filed: 10/20/2014
  • Issued: 10/23/2018
  • Est. Priority Date: 02/19/2014
  • Status: Active Grant
First Claim

1. A method for storing semi-structured data comprising:

  • receiving semi-structured data elements from a data source;

    performing statistical analysis on collections of the semi-structured data elements as they are added to the database;

    identifying common data elements from within the semi-structured data;

    assigning the common data elements from within the semi-structured data as first class data and as lesser class data dependent on a threshold of commonality, wherein the threshold of commonality is based on how many times the data element appears in the semi-structured data;

    extracting the common data elements from the data source and storing the common data elements separately in columnar format;

    storing the first class data in cache memory in pseudo columns and making metadata and statistics corresponding to the pseudo-columns of the first class data elements available to a computer based query generator;

    re-identifying common data elements within the semi-structured data and assigning additional common data elements as first class data and saving the additional data elements in cache memory;

    reconstructing semi-structured data back to an original form by combining the first class data elements and the lesser class data elements and the non-common data;

    storing lesser class data in pseudo columns on disk storage; and

    storing non-common semi-structured data elements in an overflow serialized column.
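
The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: it assumes flat JSON-like records, counts how many records contain each top-level key, and uses two hypothetical thresholds (`first_min`, `lesser_min`) to split keys into first class, lesser class, and non-common. Plain Python dictionaries stand in for the cache-resident and disk-resident pseudo-columns, and all function names are invented for illustration.

```python
import json
from collections import Counter

# Sentinel marking "this record had no value for this key" (hypothetical).
MISSING = object()


def classify_keys(docs, first_min, lesser_min):
    """Split keys into first class and lesser class by how often they appear."""
    counts = Counter(key for doc in docs for key in doc)
    first = {k for k, n in counts.items() if n >= first_min}
    lesser = {k for k, n in counts.items() if lesser_min <= n < first_min}
    return first, lesser


def shred(docs, first, lesser):
    """Store common keys column-wise; serialize everything else as overflow."""
    first_cols = {k: [] for k in first}    # stand-in for pseudo-columns in cache
    lesser_cols = {k: [] for k in lesser}  # stand-in for pseudo-columns on disk
    overflow = []                          # one serialized blob per record
    for doc in docs:
        for k in first:
            first_cols[k].append(doc.get(k, MISSING))
        for k in lesser:
            lesser_cols[k].append(doc.get(k, MISSING))
        rest = {k: v for k, v in doc.items() if k not in first and k not in lesser}
        overflow.append(json.dumps(rest))
    return first_cols, lesser_cols, overflow


def reconstruct(i, first_cols, lesser_cols, overflow):
    """Rebuild record i from the two column stores and the overflow column."""
    doc = {}
    for cols in (first_cols, lesser_cols):
        for k, col in cols.items():
            if col[i] is not MISSING:
                doc[k] = col[i]
    doc.update(json.loads(overflow[i]))
    return doc


docs = [
    {"id": 1, "name": "a", "tag": "x"},
    {"id": 2, "name": "b"},
    {"id": 3, "tag": "y", "rare": {"n": 1}},
]
first, lesser = classify_keys(docs, first_min=3, lesser_min=2)
first_cols, lesser_cols, overflow = shred(docs, first, lesser)
rebuilt = [reconstruct(i, first_cols, lesser_cols, overflow) for i in range(len(docs))]
```

In this sketch the rebuilt records have the same keys and values as the inputs, although key order within a record may differ. Re-running `classify_keys` as more records arrive would correspond loosely to the re-identifying step, since a key that crosses `first_min` would then be promoted to first class.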

  • 2 Assignments