×

Transparent discovery of semi-structured data schema

  • US 9,842,152 B2
  • Filed: 10/20/2014
  • Issued: 12/12/2017
  • Est. Priority Date: 02/19/2014
  • Status: Active Grant
First Claim
Patent Images

1. A method for managing semi-structured data comprising:

  • receiving semi-structured data elements from a data source that is connected over a computer network;

    performing statistical analysis on collections of the semi-structured data elements as they are added to the database via a computer processor, wherein separate collections comprising portions of the semi-structured data are stored in separate files having different subsets of the semi-structured data elements that have been extracted;

    identifying common data elements from within the semi-structured data;

    combining common data elements from the data source into separate pseudo-columns;

    storing non-common semi-structured data elements in an overflow serialized column in computer memory; and

    deriving metadata corresponding to the pseudo-columns of the common data elements from the statistical analysis.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×