×

System and method for storing stream data in distributed relational tables with data provenance

  • US 9,390,147 B2
  • Filed: 09/24/2012
  • Issued: 07/12/2016
  • Est. Priority Date: 09/23/2011
  • Status: Active Grant
First Claim
Patent Images

1. A method for storing and querying data, comprising the steps of:

  • representing data-elements in a hyper-table stored in a table-store, wherein the data-elements are allocated to data-blocks stored in the table-store, wherein the table-store is located on a distributed device, wherein the hyper-table comprises;

    (i) hyper-rows representing the data-elements allocated to the data-blocks, (ii) at least one hyper-column associated with an attribute of the corresponding data-elements, and (iii) hyper-cells having data-values of the corresponding data-elements;

    generating indices of the hyper-rows and the at least one hyper-column for the corresponding data-elements, wherein the indices are located on the distributed device and the data-values of the corresponding data-elements in the hyper-cells are capable of being retrieved based on the indices;

    storing a pre-determined time period on the distributed device;

    generating snapshots of the indices at each occurrence of the pre-determined time-period, wherein the snapshots comprise the indices as the indices existed at the occurrence of the pre-determined time-period;

    generating checkpoints of the hyper-table at each occurrence of the pre-determined time-period; and

    selecting one of a plurality of occurrences of the pre-determined time period and querying the snapshots of the indices and the checkpoints of the hyper-table to return query results based on the state of the indices and the hyper-table as of the selected occurrence of the pre-determined time-period,wherein said distributed device comprises a processor, a random-access memory, and a network interface connected to a network and is connected to a plurality of remote distributed devices, each of which comprises a processor, a random-access memory and a network interface connected to the network and is specially configured to store at least a portion of the data elements and at least a portion of the indices.

View all claims
  • 8 Assignments
Timeline View
Assignment View
    ×
    ×