×

Efficient column based data encoding for large-scale data storage

  • US 8,452,737 B2
  • Filed: 01/10/2012
  • Issued: 05/28/2013
  • Est. Priority Date: 07/31/2008
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method for encoding data to simultaneously compact and organize the data in a manner that facilitates efficient data access operations, including:

  • an act of a computer system, which includes at least one processing unit, organizing the data according to a set of column based sequences of values corresponding to different data fields of the data;

    an act of the computer system transforming the set of column based sequences of values to a set of column based integer sequences of values according to at least one encoding algorithm; and

    an act of the computer system compressing the set of column based integer sequences according to an iterative hybrid compression algorithm, wherein the iterative hybrid compression algorithm includes performing the following, in at least one iterative encoding step;

    an act of analyzing the set of column based integer sequences to determine which encoding technique from a plurality of encoding techniques to apply to compress the set of column based integer sequences by at least comparing a first computed bit savings of applying a first encoding technique against a second computed bit savings of applying a second encoding technique; and

    an act of applying the first or second encoding technique on at least a portion of the set of column based integer sequences based on the analysis.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×