Coding network grouping data of same data type into blocks using file data structure and selecting compression for individual block base on block data type
First Claim
1. A computerized method of coding data having a plurality of data components, each data component structured into a data field of an associated data type according to an organized file format, comprising:
- from a source data structure of an organized file format, creating a plurality of blocks based on the source data structure, each block associated with a specific respective data field;
parsing each data component from the source data structure into one of the plurality of blocks based on the data field of the data component;
for each block;
selecting a compression algorithm from a plurality of candidate compression algorithms based on the data type of the data field associated with the block;
applying the selected compression algorithm to compress each data component in the block; and
combining the compressed data components from the plurality of blocks into an encoded data structure.
3 Assignments
0 Petitions
Accused Products
Abstract
A preferred coding network uses an architecture called a Base-Filter-Resource (BFR) system. This approach integrates the advantages of format-specific compression into a general-purpose compression tool serving a wide range of data formats. Source data is parsed into blocks of similar data and each parsed blocks are compressed using a respectively selected compression algorithm. The algorithm can be chosen from a static model of the data or can be adaptive to the data in the parsed block. The parsed blocks are then combined into an encoded data file. For decoding, the process is reversed.
144 Citations
29 Claims
-
1. A computerized method of coding data having a plurality of data components, each data component structured into a data field of an associated data type according to an organized file format, comprising:
-
from a source data structure of an organized file format, creating a plurality of blocks based on the source data structure, each block associated with a specific respective data field;
parsing each data component from the source data structure into one of the plurality of blocks based on the data field of the data component;
for each block;
selecting a compression algorithm from a plurality of candidate compression algorithms based on the data type of the data field associated with the block;
applying the selected compression algorithm to compress each data component in the block; and
combining the compressed data components from the plurality of blocks into an encoded data structure. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 21, 22, 23)
-
-
9. An article of manufacture, comprising:
-
a machine-readable medium;
a set of instructions recorded in the machine-readable medium to implement a data coding network having a plurality of data components, each data component structured into a data field of a respective data type according to an organized file format, comprising;
from a source data structure of an organized file format, creating a plurality of blocks based on the source data structure format, each block associated with a specific respective data field;
parsing each data component from the source data structure into one of the plurality of blocks based on the data field of the data component;
for each block;
selecting a compression algorithm from a plurality of candidate compression algorithms based on the data type of the data field associated with the block;
applying the selected compression algorithm to compress each data component in the block; and
combining the compressed data components from the plurality of blocks into an encoded data structure. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16, 24, 25, 26)
-
-
17. An apparatus to code data having a plurality of data components, each data component structured into a data field of a respective data type according to an organized file format, comprising:
-
a plurality of blocks based on and derived from a source data structure of an organized file format, each block associated with a specific respective data field;
a parser to parse each data component from the source data structure into one of the plurality of blocks based on the data field of the data component;
a selection system for selecting a compression algorithm for a block from a plurality of candidate compression algorithms based on the data type of the data field associated with the block;
a coder for applying the selected compression algorithm to compress each data component in the block and for combining the compressed data components from the plurality of blocks into an encoded data structure. - View Dependent Claims (18, 19, 20, 27, 28, 29)
-
Specification