Data management apparatus and method for efficiently generating a blocked transposed file and converting that file using a stored compression method
First Claim
1. A data management apparatus comprising:
- number of records in group determining means for virtual conversion by repeatedly reading at least one record from an input file having a plurality of records, each record including a plurality of fields, adding the at least one record to a buffer, and converting records in the buffer into a post-conversion block on a field-by-field basis until data size of post-conversion blocks for each kind of field of the records in the buffer exceeds a threshold size, wherein the number of records in group determining means determines the number of records of the post-conversion block by counting the records in the buffer, during a process immediately preceding a time when the data size exceeds the threshold size, and stores that number;
first conversion means for generating a first block by dividing at least one record including a plurality of fields into the fields and combining fields of like kind; and
second conversion means for converting the first block into a second block using a stored data conversion method, and for storing the second block, wherein the second conversion means selects one data compression algorithm that provides highest compression efficiency from a plurality of stored data compression algorithms in accordance with properties of the first block, and converts the first block into the second block using the selected data compression algorithm.
1 Assignment
0 Petitions
Accused Products
Abstract
A data management apparatus and method provide for a virtual conversion by repeatedly reading at least one record from an input file having a plurality of records, each record including a plurality of fields, and adding the at least one record to a buffer. The records in the buffer are then converted into a post-conversion block on a field-by-field basis until data size of post-conversion blocks for each kind of field of the records in the buffer exceeds a threshold size. The number of records in the post-conversion blocks can be determined by counting the records in the buffer during a process immediately preceding a time when the data size exceeds the threshold size, and storing that number. The conversion is completed by reading out records of the stored number of records from the input file, converting the records read out into post-conversion blocks on a field-by-field basis, and storing the post-conversion blocks.
-
Citations
8 Claims
-
1. A data management apparatus comprising:
-
number of records in group determining means for virtual conversion by repeatedly reading at least one record from an input file having a plurality of records, each record including a plurality of fields, adding the at least one record to a buffer, and converting records in the buffer into a post-conversion block on a field-by-field basis until data size of post-conversion blocks for each kind of field of the records in the buffer exceeds a threshold size, wherein the number of records in group determining means determines the number of records of the post-conversion block by counting the records in the buffer, during a process immediately preceding a time when the data size exceeds the threshold size, and stores that number;
first conversion means for generating a first block by dividing at least one record including a plurality of fields into the fields and combining fields of like kind; and
second conversion means for converting the first block into a second block using a stored data conversion method, and for storing the second block, wherein the second conversion means selects one data compression algorithm that provides highest compression efficiency from a plurality of stored data compression algorithms in accordance with properties of the first block, and converts the first block into the second block using the selected data compression algorithm. - View Dependent Claims (2, 3, 4, 5, 8)
-
-
6. A data management method comprising:
-
a virtual conversion, repeatedly reading at least one record from an input file having a plurality of records, each record including a plurality of fields, adding the at least one record to a buffer, and converting records in the buffer into a post-conversion block on a field-by-field basis until data size of post-conversion blocks for each kind of field of the records in the buffer exceeds a threshold size;
determining the number of records of the post-conversion block by counting the records in the buffer during a process immediately preceding a time when the data size exceeds the threshold size, and storing that number; and
reading out records of the stored number of records from the input file, converting the records read out into post-conversion blocks on a field-by-field basis, and storing the post-conversion blocks. - View Dependent Claims (7)
generating a first block by converting a record in the buffer into a converted record having a fixed-length field format, dividing the converted record into fields, and combining fields of like kind; and
converting the first block into a second block using a stored data conversion method.
-
Specification