Performance-based reset of data compression dictionary
First Claim
1. A system comprising:
- compressor means for processing a digital data stream according to a predetermined adaptive compression strategy, said compressor means including;
compressor input means for receiving an input stream to be compressed, said input stream including strings of input codes, each string including at least one input code, each input code having a respective bit-length,compressor output means for outputting an output stream yielded by said compression strategy, said output stream including output codes, each output code having a respective bit-length,dictionary means for translating said strings into said output codes, said dictionary means having a predetermined code set of entries, each of said entries corresponding to a respective one of said output codes, said code set including an assignable set of assignable entries, said assignable set including an assigned subset of assigned entries and an unassigned subset of unassigned entries, at any given time said assigned subset having a non-negative number of elements, at any given time said unassigned subset having a non-negative number of elements, each of said assigned entries having its respective output code assigned to a respective one of said strings, each of said unassigned entries having no string assigned to its respective output code; and
assignment means for, in accordance with said compression strategy, assigning unassigned ones of said assignable entries to strings received by said compressor input means;
monitor means for providing a bit-length comparison value as a function of the bit-lengths of a string series of said strings and a translation of said string series into said output codes, said monitor means being coupled to said compressor input means and said compressor output means; and
clearing means for converting at least some of said assigned entries to unassigned entries when said bit-length comparison value meets a predetermined comparison criterion, said clearing means being coupled to said monitor means and said compressor means.
2 Assignments
0 Petitions
Accused Products
Abstract
An adaptive data compression system is reset when performance drops below a predetermined threshold to permit greater compression of long files with evolving distributions of symbol combinations. The compression system uses a resettable dictionary in which initially unassigned codes are strategically assigned to symbol combinations as they are encountered in the data stream.
The difference between the bit-lengths of corresponding lengths of the compressor input and output is compared with a value representing a predetermined performance threshold. The dictionary can be reset if the actual performance falls below the performance threshold. The reset can be inhibited while the dictionary is less than half-full to ensure that performance measures are statistically significant. However, if the performance is such that data expansion is occurring, reset is not so delayed.
56 Citations
12 Claims
-
1. A system comprising:
-
compressor means for processing a digital data stream according to a predetermined adaptive compression strategy, said compressor means including; compressor input means for receiving an input stream to be compressed, said input stream including strings of input codes, each string including at least one input code, each input code having a respective bit-length, compressor output means for outputting an output stream yielded by said compression strategy, said output stream including output codes, each output code having a respective bit-length, dictionary means for translating said strings into said output codes, said dictionary means having a predetermined code set of entries, each of said entries corresponding to a respective one of said output codes, said code set including an assignable set of assignable entries, said assignable set including an assigned subset of assigned entries and an unassigned subset of unassigned entries, at any given time said assigned subset having a non-negative number of elements, at any given time said unassigned subset having a non-negative number of elements, each of said assigned entries having its respective output code assigned to a respective one of said strings, each of said unassigned entries having no string assigned to its respective output code; and assignment means for, in accordance with said compression strategy, assigning unassigned ones of said assignable entries to strings received by said compressor input means; monitor means for providing a bit-length comparison value as a function of the bit-lengths of a string series of said strings and a translation of said string series into said output codes, said monitor means being coupled to said compressor input means and said compressor output means; and clearing means for converting at least some of said assigned entries to unassigned entries when said bit-length comparison value meets a predetermined comparison criterion, said clearing means being coupled to said monitor means and said compressor means. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method for compressing data using a data compressor for processing a digital data stream according to a predetermined adaptive compression strategy, said compressor including:
-
compressor input means for receiving an input stream to be compressed, said input stream including strings of input codes, each string including at least one input code, each input code having a respective bit-length, compressor output means for outputting an output stream yielded by said compression strategy, said output stream including output codes, each output code having a respective bit-length, and dictionary means for translating said strings into said output codes, said dictionary means having a predetermined code set of entries, each of said entries corresponding to a respective one of said output codes, said code set including an assignable set of assignable entries, said assignable set including an assigned subset of assigned entires and an unassigned subset of unassigned entries, at any given time said assigned subset having a non-negative number of elements, at any given time said unassigned subset having a non-negative number of elements, each of said assigned entries having its respective output code assigned to a respective one of said strings, each of said unassigned entries having no string assigned to its respective output code, said dictionary means having an initial state in which said assignable entries are unassigned entries, said method comprising; inputting said input stream into said compressor, said input stream including input blocks, each input block including code strings, each code string including at least one input code, said compressor translating each input block into a respective output block, each output block including output codes each of which corresponds to a respective one of said strings, each input block being characterizable by an input bit-length and each output block being characterizable by an output bit-length; assigning, in accordance with said compression strategy, unassigned ones of said assignable entries to strings received by said compressor input means; comparing the input bit-length of each input block with the output bit-length of the corresponding output block so as to provide a bit-length comparison value for each input block, the comparison value for a given block being determined after said given block is translated by said dictionary means; and clearing at least some of said assigned entries to unassigned entries when an input block received by said compressor results in a comparison value meeting a predetermined comparison criterion. - View Dependent Claims (9, 10, 11, 12)
-
Specification