System and method for incremental and continuous data compression
First Claim
Patent Images
1. A method for compressing a data stream comprising the steps of:
- receiving a data stream having a plurality of data symbols;
identifying a first data phrase as a first repeated phrase, said first data phrase being a combination of a first data term and at least a second data term, if said first data phrase matches a first library phrase, said first library phrase is one of a plurality of library phrases in a phrase library;
replacing said first data phrase with a first data phrase identifier associated with said first library phrase to generate a compressed data stream if said first data phrase is identified as said first repeated phrase;
storing said first data phrase as a second library phrase if said first data phrase does not match any of said plurality of library phrases;
replacing said first data phrase with a second data phrase identifier associated with said second library phrase to generate a compressed data stream if said first data phrase is not identified as said first repeated phrase; and
transmitting said compressed data stream while still receiving additional data symbols in said data stream;
wherein each of said data phrases includes a fixed number of data terms;
wherein each of said data terms is one of a data symbol and a data phrase identifier.
16 Assignments
0 Petitions
Accused Products
Abstract
A data compression system and method for that is capable of detecting and eliminating repeated phrases of variable length within a window of virtually unlimited size.
-
Citations
50 Claims
-
1. A method for compressing a data stream comprising the steps of:
-
receiving a data stream having a plurality of data symbols;
identifying a first data phrase as a first repeated phrase, said first data phrase being a combination of a first data term and at least a second data term, if said first data phrase matches a first library phrase, said first library phrase is one of a plurality of library phrases in a phrase library;
replacing said first data phrase with a first data phrase identifier associated with said first library phrase to generate a compressed data stream if said first data phrase is identified as said first repeated phrase;
storing said first data phrase as a second library phrase if said first data phrase does not match any of said plurality of library phrases;
replacing said first data phrase with a second data phrase identifier associated with said second library phrase to generate a compressed data stream if said first data phrase is not identified as said first repeated phrase; and
transmitting said compressed data stream while still receiving additional data symbols in said data stream;
wherein each of said data phrases includes a fixed number of data terms;
wherein each of said data terms is one of a data symbol and a data phrase identifier. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50)
-
-
17. A system for compressing a data stream comprising:
-
receiving means for receiving a data stream having a plurality of data symbols;
identifying means for identifying a first data phrase as a first repeated phrase, said first data phrase being a combination of a first data term and at least a second data term, if said first data phrase matches a first library phrase, said first library phrase is one of a plurality of library phrases in a phrase library;
first replacing means for replacing said first data phrase with a first data phrase identifier associated with said first library phrase to generate a compressed data stream if said first data phrase is identified as said first repeated phrase;
first storing means storing said first data phrase as a second library phrase if said first data phrase does not match any of said plurality of library phrases;
second replacing means for replacing said first data phrase with a second data phrase identifier associated with said second library phrase to generate a compressed data stream if said first data phrase is not identified as said first repeated phrase; and
transmitting means for transmitting said compressed data stream while still receiving additional data symbols in said data stream;
wherein each of said data phrases includes a fixed number of data terms;
wherein each of said data terms is one of a data symbol and a data phrase identifier.
-
-
34. A system for compressing a data stream comprising:
-
a data receiver for receiving a data stream having a plurality of data symbols;
a phrase library having a plurality of library phrases;
a detector unit, coupled to said data receiver including;
a first identification unit for identifying a first data phrase as a first repeated phrase, said first data phrase being a combination of a first data term and at least a second data term, if said first data phrase matches a first library phrase, said first library phrase is one of a plurality of library phrases in a phrase library;
a library building unit for storing said first data phrase as a second library phrase if said first data phrase does not match any of said plurality of library phrases; and
a replacing unit for replacing said first data phrase with a first data phrase identifier associated with said first library phrase to generate a compressed data stream if said first data phrase is identified as said first repeated phrase and for replacing said first data phrase with a second data phrase identifier associated with said second library phrase to generate a compressed data stream if said first data phrase is not identified as said first repeated phrase; and
a transmitting unit for transmitting said compressed data stream while still receiving additional data symbols in said data stream;
wherein each of said data phrases includes a fixed number of data terms; and
wherein each of said data terms is one of a data symbol and a data phrase identifier.
-
Specification