Please download the dossier by clicking on the dossier button x
×

Content-based segmentation scheme for data compression in storage and transmission including hierarchical segment representation

  • US 6,828,925 B2
  • Filed: 12/08/2003
  • Issued: 12/07/2004
  • Est. Priority Date: 10/30/2002
  • Status: Expired due to Term
First Claim
Patent Images

1. A method of encoding data within a system, the method comprising:

  • determining a set of cut points in input data, the input data including a sequence of symbols, wherein a cut point is determined using a fingerprint representation of a number of sequential symbols in the sequence of symbols;

    segmenting the input data as indicated by the set of cut points;

    for each segment, determining whether the segment is to be a referenced segment;

    for each referenced segment, replacing the segment data of the referenced segment with a reference label;

    for each referenced segment not already present in a persistent segment store, storing a reference binding in the persistent segment store, wherein a reference binding associates a referenced segment'"'"'s data and its reference label;

    determining whether any sequence of segments is to be grouped as a reference group;

    for each reference group, replacing the references in the group with a group label; and

    for each reference group not already present in the persistent segment store, storing a group reference binding in the persistent segment store, wherein a group reference binding associates a reference group'"'"'s references with its group label.

View all claims
  • 21 Assignments
Timeline View
Assignment View
    ×
    ×