Post-processing algorithm for byte-level file differencing
First Claim
1. An apparatus comprising an optimizing system that generates difference files between an original byte stream and a new byte stream by identifying replacement content in a first area of the new byte stream, wherein the replacement content includes a group of bytes of the new byte streams identified as at least one of byte insertions and byte replacements in an operation array, identifying content similarities between the replacement content and at least one of the original byte stream and a second area of the new byte stream, and encoding information of the content similarities to the difference file.
2 Assignments
0 Petitions
Accused Products
Abstract
Devices and methods are provided for generating difference files that include encoded differences between byte streams of original and new file versions. Generation of the difference files includes processing by components of an optimizing system and/or algorithm to identify replacement content in a first area of the new byte stream. The replacement content includes a group of bytes of the new byte stream identified as at least one of byte insertions and byte replacements in a corresponding operation array. The optimizing system identifies content similarities between the replacement content and at least one of the original byte stream and a second area of the new byte stream. The optimizing system generates information of the difference file by encoding information of the content similarities.
-
Citations
13 Claims
-
1. An apparatus comprising an optimizing system that generates difference files between an original byte stream and a new byte stream by identifying replacement content in a first area of the new byte stream, wherein the replacement content includes a group of bytes of the new byte streams identified as at least one of byte insertions and byte replacements in an operation array, identifying content similarities between the replacement content and at least one of the original byte stream and a second area of the new byte stream, and encoding information of the content similarities to the difference file.
-
2. An apparatus for generating difference files, comprising:
-
means for receiving an original byte stream that is a segment of an original file and a new byte stream that is a segment of a new file, wherein the new file includes an updated version of the original file;
means for identifying replacement content of the new byte stream, wherein replacement content includes a group of bytes of the new byte stream identified as at least one of byte insertions and byte replacements in an operation array;
means for comparing the replacement content and portions of the new byte stream and identifying a first set of content similarities resulting from the comparison;
means for comparing the replacement content and portions of the original byte stream and identifying a second set of content similarities resulting from the comparison; and
means for generating the difference file by encoding information of the first set and second set of content similarities. - View Dependent Claims (3)
-
-
4. A method for generating difference files, comprising:
-
receiving an original byte stream that is a segment of an original file and a new byte stream that is a segment of a new file, wherein the new file includes an updated version of the original file;
identifying replacement content of the new byte stream, wherein replacement content includes a group of bytes of the new byte stream identified as at least one of byte insertions and byte replacements in an operation array;
comparing the replacement content and portions of the new byte stream and identifying a first set of content similarities resulting from the comparison;
comparing the replacement content and portions of the original byte stream and identifying a second set of content similarities resulting from the comparison; and
generating the difference file by encoding information of the first set and second set of content similarities. - View Dependent Claims (5, 6, 7, 8, 9, 10)
-
-
11. A system for updating electronic files of remote devices, comprising:
-
a first device including a file differencing engine that generates differences between an original version and a new version of an electronic file by;
receiving an original byte stream that is a segment of the original version and a new byte stream that is a segment of the new version;
identifying replacement content of the new byte stream, wherein replacement content includes a group of bytes of the new byte stream identified as at least one of byte insertions and byte replacements in an operation array;
comparing the replacement content and portions of the new byte stream and identifying a first set of content similarities resulting from the comparison;
comparing the replacement content and portions of the original byte stream and identifying a second set of content similarities resulting from the comparison;
generating a difference file including encoded information of the first set and second set of content similarities; and
a file updating engine hosted on the remote devices, the file updating engine generating a copy of the new version using the difference file. - View Dependent Claims (12)
-
-
13. A computer readable medium including executable instructions which, when executed in a processing system, reduce a size of a difference file that includes coded differences between an original version and a new version of an electronic file by:
-
identifying replacement content of a new byte stream that is a segment of the new version, wherein replacement content includes a group of bytes of the new byte stream identified as at least one of byte insertions and byte replacements in an operation array;
comparing the replacement content and portions of the new byte stream and identifying a first set of content similarities resulting from the comparison;
comparing the replacement content and portions of an original byte stream and identifying a second set of content similarities resulting from the comparison, wherein the original byte stream is a segment of the original version; and
generating the difference file by encoding information of the first set and second set of content similarities.
-
Specification