Compressed journaling in event tracking files for metadata recovery and replication
First Claim
1. A computer-implemented method, comprising:
- receiving raw data on a computing device;
determining boundaries that divide the raw data into a set of events;
assigning a time stamp to each event in the set of events;
identifying a subset of events in the set of events;
compressing the raw data that includes the subset of events;
storing the compressed raw data that includes the subset of events;
determining a compression offset for the compressed raw data that includes the subset of events, wherein the compression offset indicates a location of the compressed raw data that includes the subset of events;
storing the compression offset in a compression index;
associating one or more uncompressed offsets with the compression offset, wherein each uncompressed offset includes information for identifying one of the events in the raw data that includes the subset of events;
receiving an indication to retrieve a particular event in the subset of events;
using the compression index to identify the compression offset indicating the location of the compressed raw data that includes the particular event; and
using the compression offset and an associated uncompressed offset to locate the particular event.
1 Assignment
0 Petitions
Accused Products
Abstract
Embodiments are directed towards employing compressed journaling for event tracking files for metadata recovery and replication. Event data and related metadata are received from one or more client devices. When a feature within the received metadata is detected that is previously unwritten to a journal, then the previously unwritten feature is written to the journal. Further, any feature is detected for the received event data that is determined to be different from a feature associated with an immediately preceding event data that is written in the journal, then the detected different feature is identified in the journal. In one embodiment, the identification employs writing to the journal an effective feature record that may employ indices identifying the different feature. The received event data is also written to the journal and may further employ string arguments to minimize recording of redundant information into the journal.
-
Citations
18 Claims
-
1. A computer-implemented method, comprising:
-
receiving raw data on a computing device; determining boundaries that divide the raw data into a set of events; assigning a time stamp to each event in the set of events; identifying a subset of events in the set of events; compressing the raw data that includes the subset of events; storing the compressed raw data that includes the subset of events; determining a compression offset for the compressed raw data that includes the subset of events, wherein the compression offset indicates a location of the compressed raw data that includes the subset of events; storing the compression offset in a compression index; associating one or more uncompressed offsets with the compression offset, wherein each uncompressed offset includes information for identifying one of the events in the raw data that includes the subset of events; receiving an indication to retrieve a particular event in the subset of events; using the compression index to identify the compression offset indicating the location of the compressed raw data that includes the particular event; and using the compression offset and an associated uncompressed offset to locate the particular event. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer-implemented system, comprising:
-
one or more processors; and one or more non-transitory computer-readable storage mediums containing instructions configured to cause the one or more processors to perform operations including; receiving raw data; determining boundaries that divide the raw data into a set of events; assigning a time stamp to each event in the set of events; identifying a subset of events in the set of events; compressing the raw data that includes the subset of events; storing the compressed raw data that includes the subset of events; determining a compression offset for the compressed raw data that includes the subset of events, wherein the compression offset indicates a location of the compressed raw data that includes the subset of events; storing the compression offset in a compression index; associating one or more uncompressed offsets with the compression offset, wherein each uncompressed offset includes information for identifying one of the events in the raw data that includes the subset of events; receiving an indication to retrieve a particular event in the subset of events; using the compression index to identify the compression offset indicating the location of the compressed raw data that includes the particular event; and using the compression offset and an associated uncompressed offset to locate the particular event. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A computer-program product, tangibly embodied in a non-transitory machine-readable storage medium, including instructions configured to cause a data processing apparatus to:
-
receive raw data; determine boundaries that divide the raw data into a set of events; assign a time stamp to each event in the set of events; identify a subset of events in the set of events; compress the raw data that includes the subset of events; store the compressed raw data that includes the subset of events; determine a compression offset for the compressed raw data that includes the subset of events, wherein the compression offset indicates a location of the compressed raw data that includes the subset of events; store the compression offset in a compression index; associate one or more uncompressed offsets with the compression offset, wherein each uncompressed offset includes information for identifying one of the events in the raw data that includes the subset of events; receive an indication to retrieve a particular event in the subset of events; use the compression index to identify the compression offset indicating the location of the compressed raw data that includes the particular event; and use the compression offset and an associated uncompressed offset to locate the particular event. - View Dependent Claims (14, 15, 16, 17, 18)
-
Specification