SEARCHABLE ARCHIVE
First Claim
1. A method of retrieving a datum from a searchable archive by a data processing system, the searchable archive comprising a metadata file and one or more compacted files, comprising:
- selecting a selected compacted file from the one or more compacted files that may include the datum using the metadata file;
accessing the selected compacted file;
selecting a selected compressed segment from one or more compressed segments in the selected compacted file using metadata stored in the compacted file;
generating a decompressed segment from the selected compressed segment; and
searching the decompressed segment to determine if the decompressed segment includes the datum.
11 Assignments
0 Petitions
Accused Products
Abstract
A searchable archiving system. A searchable archiving system includes one or more compacted files of archive data loosely coupled to a search process. To create a compacted file, an archiving process tokenizes the archive data, optimizes the tokenized archive data, and extracts archive metadata from the tokenized data. The tokenized data may then be compressed in a variety of ways into compressed segments that may be individually accessed and decompressed by the search agents. Before compression, segment metadata is extracted from the segments. The compressed segments and segment metadata are then combined to create a compacted file. The search process accesses the compacted files by consulting locally stored archive metadata extracted from the files during the compaction process. The search process then invokes one or more search agents that actively search the compacted files. The search agents do so by using the segment metadata to identify segments to decompress and search.
-
Citations
8 Claims
-
1. A method of retrieving a datum from a searchable archive by a data processing system, the searchable archive comprising a metadata file and one or more compacted files, comprising:
-
selecting a selected compacted file from the one or more compacted files that may include the datum using the metadata file;
accessing the selected compacted file;
selecting a selected compressed segment from one or more compressed segments in the selected compacted file using metadata stored in the compacted file;
generating a decompressed segment from the selected compressed segment; and
searching the decompressed segment to determine if the decompressed segment includes the datum. - View Dependent Claims (2)
-
-
3. A method of retrieving a datum from a searchable archive by a data processing system, the searchable archive comprising a metadata file and one or more compacted files, comprising:
-
selecting a selected compacted file from the one or more compacted files that may include the datum using the metadata;
accessing the selected compacted file;
selecting one or more bit vectors from the selected compacted file; and
performing a Boolean operation on the bit vectors included in the selected compacted file to determine if the datum is stored in the selected compacted file. - View Dependent Claims (4)
-
-
5. A data processing system for retrieving a datum from a searchable archive, the searchable archive comprising a metadata file and one or more compacted files, comprising:
-
a processor; and
a memory coupled to the processor, the memory having program instructions executable by the processor stored therein, the program instructions including;
selecting a selected compacted file from the one or more compacted files that may include the datum using the metadata file;
accessing the selected compacted file;
selecting a selected compressed segment from one or more compressed segments in the selected compacted file using metadata stored in the compacted file;
generating a decompressed segment from the selected compressed segment; and
searching the decompressed segment to determine if the decompressed segment includes the datum. - View Dependent Claims (6)
-
-
7. A data processing system for retrieving a datum from a searchable archive, the searchable archive comprising a metadata file and one or more compacted files, comprising:
-
a processor; and
a memory coupled to the processor, the memory having program instructions executable by the processor stored therein, the program instructions including;
selecting a selected compacted file from the one or more compacted files that may include the datum using the metadata;
accessing the selected compacted file;
selecting one or more bit vectors from the selected compacted file; and
performing a Boolean operation on the bit vectors included in the selected compacted file to determine if the datum is stored in the selected compacted file. - View Dependent Claims (8)
-
Specification