USING AN INVERTED INDEX IN A PIPELINED SEARCH QUERY TO DETERMINE A SET OF EVENT DATA THAT IS FURTHER LIMITED BY FILTERING AND/OR PROCESSING OF SUBSEQUENT QUERY PIPESTAGES
First Claim
1. A method for searching data, the method comprising:
- providing a field searchable data store comprising a plurality of event records, each event record comprising a time-stamped portion of raw machine data;
receiving an incoming query;
generating results for the incoming query using a pipelined process, the pipelined process comprising;
responsive to the incoming query accessing an inverted index, wherein each entry in the inverted index comprises at least one field name, a corresponding at least one field value and a reference value associated with each field name and field value pair that identifies a location in the field searchable data store where an associated event record is stored; and
in a first pipe-stage of the pipelined process, using the inverted index to filter out a first subset of the plurality of event records, wherein the subset comprises one or more event records with corresponding reference values in the inverted index.
2 Assignments
0 Petitions
Accused Products
Abstract
Embodiments of the present disclosure provide techniques for using an inverted index in a pipelined search query. A field searchable data store is provided that comprises a plurality of event records, each event record comprising a time-stamped portion of raw machine data. Responsive to the reciept of an incoming search query, the search engine accesses an inverted index, wherein each entry in the inverted index comprises at least one field name, a corresponding at least one field value and a reference value associated with each field name and value pair that identifies a location in the data store where an associated event record is stored. Once the inverted index is accessed, it can be used to filter out a subset of the plurality of event records, wherein the subset comprises one or more event records with corresponding reference values in the inverted index.
40 Citations
30 Claims
-
1. A method for searching data, the method comprising:
-
providing a field searchable data store comprising a plurality of event records, each event record comprising a time-stamped portion of raw machine data; receiving an incoming query; generating results for the incoming query using a pipelined process, the pipelined process comprising; responsive to the incoming query accessing an inverted index, wherein each entry in the inverted index comprises at least one field name, a corresponding at least one field value and a reference value associated with each field name and field value pair that identifies a location in the field searchable data store where an associated event record is stored; and in a first pipe-stage of the pipelined process, using the inverted index to filter out a first subset of the plurality of event records, wherein the subset comprises one or more event records with corresponding reference values in the inverted index. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method for searching data, the method comprising:
-
providing a field searchable data store comprising a plurality of event records, each event record comprising a time-stamped portion of raw machine data; receiving a collection query that references a field name; responsive to the collection query, generating an inverted index by; determining an extraction rule associated with the field name; extracting a field value corresponding to the field name from one or more event records in the field searchable data store using the extraction rule; and generating an entry in the inverted index for each of the one or more event records, wherein each entry comprises the field name, the corresponding field value and and a reference value that identifies a location in the field searchable data store where an associated event record is stored; and responsive to an incoming search query, retrieving the inverted index and using one or more reference values in the inverted index to retrieve additional information from the one or more event records. - View Dependent Claims (11, 12, 13, 15, 16, 17, 18, 19)
-
-
14. The method of claim 14, wherein the filtering is performed in accordance with a conditional command.
-
20. A system comprising:
-
a field searchable data store comprising a plurality of event records, each event record comprising a time-stamped portion of raw machine data that is field searchable; a processing device coupled with the field searchable data store, the processing device being configured to; receive an incoming query; responsive to the incoming query, accessing an inverted index, wherein each entry in the inverted index comprises at least one field name, a corresponding at least one field value and a reference value associated with each field name and field value pair that identifies a location in the field searchable data store where an associated event record is stored; using a pipelined process to generate results of the incoming query, the pipelined process comprising; in a first pipe stage of the pipeline process, using the inverted index to filter out a first subset of the plurality of event records, wherein the subset comprises one or more event records with corresponding reference values in the inverted index satisfying a portion of the incoming query; and in a subsequent pipe stage of the pipeline process, performing a field search through raw machine data associated with the first subset of the plurality of event records to obtain field results wherein the field search is performed based on field information specified in the incoming query. - View Dependent Claims (21, 22, 23, 24, 25)
-
-
26. A system comprising:
-
a field searchable data store comprising a plurality of event records, each event record comprising a time-stamped portion of raw machine data that is field searchable; a processing device coupled with the field searchable data store, the processing device being configured to; receive an incoming query; responsive to the incoming query, accessing an inverted index, wherein each entry in the inverted index comprises at least one field name, a corresponding at least one field value and a reference value associated with each field name and field value pair that identifies a location in the field searchable data store where an associated event record is stored; using a pipelined process to generate results of the incoming query, the pipelined process comprising; in a first pipe stage of the pipeline process, using the inverted index to filter out a first subset of the plurality of event records, wherein the subset comprises one or more event records with corresponding reference values in the inverted index satisfying a portion of the incoming query; and in a subsequent pipe stage of the pipeline process, performing a keyword search through raw machine data associated with the first subset of the plurality of event records to obtain keyword search results wherein the keyword search is performed based on information specified in the incoming query. - View Dependent Claims (27, 28, 29, 30)
-
Specification