×

Generation of a search query to approximate replication of a cluster of events

  • US 10,296,616 B2
  • Filed: 07/31/2014
  • Issued: 05/21/2019
  • Est. Priority Date: 07/31/2014
  • Status: Active Grant
First Claim
Patent Images

1. A method for improving time-based searching of data, the method comprising:

  • (a) receiving a first user input that specifies a first search query;

    (b) in response to the first user input that specifies the first search query,(b)(1) accessing events in a field-searchable data store, each event being associated with a time stamp and containing a portion of raw machine-generated data indicative of performance or operation of a component in an information-technology environment, wherein the raw machine-generated data contained in each event includes a plurality of strings associated with a corresponding time stamp, and wherein each string includes text, numbers, or a combination of text and numbers;

    (b)(2) applying a clustering algorithm to the accessed events to form at least one cluster of events, the clustering algorithm generating, for each event, an ordered list of keywords contained in the event, and grouping events into a same cluster when their respective ordered lists of keywords meet a similarity threshold, and wherein an ordering of the keywords in the ordered list of keywords for any particular event is based on positions of the keywords within the particular event;

    (b)(3) for each cluster of the at least one cluster of events, after formation of the cluster, defining, from contents of the cluster, a set of one or more search terms as criteria for a corresponding search query that will retrieve at least one of the events of the cluster when executed against the accessed events, and storing the search terms in memory, wherein identifying the set of one or more search terms for a cluster of events is not part of forming a cluster of events, execution of the corresponding search query against the accessed events including evaluation of the search terms against the raw machine-generated data in the accessed events; and

    (b)(4) causing a display of a result of the first search query, the displayed result including an identification of each cluster of the at least one cluster of events;

    (c) receiving a second user input that selects a first cluster of the at least one cluster of events, the second user input being responsive to display of the identification of each cluster of the at least one cluster of events; and

    (d) in response to the second user input that selects the first cluster,(d)(1) executing a second search query against the field-searchable data store to retrieve stored events that satisfy a criterion for similarity to the first cluster, the second search query including search terms previously associated with the first cluster as part of said defining, wherein executing the second search query includes retrieving from the memory the search terms previously associated with the first cluster; and

    (d)(2) causing display, to the user, of a result of the second search query, including causing display of an event that satisfies the second search query;

    wherein the method is performed by one or more processing devices.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×