Multi-site clustering
First Claim
1. A method comprising:
- receiving data at a particular indexer belonging to a first user-specified grouping of indexers, the first user-specified grouping of indexers associated with a particular geographic location;
storing, by the particular indexer, at least one grouped subset of the data in a data store accessible by the particular indexer;
selecting, by a master node, a set of peer indexers based on both of (i) a user-specified indexer replication factor indicating a number of separate indexers at which the at least one grouped subset of the data is to be stored, and (ii) a separate user-specified site replication factor indicating a number of sites at which the at least one grouped subset of the data is to be stored, each site corresponding to a separate geographic location that is different from the first geographic location;
generating, by the master node, replication instructions identifying the selected set of peer indexers;
receiving, by the particular indexer, the data replication instructions identifying the selected peer indexers; and
sending, by the particular indexer, the at least one grouped subset of the data to the peer indexers based on the data replication instructions.
1 Assignment
0 Petitions
Accused Products
Abstract
According to various embodiments, techniques are described for managing data within a multi-site clustered data intake and query system. A data intake and query system as described herein generally refers to a system for collecting, retrieving, and analyzing data. In this context, a clustered data intake and query system generally refers to a system environment that is configured to provide data redundancy and other features that improve the availability of data stored by the system. For example, a clustered data intake and query system may be configured to store multiple copies of data stored by the system across multiple components such that recovery from a failure of one or more of the components is possible by using copies of the data stored elsewhere in the cluster.
-
Citations
16 Claims
-
1. A method comprising:
-
receiving data at a particular indexer belonging to a first user-specified grouping of indexers, the first user-specified grouping of indexers associated with a particular geographic location; storing, by the particular indexer, at least one grouped subset of the data in a data store accessible by the particular indexer; selecting, by a master node, a set of peer indexers based on both of (i) a user-specified indexer replication factor indicating a number of separate indexers at which the at least one grouped subset of the data is to be stored, and (ii) a separate user-specified site replication factor indicating a number of sites at which the at least one grouped subset of the data is to be stored, each site corresponding to a separate geographic location that is different from the first geographic location; generating, by the master node, replication instructions identifying the selected set of peer indexers; receiving, by the particular indexer, the data replication instructions identifying the selected peer indexers; and sending, by the particular indexer, the at least one grouped subset of the data to the peer indexers based on the data replication instructions. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. One or more non-transitory computer-readable storage media, storing software instructions, which when executed by one or more processors cause performance of steps of:
-
receiving data at a particular indexer belonging to a first user-specified grouping of indexers, the first user-specified grouping of indexers associated with a particular geographic location; storing, by the particular indexer, at least one grouped subset of the data in a data store accessible by the particular indexer; selecting, by a master node, a set of peer indexers based on both of (i) a user-specified indexer replication factor indicating a number of separate indexers at which the at least one grouped subset of the data is to be stored, and (ii) a separate user-specified site replication factor indicating a number of sites at which the at least one grouped subset of the data is to be stored, each site corresponding to a separate geographic location that is different from the first geographic location; generating, by the master node, replication instructions identifying the selected set of peer indexers; receiving, by the particular indexer, the data replication instructions identifying the selected peer indexers; and sending, by the particular indexer, the at least one grouped subset of the data to the peer indexers based on the data replication instructions. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
Specification