SEGMENT DATA VISIBILITY AND MANAGEMENT IN A DISTRIBUTED DATABASE OF TIME STAMPED RECORDS
First Claim
1. A non-transitory computer readable medium comprising computer executable instructions stored thereon to cause one or more processors to:
- receive a query request at a broker node from a query client, the query request related to a data source and a query time interval;
determine data segments applicable to the query request via interaction with at least one of a coordination node and a zookeeper node, the determined data segments represented as a collection of blocks; and
return a query response to the query client, the query response retrieved from the collection of blocks.
14 Assignments
0 Petitions
Accused Products
Abstract
A distributed database of time stamped records can be used to store time series data such as events occurring on the Internet. A distributed database of time stamped records can store segments of data that contain events for different time intervals. The volume of events occurring on the Internet introduces a “Big Data” variable that makes collections of data sets so large and complex they are difficult to manage. Disclosed are systems and methods to manage segments of a distributed database of time stamped records for optimal size (for storage and performance reasons etc.) and for proper visibility to data when different segments contain data for overlapping time periods.
-
Citations
20 Claims
-
1. A non-transitory computer readable medium comprising computer executable instructions stored thereon to cause one or more processors to:
-
receive a query request at a broker node from a query client, the query request related to a data source and a query time interval; determine data segments applicable to the query request via interaction with at least one of a coordination node and a zookeeper node, the determined data segments represented as a collection of blocks; and return a query response to the query client, the query response retrieved from the collection of blocks. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A non-transitory computer readable medium comprising computer executable instructions stored thereon to cause one or more processing units to:
determine a plurality of merger segments to merge into a single merged segment, wherein the determination of merger segments is based on at least one of a size of each of the merger segments, a size of a resulting merged segment, and machine resources providing infrastructure to a distributed database of time stamped records. - View Dependent Claims (14, 15, 16, 18, 19, 20)
-
17. A non-transitory computer readable medium comprising computer executable instructions stored thereon to cause one or more processing units to:
-
determine a time range for each of a plurality of segments; determine a data source for each of the plurality of segments; obtain, by a coordinator, at least one data source specific rule; and perform an action on one or more of the plurality of segments based on the at least one data source specific rule when the rule matches at least one of the time range and the data source.
-
Specification