In-memory time series database and processing in a distributed environment
First Claim
1. A computer-program product tangibly embodied in a non-transitory, machine-readable storage medium having instructions stored thereon, the instructions being executable to cause a grid-computing device to perform the following operations:
- accessing information while being operated in a grid-computing system that includes other grid-computing devices,wherein the information describes a hierarchical schema for assembling multiple time series of data in a distributed manner that includes assembling multiple time series at the grid-computing device and other time series at the other grid-computing devices,wherein the hierarchical schema associates each of the multiple time series with a particular level of the hierarchical schema and prescribes a structure of nested relationships between time series assigned to different levels of the hierarchical schema;
assembling multiple time series associated with a lowest level of the hierarchical schema by inventorying a portion of a data set;
assembling multiple time series associated with an intermediate level of the hierarchical schema by aggregating the time series associated with the lowest level based on the structure of nested relationships, wherein the intermediate level is above the lowest level, and wherein;
the data set is partitioned at the intermediate level of the hierarchical schema such that a first number (n) of partitions are defined, the n partitions including;
a partition that includes the inventoried portion;
and a second number (n−
1) of other partitions;
the other grid-computing devices consist of n−
1 grid-computing devices; and
each of the other partitions is assigned to one of the other grid-computing devices;
receiving multiple additional time series associated with the intermediate level and assembled by at least one of the other grid-computing devices;
assembling a time series associated with a level of the hierarchical schema above the intermediate level by aggregating the assembled time series associated with the intermediate level and the multiple additional time series based on the structure of nested relationships;
using volatile memory to store the time series associated with the level above the intermediate level;
accessing the stored time series in memory; and
generating a forecast by processing the accessed time series.
1 Assignment
0 Petitions
Accused Products
Abstract
This disclosure describes methods, systems, and computer-readable media for accessing information that describes a hierarchical schema for assembling multiple time series of data in a distributed manner. The hierarchical schema associates each of the time series with a particular level of the hierarchical schema and prescribes a structure of relationships between time series assigned to different levels of the hierarchical schema. Multiple time series associated with a lowest level of the hierarchical schema are assembled by inventorying a portion of a data set. Multiple time series associated with an intermediate level of the hierarchical schema are assembled by aggregating the time series associated with the lowest level based on the structure of nested relationships. Also, multiple additional time series that are associated with the intermediate level and which were assembled by other grid-computing devices are received. After the time series are assembled, they are made available for processing to facilitate parallelized forecasting.
163 Citations
27 Claims
-
1. A computer-program product tangibly embodied in a non-transitory, machine-readable storage medium having instructions stored thereon, the instructions being executable to cause a grid-computing device to perform the following operations:
-
accessing information while being operated in a grid-computing system that includes other grid-computing devices, wherein the information describes a hierarchical schema for assembling multiple time series of data in a distributed manner that includes assembling multiple time series at the grid-computing device and other time series at the other grid-computing devices, wherein the hierarchical schema associates each of the multiple time series with a particular level of the hierarchical schema and prescribes a structure of nested relationships between time series assigned to different levels of the hierarchical schema; assembling multiple time series associated with a lowest level of the hierarchical schema by inventorying a portion of a data set; assembling multiple time series associated with an intermediate level of the hierarchical schema by aggregating the time series associated with the lowest level based on the structure of nested relationships, wherein the intermediate level is above the lowest level, and wherein; the data set is partitioned at the intermediate level of the hierarchical schema such that a first number (n) of partitions are defined, the n partitions including; a partition that includes the inventoried portion; and a second number (n−
1) of other partitions;the other grid-computing devices consist of n−
1 grid-computing devices; andeach of the other partitions is assigned to one of the other grid-computing devices; receiving multiple additional time series associated with the intermediate level and assembled by at least one of the other grid-computing devices; assembling a time series associated with a level of the hierarchical schema above the intermediate level by aggregating the assembled time series associated with the intermediate level and the multiple additional time series based on the structure of nested relationships; using volatile memory to store the time series associated with the level above the intermediate level; accessing the stored time series in memory; and generating a forecast by processing the accessed time series. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer-implemented method comprising the following operations performed by a grid-computing device while operating in a grid-computing system that includes other grid-computing devices:
-
accessing information describing a hierarchical schema for assembling multiple time series of data in a distributed manner that includes assembling multiple time series at the grid-computing device and other time series at the other grid-computing devices, wherein the hierarchical schema associates individual time series with a particular level of the hierarchical schema and prescribes a structure of nested relationships between time series assigned to different levels of the hierarchical schema; assembling multiple time series associated with a lowest level of the hierarchical schema by inventorying a portion of a data set; assembling multiple time series associated with an intermediate level of the hierarchical schema by aggregating the time series associated with the lowest level based on the structure of nested relationships, wherein the intermediate level is above the lowest level, and wherein; the data set is partitioned at the intermediate level of the hierarchical schema such that a first number (n) of partitions are defined, the n partitions including; a partition that includes the inventoried portion; and a second number (n−
1) of other partitions;the other grid-computing devices consist of n−
1 grid-computing devices; andeach of the other partitions is assigned to one of the other grid-computing devices; receiving multiple additional time series associated with the intermediate level and assembled by at least one of the other grid-computing devices; assembling a time series associated with a level of the hierarchical schema above the intermediate level by aggregating the assembled time series associated with the intermediate level and the multiple additional time series based on the structure of nested relationships; using volatile memory to store the time series associated with the level above the intermediate level; and accessing the stored time series in memory; and generating a forecast by processing the accessed time series. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A grid-computing device comprising:
-
a hardware processor configured to perform operations while the grid-computing device operates in a grid-computing system that includes other grid-computing devices, the operations including; accessing information describing a hierarchical schema for assembling multiple time series of data in a distributed manner that includes assembling multiple time series at the grid-computing device and other time series at the other grid-computing devices, wherein the hierarchical schema associates individual time series with a particular level of the hierarchical schema and prescribes a structure of nested relationships between time series assigned to different levels of the hierarchical schema; assembling multiple time series associated with a lowest level of the hierarchical schema by inventorying a portion of a data set; assembling multiple time series associated with an intermediate level of the hierarchical schema by aggregating the time series associated with the lowest level based on the structure of nested relationships, wherein the intermediate level is above the lowest level, and wherein; the data set is partitioned at the intermediate level of the hierarchical schema such that a first number (n) of partitions are defined, the n partitions including; a partition that includes the inventoried portion; and a second number (n−
1) of other partitions;the other grid-computing devices consist of n−
1 grid-computing devices; andeach of the other partitions is assigned to one of the other grid-computing devices; receiving multiple additional time series associated with the intermediate level and assembled by at least one of the other grid-computing devices; assembling a time series associated with a level of the hierarchical schema above the intermediate level by aggregating the assembled time series associated with the intermediate level and the multiple additional time series based on the structure of nested relationships; using volatile memory to store the time series associated with the level above the intermediate level; accessing the stored time series in memory; and generating a forecast by processing the accessed time series. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27)
-
Specification