DISTRIBUTED STORAGE OF AGGREGATED DATA
0 Assignments
0 Petitions
Accused Products
Abstract
Techniques are described for managing aggregation of data in a distributed manner, such as for a particular client based on specified configuration information. The described techniques may include storing aggregated data values for an OLAP cube or other data structure in a distributed manner, such as in some situations in a distributed hash table. The aggregated data values to be stored may be generated in various manners, such as by performing multi-stage data manipulation operations—for example, a map-reduce architecture may be used, with a first stage involving the use of one or more specified map functions to be performed, and with at least a second stage involving the use of one or more specified reduce functions to be performed.
46 Citations
29 Claims
-
1-20. -20. (canceled)
-
21. A non-transitory computer-readable medium whose contents configure one or more computing systems to perform a method, the method comprising:
-
receiving a plurality of aggregated data values, wherein each of the aggregated data values is associated with a combination of multiple dimension category values for multiple dimensions; and initiating storing the plurality of aggregated data values in a key-value storage structure on multiple storage nodes by, for each of the plurality of aggregated data values; determining a value associated with the aggregated data value for use as a key for the aggregated data value, the determined value being based at least in part on the combination of multiple dimension category values associated with the aggregated data value; determining a storage location within the key-value storage structure by using the determined value as input to a hash function, the determined storage location being part of a subset of the key-value storage structure on one of the multiple storage nodes; and providing the aggregated data value for storage in the determined storage location on the one storage node. - View Dependent Claims (22, 23, 24)
-
-
25. A system, comprising:
-
one or more processors of one or more computing systems; and a data aggregation system that is configured to, when executed by at least one of the one or more processors, store a plurality of aggregated data values for an OLAP (“
online analytical processing”
) cube having multiple dimensions, the storing of the plurality of aggregated data values including, for each of the plurality of aggregated data values;determining a value associated with the aggregated data value for use as a key for the aggregated data value, the determined value being based at least in part on a combination of multiple dimension category values that are associated with the aggregated data value and that correspond to the multiple dimensions; determining a storage location within a key-value storage structure stored across multiple storage nodes by using the determined value as input to a hash function, the determined storage location being part of a subset of the key-value storage structure on one of the multiple storage nodes; and providing the aggregated data value for storage in the determined storage location on the one storage node as part of the OLAP cube. - View Dependent Claims (26, 27, 28, 29)
-
Specification