Efficient Publication of Sparse Data
Abstract
The present disclosure is directed to systems, methods, and computer-readable storage media for publishing data. A data summary summarizing the data can be generated and published according to several publishing schemes. In some embodiments, non-zero entries are selected and modified, and zero entries are sampled according to one or more distribution functions. The sampled and modified values, or a sample of them, are added to a data summary. The data summary is then published, released, used, or otherwise output. In other embodiments, a priority value is assigned to each value associated with the data, and a number of entries with the highest priority values are selected and added to the data summary.
20 Claims
1. A computer-implemented method for publishing data, the method comprising computer-implemented operations at least for:
- obtaining data at a summarization engine, the data comprising a plurality of entries;
- modifying one of the plurality of entries to obtain a resulting value;
- determining if the resulting value satisfies a threshold;
- in response to determining that the resulting value satisfies the threshold, adding the resulting value to a data summary; and
- publishing the data summary.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
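As an illustration of claim 1, the following Python sketch walks through the recited steps. The claim does not say how an entry is modified or what it means for a value to satisfy the threshold. Here the modification is a caller-supplied function, and a resulting value is assumed to satisfy the threshold when its magnitude is at least the threshold. The names `summarize_with_threshold` and `publish` are hypothetical and do not come from the patent.

```python
from typing import Callable, Dict, Hashable, List, Tuple


def summarize_with_threshold(
    entries: Dict[Hashable, float],
    modify: Callable[[float], float],
    threshold: float,
) -> Dict[Hashable, float]:
    """Modify each entry and keep only resulting values that pass the threshold."""
    summary: Dict[Hashable, float] = {}
    for key, value in entries.items():
        result = modify(value)  # the "modifying" step; the function is an assumption
        # Assumption: "satisfies the threshold" means |result| >= threshold.
        if abs(result) >= threshold:
            summary[key] = result
    return summary


def publish(summary: Dict[Hashable, float]) -> List[Tuple[Hashable, float]]:
    """Hypothetical publishing step: emit the summary as sorted (key, value) pairs."""
    return sorted(summary.items())
```

A caller might pass a function that perturbs each value, for example `lambda v: v + rng.gauss(0.0, 1.0)` with `rng = random.Random()`. That choice is one possibility only, since the claim leaves the modification open.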
12. A computer-implemented method for publishing data, the method comprising computer-implemented operations at least for:
- obtaining data at a summarization engine, the data comprising a plurality of entries;
- modifying one of the plurality of entries to obtain a resulting value, the one of the plurality of values comprising a non-zero valued entry;
- adding the resulting value to a data summary;
- sampling a zero-valued entry from the plurality of entries;
- adding the zero-valued entry to the data summary; and
- publishing the data summary.
Dependent claims: 13, 14, 15, 16
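The steps of claim 12 can be sketched in the same way. This is a minimal sketch under stated assumptions: zero-valued entries are sampled independently with a fixed probability `zero_rate`, which is a hypothetical parameter standing in for whatever distribution function an embodiment uses, and `summarize_sparse` is a hypothetical name.

```python
import random
from typing import Callable, Dict, Hashable, Optional


def summarize_sparse(
    entries: Dict[Hashable, float],
    modify: Callable[[float], float],
    zero_rate: float,
    rng: Optional[random.Random] = None,
) -> Dict[Hashable, float]:
    """Modify every non-zero entry and add a random sample of the zero entries."""
    rng = rng or random.Random()
    summary: Dict[Hashable, float] = {}
    for key, value in entries.items():
        if value != 0:
            # Non-zero entries are modified and always added to the summary.
            summary[key] = modify(value)
        elif rng.random() < zero_rate:
            # Assumption: each zero entry is kept with probability zero_rate.
            summary[key] = value
    return summary
```

The sketch visits every entry for clarity. For data that is mostly zeros, an implementation would more likely draw the sampled zero positions directly instead of testing each one.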
17. A computer-implemented method for publishing data, the method comprising computer-implemented operations at least for:
- obtaining data at a summarization engine, the data comprising a plurality of entries;
- assigning a priority value to each of the plurality of entries;
- drawing a sample from the plurality of entries, the sample comprising a plurality of sampled entries;
- adding the plurality of sampled entries to a data summary; and
- publishing the data summary.
Dependent claims: 18, 19, 20
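Claim 17, read together with the abstract's description of selecting the entries with the highest priority values, can be sketched as below. The priority function is supplied by the caller because the claim does not define it. The helper `random_priority`, which divides the magnitude of a value by a uniform random number, is an assumed rule for illustration and is not taken from the patent. All function names are hypothetical.

```python
import heapq
import random
from typing import Callable, Dict, Hashable


def summarize_by_priority(
    entries: Dict[Hashable, float],
    priority: Callable[[Hashable, float], float],
    k: int,
) -> Dict[Hashable, float]:
    """Assign a priority to every entry and keep the k entries with the highest priority."""
    prioritized = [(priority(key, value), key, value) for key, value in entries.items()]
    # Compare on the priority only, so keys never need to be orderable.
    top = heapq.nlargest(k, prioritized, key=lambda item: item[0])
    return {key: value for _, key, value in top}


def random_priority(rng: random.Random) -> Callable[[Hashable, float], float]:
    """Assumed priority rule: |value| / u, with u drawn uniformly from (0, 1]."""
    def priority(key: Hashable, value: float) -> float:
        u = 1.0 - rng.random()  # random() is in [0, 1), so u is in (0, 1]
        return abs(value) / u
    return priority
```

For example, `summarize_by_priority(data, random_priority(random.Random()), k=100)` would draw a sample of at most 100 entries, favoring entries with larger magnitudes.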
Specification