SYSTEM AND METHOD FOR PICK-AND-DROP SAMPLING
First Claim
1. A database system comprising:
- a database;
a database server configured to control reading data from and writing data to the database;
an input to the database server configured to deliver a data stream formed of a sequence of elements, D={p1, p2, . . . , pm} of size m of numbers from {1, . . . , n} to the database server;
a non-transitive, computer-readable storage medium, having stored thereon, a computer program that, when executed by a processor, causes the processor to approximate frequency moments (Fk) in the data stream, such that a frequency of an element (i) is defined as fi=|{j;
pj=i}| and a k-th frequency moment of D is defined as
3 Assignments
0 Petitions
Accused Products
Abstract
A database system includes an input to a database server configured to deliver a data stream formed of a sequence of elements, D={p1, p2, . . . , pm} of size m of numbers from {1, . . . , n} to the database server. The system further includes a computer program that causes a processor to approximate frequency moments (Fk) in the data stream, such that a frequency of an element (i) is defined as fi=|{j:pj=i}| and a k-th frequency moment of D is defined as
single pass through the data stream. The processor is caused to carry out the steps of locating elements (i) with a frequency ΩFk in the data stream as heavy elements and approximating fi as ≧ a fraction of fi to limit memory resources used by the processor to estimate Fk to O(n1−2/k log(n)) bits.
7 Citations
18 Claims
-
1. A database system comprising:
-
a database; a database server configured to control reading data from and writing data to the database; an input to the database server configured to deliver a data stream formed of a sequence of elements, D={p1, p2, . . . , pm} of size m of numbers from {1, . . . , n} to the database server; a non-transitive, computer-readable storage medium, having stored thereon, a computer program that, when executed by a processor, causes the processor to approximate frequency moments (Fk) in the data stream, such that a frequency of an element (i) is defined as fi=|{j;
pj=i}| and a k-th frequency moment of D is defined as - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method for approximating frequency moments (Fk)) in data streams formed of a sequence of elements, D={p1, p2, . . . , pm} of size m of numbers from {1, . . . , n}, such that a frequency of an element (i) is defined as fi=|{j:
- pj=i}| and a k-th frequency moment of D is defined as
- View Dependent Claims (7, 8, 9, 10, 11)
-
12. A database system comprising:
-
a database; a database server configured to control reading data from and writing data to the database; an input to the database server configured to deliver a data stream formed of a sequence of elements, D={p1, p2, . . . , pm} of size m of numbers from {1, . . . , n} to the database server; a non-transitive, computer-readable storage medium, having stored thereon, a computer program that, when executed by a processor, causes the processor to approximate frequency moments (Fk)) in the data stream, such that a frequency of an element (i) is defined as fi=|{j;
pj=i}| and a k-th frequency moment of D is defined as - View Dependent Claims (13, 14, 15, 16, 17, 18)
-
Specification