×

Approximate order statistics of real numbers in generic data

  • US 8,756,262 B2
  • Filed: 03/01/2011
  • Issued: 06/17/2014
  • Est. Priority Date: 03/01/2011
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method for calculating approximate order statistics from a collection of floating point numbers from a digest in a network comprising:

  • receiving machine data, wherein the machine data includes a floating point number;

    extracting, using one or more processors, the floating point number from the machine data;

    determining, using the one or more processors, an ordinality of the floating point number, wherein the ordinality of each floating point number is determined by converting the floating point number to a mantissa and an exponent and subtracting a number of significant digits in the mantissa to the right of the decimal point including significant zeros from the exponent;

    identifying, using the one or more processors and based on the determined ordinality, a level from amongst a plurality of levels in the digest, the digest being stored in a non-transitory memory and including a plurality of buckets positioned along the plurality of levels, wherein each bucket of the plurality of buckets is;

    defined by the ordinality of the level along which it is positioned,further defined by a range limited by one or more extrema, andassociated with a count that reflects a quantity of floating point numbers;

    identifying, using the one or more processors, a bucket positioned at the identified level and being defined by a range that is inclusive of the floating point number;

    incrementing, using the one or more processors, the count of the identified bucket, wherein the identified bucket, for which the count was incremented, has a plurality of child buckets in the digest, wherein the digest is configured to be used to generate a response to a query based on the incremented count of the bucket;

    identifying, using the one or more processors, a set of buckets based on a query value in the query, wherein the set of buckets includes the identified bucket; and

    estimating, using the one or more processors, an order statistic for the query value based on a summation of counts associated with the identified set of buckets.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×