Managing uncertain data using Monte Carlo techniques

  • US 9,063,987 B2
  • Filed: 06/13/2012
  • Issued: 06/23/2015
  • Est. Priority Date: 06/03/2009
  • Status: Expired due to Fees
First Claim

1. A computer implemented method comprising:

    specifying data uncertainty using at least one variable generation (VG) function wherein said VG function generates pseudorandom samples of uncertain data values;

    specifying a random database based on said VG function;

    generating, by a processor coupled to memory, a number of N Monte Carlo instantiations of said random database, wherein N is a number greater than 1;

    identifying a plurality of database tuple bundles t, wherein each of the plurality of database tuple bundles t is a data structure comprising a correlated tuple from each of the N Monte Carlo instantiations;

    representing the plurality of database tuple bundles t in a compressed form in which only pseudorandom numbers used to generate the uncertain data values are represented;

    expanding the plurality of database tuple bundles t represented in the compressed form to an expanded form, wherein the plurality of database tuple bundles t is represented in the expanded form when all instantiated attribute values are explicitly represented;

    executing, by a processor coupled to memory, a query Q over the N Monte Carlo instantiations, wherein said executing comprises;

    executing a query plan for the query Q once over each of the plurality of database tuple bundles; and

    outputting query-results, where zero or more numerical values that are used to estimate statistical properties of the probability distribution of the result of the query Q;

    maintaining a running statistical property of query-results as each query-result is determined, wherein each query-result corresponds to one or more of the N Monte Carlo instantiations; and

    after N query-results are determined, outputting the final value of the running statistical property to be the estimated statistical property of the probability distribution of the result of the query Q.
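
The steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `normal_vg`, `TupleBundle`, `RunningMean`, and `run_query`, the choice of a normal distribution, the use of one stored seed per bundle as the compressed form, and the SUM query are all assumptions made for illustration.

```python
import random
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple

N = 1000  # number of Monte Carlo instantiations; the claim requires N > 1


def normal_vg(rng: random.Random, mean: float, std: float) -> float:
    """Hypothetical VG function: draws one pseudorandom sample of an uncertain value."""
    return rng.gauss(mean, std)


@dataclass
class TupleBundle:
    """One tuple as it appears across all N instantiations of the random database."""
    key: str    # deterministic attribute, identical in every instantiation
    mean: float  # VG function parameters
    std: float
    seed: int   # compressed form (assumed): only the PRNG seed is stored
    values: Optional[List[float]] = None  # expanded form, filled in on demand

    def expand(self, n: int) -> List[float]:
        """Regenerate all n instantiated attribute values from the stored seed."""
        rng = random.Random(self.seed)
        self.values = [normal_vg(rng, self.mean, self.std) for _ in range(n)]
        return self.values


class RunningMean:
    """Running statistical property, updated as each query-result is determined."""

    def __init__(self) -> None:
        self.count = 0
        self.mean = 0.0

    def update(self, x: float) -> None:
        self.count += 1
        self.mean += (x - self.mean) / self.count


def run_query(bundles: List[TupleBundle], n: int,
              predicate: Callable[[str], bool]) -> Tuple[float, List[float]]:
    """Evaluate SUM(value) WHERE predicate(key) with one pass over the bundles.

    Each bundle is visited once, and that single visit contributes to all n
    instantiations, instead of running the query plan n separate times.
    """
    totals = [0.0] * n
    for bundle in bundles:
        if not predicate(bundle.key):
            continue
        for i, value in enumerate(bundle.expand(n)):
            totals[i] += value
    stat = RunningMean()
    for query_result in totals:  # one query-result per instantiation
        stat.update(query_result)
    return stat.mean, totals


# Illustrative data: uncertain revenue for three hypothetical regions.
bundles = [
    TupleBundle(key="east", mean=100.0, std=10.0, seed=1),
    TupleBundle(key="west", mean=50.0, std=5.0, seed=2),
    TupleBundle(key="north", mean=30.0, std=3.0, seed=3),
]
estimate, totals = run_query(bundles, N, lambda key: key != "north")
print(f"estimated mean of SUM(value): {estimate:.2f}")
```

Storing only the seed keeps each bundle small until a query operator needs the values, and because the generator is deterministic for a given seed, expanding a bundle again reproduces the same N samples.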
