×

Distributed processing of streaming data records

  • US 8,738,649 B2
  • Filed: 03/15/2013
  • Issued: 05/27/2014
  • Est. Priority Date: 05/22/2012
  • Status: Active Grant
First Claim
Patent Images

1. A method of providing interactive information related to streaming data records using an OLAP cube, the method comprising steps of:

  • (i) sending, to a client in response to a request from said client, information related to a first set of predetermined dimensions stored in the OLAP cube, the first set of predetermined dimensions being derived from information in a first subset of the streaming data records;

    (ii) receiving a new request from the client, the new request comprising a second set of predetermined dimensions having at least one dimension not present in the first set of predetermined dimensions;

    (iii) parsing information in the streaming data records to identify a second subset of information relevant to the second set of predetermined dimensions;

    (iv) receiving only the second subset of the information in the streaming data records at a plurality of distributed computational nodes, each node comprising a processor and a storage element;

    (v) converting, at each computational node, a portion of the subset of the information in the received streaming data records into key-value pairs;

    (vi) parsing, at each computational node, the converted key-value pairs of the subset of the information in the received streaming data records received at each said computational node to (i) identify matches of the keys to at least one predetermined dimension of the second set of predetermined dimensions and (ii) based thereon, combine the key-value pairs having identical keys;

    (vii) re-distributing the keys of the converted subset of the received streaming data records among the distributed computational nodes in accordance with the second set of predetermined dimensions stored on the computational nodes, wherein each distributed computational node receives the key corresponding to one of the second set of predetermined dimensions, thereby reducing a size of the portion of the subset of information in the received streaming data records received at each computational node;

    (viii) updating the OLAP cube to delete measures associated with only the first set of predetermined dimensions and to store measures associated with the second set of predetermined dimensions by collecting data from the computational nodes; and

    (ix) sending, to the client in response to a request from said client, information related to the second set of predetermined dimensions.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×