
System, method and computer program for multi-dimensional temporal and relative data mining framework, analysis and sub-grouping

  • US 9,898,513 B2
  • Filed: 12/12/2012
  • Issued: 02/20/2018
  • Est. Priority Date: 12/12/2011
  • Status: Active Grant
First Claim

1. A computer implemented data mining method for controlling mining of data streams in a distributed computing environment configured to provide a distribution layer operable to maintain consistencies across multiple distributed computing systems when performing distributed data processing and analysis, wherein different attributes are associated with each of a plurality of data streams, the computer implemented data mining method comprising:

  • (a) using a central distribution computer system component to store and maintain consistency of a data mining framework configured to support data mining across the multiple distributed computing systems, the data mining framework including at least:

    (i) a series of temporal rules deployable to a subset of multiple distributed computing systems that are targets for a query; and

    (ii) relative rules adapted for relatively aligning time series multi-dimensional data based on at least one time point of interest, the central distribution computer system being configured for determining a subset of particular temporal rules that are applicable to the time series multi-dimensional data associated to a particular site, based on the different attributes associated with the data streams;

    (b) distributing, from the central distribution computer system to the multiple distributed computing systems, the series of temporal rules and the relative rules to be applied by each distributed computing system of the multiple distributed computing systems to pre-process the time series multi-dimensional data and to generate new temporally abstracted and relatively aligned time series data representing trends and patterns that include one or more indications of a potential future clinical event;

    (c) collecting and cleaning, at the multiple distributed computing systems, the time series multi-dimensional data, the time series multi-dimensional data obtained through one or more corresponding data streams of the plurality of data streams;

    (d) temporally abstracting, at the multiple distributed computing systems, the collected and cleaned time series multi-dimensional data by accessing and applying the applicable temporal rules so as to generate temporally abstracted time series multi-dimensional data categorized both on similarity and frequency, and relatively aligning the temporally abstracted time series multi-dimensional data based on at least one time point of interest by accessing and applying the applicable relative rules; and

    (e) collecting the temporally abstracted and relatively aligned time series multi-dimensional data from the multiple distributed computing systems to provide multi-dimensional, temporal, multi-site time series data for use in data mining operations.
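As a rough illustration of steps (a)(ii) and (d), the following minimal Python sketch selects the rules applicable to a stream from its attributes, abstracts raw samples into labelled intervals, and re-expresses those intervals relative to a time point of interest. It is not the patented implementation. The names `TemporalRule`, `select_rules`, `temporally_abstract` and `relatively_align`, the attribute keys, the sample values and the threshold of 85 are all hypothetical, and the sketch does not attempt the categorization on similarity and frequency recited in step (d).

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

Sample = Tuple[float, float]          # (timestamp, value)
Interval = Tuple[float, float, str]   # (start, end, abstract state)


@dataclass(frozen=True)
class TemporalRule:
    """Hypothetical rule: applies to streams whose attributes match, labels each value."""
    name: str
    applies_to: Dict[str, str]
    label: Callable[[float], str]


def select_rules(rules: List[TemporalRule], stream_attrs: Dict[str, str]) -> List[TemporalRule]:
    """Keep only the rules whose required attributes all match the stream's attributes."""
    return [
        r for r in rules
        if all(stream_attrs.get(k) == v for k, v in r.applies_to.items())
    ]


def temporally_abstract(samples: List[Sample], rule: TemporalRule) -> List[Interval]:
    """Merge consecutive samples that share a label into (start, end, label) intervals."""
    intervals: List[Interval] = []
    for t, v in sorted(samples):
        state = rule.label(v)
        if intervals and intervals[-1][2] == state:
            start, _, _ = intervals[-1]
            intervals[-1] = (start, t, state)   # extend the open interval
        else:
            intervals.append((t, t, state))     # start a new interval
    return intervals


def relatively_align(intervals: List[Interval], time_of_interest: float) -> List[Interval]:
    """Shift interval times so that the time point of interest becomes zero."""
    return [(s - time_of_interest, e - time_of_interest, state) for s, e, state in intervals]


# Illustrative data only: the threshold and attribute names are assumptions.
rules = [
    TemporalRule("spo2_level", {"signal": "SpO2"}, lambda v: "low" if v < 85 else "normal"),
    TemporalRule("hr_level", {"signal": "HR"}, lambda v: "high" if v > 180 else "normal"),
]
stream_attrs = {"signal": "SpO2", "site": "site_a"}
samples = [(0, 95), (10, 93), (20, 84), (30, 82), (40, 91)]

applicable = select_rules(rules, stream_attrs)
abstracted = temporally_abstract(samples, applicable[0])
aligned = relatively_align(abstracted, time_of_interest=30)
```

In this sketch each site would run `temporally_abstract` and `relatively_align` locally, and only the aligned intervals would be collected centrally, which loosely mirrors step (e).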
