Systems and methods for machine learning using classifying, clustering, and grouping time series data

  • US 10,169,720 B2
  • Filed: 12/16/2016
  • Issued: 01/01/2019
  • Est. Priority Date: 04/17/2014
  • Status: Active Grant
First Claim

1. A system for performing data mining and statistical learning techniques on a data set, the system comprising:

  • a processor; and

    a non-transitory computer-readable storage medium including instructions stored thereon, which when executed by the processor, cause the system to perform operations including:

    receiving a plurality of time series included in a prediction hierarchy for performing statistical learning to develop the prediction hierarchy, each individual time series of the plurality of time series comprising one or more need output characteristics and a need output pattern for an object, the one or more need output characteristics including at least one of a need output data, an intermittence, or a time period of a year, the need output pattern indicating one or more time intervals for which need output for the object is greater than a threshold amount;

    pre-processing data associated with each of the plurality of time series, wherein the pre-processing includes executing tasks in parallel using a grid-enabled computing environment, the tasks comprising, for each time series of the plurality of time series:

    determining a classification for the individual time series based on the one or more need output characteristics;

    determining a pattern group for each individual time series by comparing the need output pattern to need output patterns for other time series in the plurality of time series; and

    determining a level of the prediction hierarchy at which the each individual time series comprises a need output amount greater than the threshold amount, wherein determining the level further includes, for each time series in each level of the hierarchy and starting with a lowest level of the hierarchy:

    determining whether the individual time series includes a sufficient volume of data by determining whether the individual time series includes an amount of need output above the threshold amount; and

    based upon the determination, for each time series that does not include an amount of need output above the threshold amount, aggregating multiple time series from a particular level into a node that is one level higher than the particular level in the hierarchy;

    generating an additional prediction hierarchy using the prediction hierarchy, the classification, the pattern group, and the determined level, wherein utilizing the additional prediction hierarchy generates more accurate need output predictions than need output predictions generated utilizing the prediction hierarchy; and

    transmitting, to one or more nodes in the grid-enabled computing environment, prediction data related to at least one time series of the plurality of time series based on the additional prediction hierarchy.
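
The level-determination step recited above (test each series against the threshold starting at the lowest level, and roll low-volume series up into a node one level higher) can be illustrated with a minimal Python sketch. This is an illustration under stated assumptions, not the patented implementation: the function name `assign_levels`, the `parent_of` mapping, the use of a simple sum as the "amount of need output", and the assumption that all lowest-level series sit at the same depth and have equal length are all hypothetical choices made for this example.

```python
from collections import defaultdict


def assign_levels(leaf_series, parent_of, threshold):
    """Bottom-up roll-up of low-volume time series (illustrative only).

    leaf_series: dict mapping a lowest-level node name to a list of values.
    parent_of:   dict mapping a node name to its parent (the root has no entry).
    threshold:   hypothetical volume threshold, compared against the series sum.

    Returns a dict mapping each node at which a series is kept to that series.
    """
    current = dict(leaf_series)
    resolved = {}
    while current:
        rolled_up = defaultdict(list)
        for node, series in current.items():
            # Keep the series at this level if it has enough volume,
            # or if there is no higher level left to aggregate into.
            if sum(series) > threshold or node not in parent_of:
                resolved[node] = series
            else:
                rolled_up[parent_of[node]].append(series)
        # Aggregate the low-volume series period by period into their parents,
        # then repeat the check one level higher.
        current = {
            parent: [sum(values) for values in zip(*group)]
            for parent, group in rolled_up.items()
        }
    return resolved


if __name__ == "__main__":
    leaves = {"sku_a": [5, 5, 5], "sku_b": [1, 0, 1], "sku_c": [0, 1, 0]}
    parents = {"sku_a": "cat_1", "sku_b": "cat_1", "sku_c": "cat_1", "cat_1": "all"}
    print(assign_levels(leaves, parents, threshold=2))
```

In this sketch, a series that already exceeds the threshold stays at its own level, while sparse series are combined with their low-volume siblings until the combined series exceeds the threshold or the top of the hierarchy is reached.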
