Systems for time-series predictive data analytics, and related methods and apparatus
First Claim
1. A predictive modeling method comprising:
- performing, with a processing device, a predictive modeling procedure, including;
(a) obtaining time-series data including one or more data sets, wherein each data set includes a plurality of observations, wherein each observation includes (1) an indication of a time associated with the observation and (2) respective values of one or more variables;
(b) identifying one or more of the variables as targets, and identifying zero or more other variables as features;
setting values of a plurality of adjustable parameters of the predictive modeling procedure, including;
(c) determining a value of a time interval parameter of the time-series data; and
(d) determining a value of a forecast range parameter and a value of a skip range parameter, wherein the forecast range parameter value indicates a duration of a period for which values of the targets are to be predicted, and wherein the skip range parameter value indicates a temporal lag between a time associated with an earliest prediction and a time associated with a latest observation upon which the earliest prediction is based;
segmenting the time-series data into training data and testing data based, at least in part, on the values of the forecast range parameter and the skip range parameter, including;
(e) generating the training data from the time-series data, wherein the training data include a first subset of the observations of at least one of the data sets, wherein the first subset of the observations includes training-input and training-output collections of the observations, wherein the times associated with the observations in the training-input and training-output collections correspond, respectively, to a training-input time range and a training-output time range, wherein the skip range parameter value separates an end of the training-input time range from a beginning of the training-output time range, and wherein a duration of the training-output time range is at least as long as the forecast range parameter value; and
(f) generating the testing data from the time-series data, wherein the testing data include a second subset of the observations of at least one of the data sets, wherein the second subset of the observations includes testing-input and testing-validation collections of the observations, wherein the times associated with the observations in the testing-input and testing-validation collections correspond, respectively, to a testing-input time range and a testing-validation time range, wherein the skip range parameter value separates an end of the testing-input time range from a beginning of the testing-validation time range, and wherein a duration of the testing-validation time range is at least as long as the forecast range parameter value; and
adapting a predictive model to solve a prediction problem represented by the time-series data and the values of the forecast range and skip range parameters, including;
(g) fitting the predictive model to the training data; and
(h) testing the fitted model on the testing data.
2 Assignments
0 Petitions
Accused Products
Abstract
A predictive modeling method may include determining a time interval of time-series data; identifying one or more variables of the data as targets; determining a forecast range and a skip range associated with a prediction problem represented by the data; generating training data and testing data from the time-series data; fitting a predictive model to the training data; and testing the fitted model on the testing data. The forecast range may indicate a duration of a period for which values of the targets are to be predicted. The skip range may indicate a temporal lag between the time period corresponding to the data used to make predictions and the time period corresponding to the predictions. The skip range may separate input data subsets representing model inputs from subsets representing model outputs, and separate test data subsets representing model inputs from subsets representing validation data.
92 Citations
109 Claims
-
1. A predictive modeling method comprising:
-
performing, with a processing device, a predictive modeling procedure, including; (a) obtaining time-series data including one or more data sets, wherein each data set includes a plurality of observations, wherein each observation includes (1) an indication of a time associated with the observation and (2) respective values of one or more variables; (b) identifying one or more of the variables as targets, and identifying zero or more other variables as features; setting values of a plurality of adjustable parameters of the predictive modeling procedure, including; (c) determining a value of a time interval parameter of the time-series data; and (d) determining a value of a forecast range parameter and a value of a skip range parameter, wherein the forecast range parameter value indicates a duration of a period for which values of the targets are to be predicted, and wherein the skip range parameter value indicates a temporal lag between a time associated with an earliest prediction and a time associated with a latest observation upon which the earliest prediction is based; segmenting the time-series data into training data and testing data based, at least in part, on the values of the forecast range parameter and the skip range parameter, including; (e) generating the training data from the time-series data, wherein the training data include a first subset of the observations of at least one of the data sets, wherein the first subset of the observations includes training-input and training-output collections of the observations, wherein the times associated with the observations in the training-input and training-output collections correspond, respectively, to a training-input time range and a training-output time range, wherein the skip range parameter value separates an end of the training-input time range from a beginning of the training-output time range, and wherein a duration of the training-output time range is at least as long as the forecast range parameter value; and (f) generating the testing data from the time-series data, wherein the testing data include a second subset of the observations of at least one of the data sets, wherein the second subset of the observations includes testing-input and testing-validation collections of the observations, wherein the times associated with the observations in the testing-input and testing-validation collections correspond, respectively, to a testing-input time range and a testing-validation time range, wherein the skip range parameter value separates an end of the testing-input time range from a beginning of the testing-validation time range, and wherein a duration of the testing-validation time range is at least as long as the forecast range parameter value; and adapting a predictive model to solve a prediction problem represented by the time-series data and the values of the forecast range and skip range parameters, including; (g) fitting the predictive model to the training data; and (h) testing the fitted model on the testing data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60)
-
-
61. A predictive modeling apparatus comprising:
-
a memory configured to store a machine-executable module encoding a predictive modeling procedure, wherein the predictive modeling procedure includes a plurality of tasks including at least one pre-processing task and at least one model-fitting task; and at least one processor configured to execute the machine-executable module, wherein executing the machine-executable module causes the apparatus to perform the predictive modeling procedure, including; performing the pre-processing task, including; (a) obtaining time-series data including one or more data sets, wherein each data set includes a plurality of observations, wherein each observation includes (1) an indication of a time associated with the observation and (2) respective values of one or more variables, (b) identifying one or more of the variables as targets, and identifying zero or more other variables as features, and setting values of a plurality of adjustable parameters of the predictive modeling procedure, including; (c) determining a value of a time interval parameter of the time-series data, and (d) determining a value of a forecast range and a value of a skip range parameter, wherein the forecast range parameter value indicates a duration of a period for which values of the targets are to be predicted, and wherein the skip range parameter value indicates a temporal lag between a time associated with an earliest prediction and a time associated with a latest observation upon which the earliest prediction is based; and performing the model-fitting task, including; segmenting the time-series data into training data and testing data based, at least in part, on the values of the forecast range parameter and the skip range parameter, including; (e) generating the training data from the time-series data, wherein the training data include a first subset of the observations of at least one of the data sets, wherein the first subset of the observations includes training-input and training-output collections of the observations, wherein the times associated with the observations in the training-input and training-output collections correspond, respectively, to a training-input time range and a training-output time range, wherein the skip range parameter value separates an end of the training-input time range from a beginning of the training-output time range, and wherein a duration of the training-output time range is at least as long as the forecast range parameter value, and (f) generating the testing data from the time-series data, wherein the testing data include a second subset of the observations of at least one of the data sets, wherein the second subset of the observations includes testing-input and testing-validation collections of the observations, wherein the times associated with the observations in the testing-input and testing-validation collections correspond, respectively, to a testing-input time range and a testing-validation time range, wherein the skip range parameter value separates an end of the testing-input time range from a beginning of the testing-validation time range, and wherein a duration of the testing-validation time range is at least as long as the forecast range parameter value, and adapting a predictive model to solve a prediction problem represented by the time-series data and the values of the forecast range and skip range parameters, including; (g) fitting the predictive model to the training data, and (h) testing the fitted model on the testing data. - View Dependent Claims (62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109)
-
Specification