Outlier correction
Abstract
The invention provides methods and apparatus, including computer program products, for correcting outlier values in a series of values over time, the values representing a predetermined numerical parameter. For a predetermined time interval of the time series, the time interval having a beginning point and an end point, the method comprises: calculating an ex-post forecast time series (s_ep) on the basis of the historical time series (s_h) and a predetermined model function; defining a lower tolerance limit line and an upper tolerance limit line on the basis of the ex-post forecast time series and a quality function (qf) over the time interval; beginning at the beginning point of the time interval, correcting the historical time series by replacing the first value of the historical time series outside of the tolerance lines by a predetermined value not outside the tolerance lines; recalculating the ex-post forecast time series on the basis of the corrected historical time series and the predetermined model function; redefining the tolerance lines on the basis of the recalculated ex-post forecast time series and the quality function, whereby the quality function takes into account only values up to the time point where the last outlier value has been replaced; and repeating the correcting, recalculating, and redefining steps until all outlier values of the historical time series have been replaced.
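The first two steps of the abstract (ex-post forecast, then tolerance lines) can be illustrated with a minimal sketch. The source does not specify the model function, the quality function, or how the tolerance lines are derived from them. The choices below are assumptions made purely for illustration: simple exponential smoothing stands in for the model function, a mean absolute deviation stands in for the quality function, and the band is the forecast plus or minus a multiple `k` of that deviation. All function and parameter names are hypothetical.

```python
from typing import List, Sequence, Tuple


def ex_post_forecast(history: Sequence[float], alpha: float = 0.3) -> List[float]:
    """One-step-ahead ex-post forecast s_ep.

    ASSUMPTION: simple exponential smoothing stands in for the
    unspecified model function f.
    """
    forecast = [history[0]]  # no earlier data, so seed with the first value
    for value in history[:-1]:
        forecast.append(alpha * value + (1 - alpha) * forecast[-1])
    return forecast


def mean_abs_deviation(history: Sequence[float],
                       forecast: Sequence[float],
                       upto: int) -> float:
    """ASSUMED quality function qf: mean absolute deviation between
    history and forecast over the first `upto` points (upto >= 1)."""
    errors = [abs(h - f) for h, f in zip(history[:upto], forecast[:upto])]
    return sum(errors) / len(errors)


def tolerance_lines(forecast: Sequence[float],
                    qf_value: float,
                    k: float = 2.0) -> Tuple[List[float], List[float]]:
    """ASSUMED band: t_min and t_max lie k * qf below and above s_ep."""
    t_min = [f - k * qf_value for f in forecast]
    t_max = [f + k * qf_value for f in forecast]
    return t_min, t_max
```

With `history = [10, 10, 10, 50, 10, 10]` and `alpha=0.5`, the value 50 lies above the upper line at its position, so it would be the first candidate for replacement under these assumptions.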
11 Citations
10 Claims
1. Method for correcting outlier values in a series of values over time, the values representing a predetermined numerical parameter, the method comprising:
for a predetermined time interval of the time series, the time interval having a beginning point (b) and an end point (e),
a) calculating an ex-post forecast time series (s_ep) on the basis of the historical time series (s_h) and a predetermined model function (f);
b) defining a lower tolerance limit line (t_min) and an upper tolerance limit line (t_max) on the basis of the ex-post forecast time series and on a quality function (qf) over the time interval;
c) beginning at the beginning point (b) of the time interval, correcting the historical time series by replacing the first value of the historical time series (s_h) outside of the tolerance lines by a predetermined value not outside the tolerance lines (t_min, t_max);
d) recalculating the ex-post forecast time series (s_ep) on the basis of the corrected historical time series (s_h) and the predetermined model function; and redefining the tolerance lines on the basis of the recalculated ex-post forecast time series (s_ep) and the quality function, whereby the quality function (qf) takes only values into account until the time point where the last outlier value has been replaced;
e) repeating c) to e) until all outlier values of the historical time series (s_h) have been replaced.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
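Steps a) to e) of claim 1 can be sketched as a loop. This is one possible reading, not the patented implementation. The claim leaves the model function, the quality function, the form of the tolerance lines, and the replacement value open. As labeled assumptions, the sketch uses exponential smoothing for f, mean absolute deviation for qf, a band of `k` times qf around the forecast, and the forecast value itself as the replacement. It also resumes the scan after the last replaced point so that the loop is guaranteed to terminate, which is a simplification. All names are hypothetical.

```python
from typing import Callable, List, Sequence

Model = Callable[[Sequence[float]], List[float]]                # f: s_h -> s_ep
Quality = Callable[[Sequence[float], Sequence[float]], float]   # qf: (s_h, s_ep) -> spread


def smooth(history: Sequence[float], alpha: float = 0.5) -> List[float]:
    """ASSUMED model function f: one-step-ahead exponential smoothing."""
    forecast = [history[0]]
    for value in history[:-1]:
        forecast.append(alpha * value + (1 - alpha) * forecast[-1])
    return forecast


def mad(history: Sequence[float], forecast: Sequence[float]) -> float:
    """ASSUMED quality function qf: mean absolute deviation."""
    return sum(abs(h - f) for h, f in zip(history, forecast)) / len(history)


def correct_outliers(history: Sequence[float],
                     model: Model = smooth,
                     quality: Quality = mad,
                     k: float = 2.0) -> List[float]:
    s_h = list(history)              # work on a copy of the historical series
    if not s_h:
        return s_h
    s_ep = model(s_h)                # a) ex-post forecast
    width = k * quality(s_h, s_ep)   # b) band half-width; qf over the whole interval
    start = 0
    while True:
        # c) first value outside the tolerance lines, scanning forward
        idx = next((i for i in range(start, len(s_h))
                    if abs(s_h[i] - s_ep[i]) > width), None)
        if idx is None:              # e) stop once no outlier remains
            return s_h
        s_h[idx] = s_ep[idx]         # ASSUMED replacement: the forecast value
        # d) recalculate the forecast; qf only sees values up to the
        #    last replaced point when the band is redefined
        s_ep = model(s_h)
        width = k * quality(s_h[: idx + 1], s_ep[: idx + 1])
        start = idx + 1              # simplification: do not rescan earlier points
```

Calling `correct_outliers([10, 10, 10, 50, 10, 10])` replaces the spike and returns `[10, 10, 10, 10.0, 10, 10]`. On this toy series the band collapses to zero width after the correction, because the prefix up to the replaced point is fitted exactly. A practical variant would probably need a minimum band width to avoid flagging every later value in noisy data.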
9. A system comprising:
means for calculating an ex-post forecast time series (s_ep) on the basis of the historical time series (s_h) and a predetermined model function (f);
means for defining a lower tolerance limit line (t_min) and an upper tolerance limit line (t_max) on the basis of the ex-post forecast time series and on a quality function (qf) over the time interval;
means for, beginning at the beginning point (b) of the time interval, correcting the historical time series by replacing the first value of the historical time series (s_h) outside of the tolerance lines by a predetermined value not outside the tolerance lines (t_min, t_max); and
means for recalculating the ex-post forecast time series (s_ep) on the basis of the corrected historical time series (s_h) and the predetermined model function, and for redefining the tolerance lines on the basis of the recalculated ex-post forecast time series (s_ep) and the quality function, whereby the quality function (qf) takes only values into account until the time point where the last outlier value has been replaced.
10. A machine-accessible medium containing instructions that, when executed, cause a machine to:
for a predetermined time interval of the time series, the time interval having a beginning point (b) and an end point (e),
a) calculate an ex-post forecast time series (s_ep) on the basis of the historical time series (s_h) and a predetermined model function (f);
b) define a lower tolerance limit line (t_min) and an upper tolerance limit line (t_max) on the basis of the ex-post forecast time series and on a quality function (qf) over the time interval;
c) beginning at the beginning point (b) of the time interval, correct the historical time series by replacing the first value of the historical time series (s_h) outside of the tolerance lines by a predetermined value not outside the tolerance lines (t_min, t_max);
d) recalculate the ex-post forecast time series (s_ep) on the basis of the corrected historical time series (s_h) and the predetermined model function; and redefine the tolerance lines on the basis of the recalculated ex-post forecast time series (s_ep) and the quality function, whereby the quality function (qf) takes only values into account until the time point where the last outlier value has been replaced;
e) repeat c) to e) until all outlier values of the historical time series (s_h) have been replaced.
Specification