Method of intelligent data analysis to detect abnormal use of utilities in buildings
First Claim
1. A method for determining abnormal consumption of a utility by a system:
- repeatedly measuring a level of use of the utility thereby producing a plurality of utility measurements;
employing a statistical procedure to identify a set of outliers in the plurality of utility measurements;
removing the set of outliers from the plurality of utility measurements to define a set of non-outliers; and
evaluating performance of the system in response to the set of outliers and the set of non-outliers by determining a severity of abnormal utility usage represented by each outlier in the set of outliers, wherein determining a severity of abnormal utility usage for each outlier comprises calculating how far each outlier is from a robust estimate value for utility usage determined from the set of non-outliers.
3 Assignments
0 Petitions
Accused Products
Abstract
Outlier identification is employed to detect abnormally high or low energy use in a building. The utility use is measured periodically throughout each day and the measurements are grouped according to days that have similar average utility consumption levels. The data in each group is statistically analyzed using the Generalized Extreme Studentized Deviate (GESD) method. That method identifies outliers which are data samples that vary significantly from the majority of the data. The degree to which each outlier deviates from the remainder of the data indicates the severity of the abnormal utility consumption denoted by that outlier. The resultant outlier information is readily discernable by the building operators in accessing whether the cause of a particular occurrence of abnormal utility usage requires further investigation.
-
Citations
17 Claims
-
1. A method for determining abnormal consumption of a utility by a system:
-
repeatedly measuring a level of use of the utility thereby producing a plurality of utility measurements;
employing a statistical procedure to identify a set of outliers in the plurality of utility measurements;
removing the set of outliers from the plurality of utility measurements to define a set of non-outliers; and
evaluating performance of the system in response to the set of outliers and the set of non-outliers by determining a severity of abnormal utility usage represented by each outlier in the set of outliers, wherein determining a severity of abnormal utility usage for each outlier comprises calculating how far each outlier is from a robust estimate value for utility usage determined from the set of non-outliers. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
where xej is the energy consumption for the jth outlier, {overscore (x)}robust is a robust estimate of the average energy consumption for days of the same day type as outlier j, and Srobust is a robust estimate of the standard deviation of energy consumption for days of the same day type.
-
-
3. The method as recited in claim 1 wherein determining a severity of abnormal utility usage comprises determining an amount that each outlier deviates from a mean of the plurality of utility measurements which are not identified as outliers.
-
4. The method as recited in claim 1 further comprising separating the plurality of utility measurements into groups wherein each group contains utility measurements acquired during days that under normal conditions have similar utility consumption levels;
- and
wherein the statistical procedure is applied separately to each group.
- and
-
5. The method as recited in claim 1 further comprising separating the plurality of utility measurements into groups wherein each group contains utility measurements acquired during predefined time periods that under normal conditions have similar utility consumption levels;
- and
wherein the statistical procedure is applied separately to each group.
- and
-
6. The method as recited in claim 1 wherein the outliers are identified using a Generalized Extreme Studentized Deviate (GESD) statistical procedure.
-
7. The method as recited in claim 6 wherein the Generalized Extreme Studentized Deviate (GESD) statistical procedure comprises:
-
(a) calculating an arithmetic mean ({overscore (x)}) of the plurality of utility measurements;
(b) finding an extreme utility measurement xe,i which is the utility measurement that has a value which is farther numerically from the arithmetic mean ({overscore (x)}) than the other ones of the plurality of utility measurements;
(c) using the extreme utility measurement xe,i to calculate an extreme studentized deviate Ri;
(d) calculating a 100α
percent critical value λ
i for the extreme utility measurement xe,i;
(e) declaring the extreme utility measurement xe,i to be an outlier when the extreme studentized deviate Ri is greater than the 100α
percent critical value λ
i.
-
-
8. The method as recited in claim 7 wherein the Generalized Extreme Studentized Deviate (GESD) statistical procedure further comprises:
-
removing the extreme utility measurement xe,i from the plurality of utility measurements to form a new plurality of utility measurements; and
repeating steps (a) through (e) for the new plurality of utility measurements.
-
-
9. The method as recited in claim 7 wherein the extreme studentized deviate Ri is calculated according to the expression:
-
where s is a standard deviation of the plurality of utility measurements.
-
-
10. The method as recited in claim 7 wherein the 100α
- percent critical value λ
i then is calculated using the equation;
where n is the number of utility measurements, i is a number identifying a particular outlier being evaluated, tn−
i−
1,p is a student'"'"'s t-distribution with (n−
i−
1) degrees of freedom, and p is a value based on the user defined probability α
of incorrectly declaring one or more outliers when no outliers exist.
- percent critical value λ
-
11. The method as recited in claim 10 wherein the percentile p is determined from:
-
12. A method for determining abnormal consumption of a utility by a system, comprising:
-
(a) repeatedly measuring the level of use of the utility, thereby producing a plurality of utility measurements;
(b) forming a group of those of the plurality of utility measurements taken during predefined periods of time;
(c) calculating an arithmetic mean ({overscore (x)}) of the group;
(d) finding an extreme utility measurement xe,i which is the utility measurement having a value that is farthest numerically from the arithmetic mean ({overscore (x)});
(e) using the extreme utility measurement xe,i to calculate an extreme studentized deviate Ri;
(f) calculating a 100α
percent critical value λ
i for the extreme utility measurement xe,i;
(g) declaring the extreme utility measurement xe,i to be an outlier indicative of abnormal utility use and adding the outlier to a set of outliers when the extreme studentized deviate is greater than the 100α
percent critical value;
(h) removing the extreme utility measurement xe,i from the group of utility measurements;
(i) repeating steps (c) through (h) a defined number of times;
(j) removing the set of outliers from the group of utility measurements to define a set of non-outliers; and
(k) evaluating performance of the system in response to the set of outliers and the set of non-outliers by determining a severity of abnormal utility usage represented by each outlier, wherein determining a severity of abnormal utility usage for each outlier comprises calculating how far each outlier is from a robust estimate value for utility usage determined from the set of non-outliers. - View Dependent Claims (13, 14, 15, 16, 17)
where s is a standard deviation of the utility measurements in the group.
-
-
14. The method as recited in claim 12 wherein the 100α
- percent critical value λ
i is calculated using the equation;
where n is the number of utility measurements, i is a number identifying a particular outlier being evaluated, tn−
i−
1,p is the student'"'"'s t-distribution with (n−
i−
1) degrees of freedom, and p is a value based on the user defined probability α
of incorrectly declaring one or more outliers when no outliers exist.
- percent critical value λ
-
15. The method as recited in claim 14 wherein the percentile p is determined from:
-
16. The method as recited in claim 12 further comprising defining periods of time during a plurality of days in which under normal conditions similar utility consumption levels occur during each one of those periods of time.
-
17. The method as recited in claim 12 further comprising performing maintenance on the system in response to examination of one or more of the outliers.
Specification