Detecting electricity theft via meter tampering using statistical methods
First Claim
1. A system for detecting anomalous energy usage amongst building and household entities, said system comprising:
- a data storage device;
a processor unit coupled to the data storage device configured to perform a method to;
receive, at a computing system, data comprising energy usage data relating to a building'"'"'s actual energy use over a defined time period (shift), characteristics data of the building, and weather data over one or more defined time periods, said building characteristics data comprising one or more selected from the group consisting of;
a physical size of the building, a number of floors, a number of occupants, building age, a number of bedrooms, a number of bathrooms, a type of heating fuel in use, a type of air conditioning in use, and a latitude and longitude coordinate of the building;
cluster buildings in one or more clusters as determined based on a building'"'"'s energy usage in each time period;
identify buildings having energy usage that migrate from one cluster to another cluster across time of day shifts, wherein to identify buildings, said processor unit is further configured to;
represent a series of data processing operations as a series of data processing nodes, a node representing a usage data source for obtaining said energy usage data for a building;
perform, from said node, concurrent data processing operations as parallel paths according to a respective shift, each parallel path comprising a flow of operations comprising a linking of said node, a node representing a data aggregation operation for that shift and a node representing clustering operations for that shift; and
merge data operations at each parallel path to obtain a list of energy use migrations for that building;
generate a model to predict a building'"'"'s energy usage, said model defining expected bounds of energy consumption given a time period (shift) and said weather data and building characteristics data received;
compare energy usage for each building against an energy use predicted by the model for said building; and
identify, from said comparison, buildings whose electricity usage is not predicted by the model, wherein to identify buildings, said processor unit is further configured to;
perform, from said node representing a usage data source for a building, concurrent data processing operations as parallel paths according to a respective shift, each parallel path comprising a flow of operations comprising a linking of said node, a further node for generating non-linear regression models of energy use data aggregation operation for that shift, a further node of operations for detecting anomalous buildings for that shift, and a node of operations to rank a building'"'"'s anomalous energy usage based on statistics computed for the number of times a household'"'"'s usage is flagged as anomalous; and
merge data operations at each parallel path to obtain a list of top buildings having anomalous energy usage;
wherein said buildings identified as migrating from one cluster to another cluster between time periods, and said buildings exhibiting electricity usage not predicted by said generated model are flagged as anomalous energy usage entities.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method for detecting anomalous energy usage of building or household entities. The method applies a number of successively stringent anomaly detection techniques to isolate households that are highly suspect for having engaged in electricity theft via meter tampering. The system utilizes historical time series data of electricity usage, weather, and household characteristics (e.g., size, age, value) and provides a list of households that are worthy of a formal theft investigation. Generally, raw utility usage data, weather history data, and household characteristics are cleansed, and loaded into an analytics data mart. The data mart feeds four classes of anomaly detection algorithms developed, with each analytic producing a set of households suspected of having engaged in electricity theft. The system allows a user to select households from each list or a set based on the intersection of all individual sets.
-
Citations
13 Claims
-
1. A system for detecting anomalous energy usage amongst building and household entities, said system comprising:
-
a data storage device; a processor unit coupled to the data storage device configured to perform a method to; receive, at a computing system, data comprising energy usage data relating to a building'"'"'s actual energy use over a defined time period (shift), characteristics data of the building, and weather data over one or more defined time periods, said building characteristics data comprising one or more selected from the group consisting of;
a physical size of the building, a number of floors, a number of occupants, building age, a number of bedrooms, a number of bathrooms, a type of heating fuel in use, a type of air conditioning in use, and a latitude and longitude coordinate of the building;cluster buildings in one or more clusters as determined based on a building'"'"'s energy usage in each time period; identify buildings having energy usage that migrate from one cluster to another cluster across time of day shifts, wherein to identify buildings, said processor unit is further configured to; represent a series of data processing operations as a series of data processing nodes, a node representing a usage data source for obtaining said energy usage data for a building; perform, from said node, concurrent data processing operations as parallel paths according to a respective shift, each parallel path comprising a flow of operations comprising a linking of said node, a node representing a data aggregation operation for that shift and a node representing clustering operations for that shift; and merge data operations at each parallel path to obtain a list of energy use migrations for that building; generate a model to predict a building'"'"'s energy usage, said model defining expected bounds of energy consumption given a time period (shift) and said weather data and building characteristics data received; compare energy usage for each building against an energy use predicted by the model for said building; and identify, from said comparison, buildings whose electricity usage is not predicted by the model, wherein to identify buildings, said processor unit is further configured to; perform, from said node representing a usage data source for a building, concurrent data processing operations as parallel paths according to a respective shift, each parallel path comprising a flow of operations comprising a linking of said node, a further node for generating non-linear regression models of energy use data aggregation operation for that shift, a further node of operations for detecting anomalous buildings for that shift, and a node of operations to rank a building'"'"'s anomalous energy usage based on statistics computed for the number of times a household'"'"'s usage is flagged as anomalous; and merge data operations at each parallel path to obtain a list of top buildings having anomalous energy usage; wherein said buildings identified as migrating from one cluster to another cluster between time periods, and said buildings exhibiting electricity usage not predicted by said generated model are flagged as anomalous energy usage entities. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer program product for detecting anomalous energy usage amongst building and household entities, the computer program product comprising a tangible non-transitory storage medium, said medium readable by a processing circuit and storing instructions run by the processing circuit for performing a method, the method comprising:
-
receiving, at a computing system, data comprising energy usage data relating to a building'"'"'s actual energy use over a defined time period (shift), characteristics data of the building, and weather data over one or more defined time periods, said building characteristics data comprising one or more selected from the group consisting of;
a physical size of the building, a number of floors, a number of occupants, building age, a number of bedrooms, a number of bathrooms, a type of heating fuel in use, a type of air conditioning in use, and a latitude and longitude coordinate of the building;clustering buildings in one or more clusters as determined based on a building'"'"'s energy usage in each time period; identifying buildings having energy usage that migrate from one cluster to another cluster across time of day shifts, said identifying comprising; representing a series of data processing operations as a series of data processing nodes, a node representing a usage data source for obtaining said energy usage data for a building; performing, from said node, concurrent data processing operations as parallel paths according to a respective shift, each parallel path comprising a flow of operations comprising a linking of said node, a second representing a data aggregation operation for that shift and a node representing clustering operations for that shift; and merging data operations at each parallel path to obtain a list of energy use migrations for that building; generating a model to predict a building'"'"'s energy usage, said model defining expected bounds of energy consumption given a time period (shift) and weather and building characteristics data received; comparing energy usage for each building against an energy use predicted by the model for said building; and identifying, from said comparison, buildings whose electricity usage is not predicted by the model, said identifying comprising; performing, from said node representing a usage data source for a building, concurrent data processing operations as parallel paths according to a respective shift, each parallel path comprising a flow of operations comprising a linking of said node, a node for generating non-linear regression models of energy use data aggregation operation for that shift, a node of operations for detecting anomalous buildings for that shift, and a node of operations to rank a building'"'"'s anomalous energy usage based on statistics computed for the number of times a household'"'"'s usage is flagged as anomalous; and merging data operations at each parallel path to obtain a list of top buildings having anomalous energy usage; wherein said buildings identified as migrating from one cluster to another cluster between time periods, and said buildings exhibiting electricity usage not predicted by said generated model are flagged as anomalous energy usage entities. - View Dependent Claims (9, 10, 11, 12, 13)
-
Specification