SYSTEM AND METHODS FOR ADDRESSING DATA QUALITY ISSUES IN INDUSTRIAL DATA
Abstract
Embodiments allow data cleaning of industrial data gathered from at least one sensor. The data cleaning utilizes a workflow that defines at least one cleaning step to be performed. Each cleaning step comprises detecting defects based on at least one constraint such as various models and/or statistics. Potential defects are presented to a user for feedback. The data is cleaned based on the feedback. Multiple copies of the data are stored to track all the various cleaning choices. All choices can be rolled back at will so that cleaning decisions made can be eliminated and different choices applied. Intermediate data is captured to allow reporting and auditing of the cleaning process.
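The detect, review, clean, and roll-back cycle described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the implementation the patent discloses: the names `CleaningSession` and `Change`, the z-score threshold, and the mean-fill repair are all hypothetical choices, and the list passed to `clean` stands in for feedback that would come from a user interface.

```python
from dataclasses import dataclass, field
from statistics import mean, stdev


@dataclass
class Change:
    """One reversible edit: which sample changed, plus its old and new values."""
    index: int
    old: float
    new: float


@dataclass
class CleaningSession:
    """Hypothetical holder for a working series and a log of reversible changes."""
    values: list
    log: list = field(default_factory=list)

    def detect(self, z_limit=2.0):
        """Flag indices whose z-score exceeds z_limit (a stand-in statistical constraint)."""
        mu, sigma = mean(self.values), stdev(self.values)
        if sigma == 0:
            return []
        return [i for i, v in enumerate(self.values) if abs(v - mu) / sigma > z_limit]

    def clean(self, accepted):
        """Replace each accepted defect with the mean of the samples not flagged."""
        flagged = set(accepted)
        good = [v for i, v in enumerate(self.values) if i not in flagged]
        fill = mean(good)
        for i in sorted(flagged):
            # Record the old value before overwriting it so the edit can be undone.
            self.log.append(Change(i, self.values[i], fill))
            self.values[i] = fill

    def rollback(self, count=None):
        """Undo the last `count` changes, or every change when count is None."""
        n = len(self.log) if count is None else min(count, len(self.log))
        for _ in range(n):
            change = self.log.pop()
            self.values[change.index] = change.old


if __name__ == "__main__":
    session = CleaningSession([20.1, 20.3, 19.9, 95.0, 20.2, 20.0, 20.4, 19.8])
    candidates = session.detect()   # potential defects to show the user
    session.clean(candidates)       # here the user is assumed to accept all of them
    session.rollback()              # restores the original series
```

Logging each change with its prior value is one simple way to support partial reversal; the abstract's alternative of storing multiple copies of the data would trade storage for simpler auditing.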
20 Claims
1. A device comprising at least one hardware implemented module configured to at least:
  retrieve time series data measured from at least one sensor;
  retrieve proximity data about the at least one sensor, the proximity data comprising:
    sensor ID metadata for the at least one sensor; or
    sensor environment metadata for the at least one sensor; or
    both the sensor ID metadata for the at least one sensor and the sensor environment metadata for the at least one sensor;
  detect a first set of defects in the retrieved time series data using constraints based upon at least one of:
    at least one stored model of the at least one sensor, a location for the at least one sensor, or both; or
    a statistical model; or
    other constraints that relate to the time series data or proximity data;
  present, via a user interface (UI), information relating to the first set of defects and receive, via the UI, feedback about the first set of defects;
  clean the time series data based on the feedback;
  capture information to allow reversal of changes in whole or in part made to the time series data based on the feedback.
Dependent claims: 2, 3, 4, 5, 6, 7

8. A method performed by a device to clean time series data, the method comprising:
  retrieving time series data measured from at least one sensor;
  retrieving proximity data about the at least one sensor, the proximity data comprising sensor ID metadata or sensor environment metadata or both; and
  performing at least one cleaning operation, each cleaning operation comprising:
    detecting a set of defects in the retrieved time series data using at least one constraint based on any combination of:
      the proximity data;
      a statistical model;
      a model relating to the proximity data; or
      a characteristic of the time series data or proximity data;
    presenting, via a user interface (UI), information relating to the detected defects and receiving, via the UI, feedback about the first set of defects;
    cleaning the time series data based on the feedback; and
    capturing information to allow reversal of changes in whole or in part made to the time series data.
Dependent claims: 9, 10, 11, 12

13. A computer storage medium comprising computer executable instructions that when executed configure a device to at least:
  retrieve time series data measured from at least one sensor;
  retrieve proximity data about the at least one sensor, the proximity data comprising sensor ID metadata or sensor environment metadata or both; and
  perform at least one cleaning operation, each cleaning operation configuring the device to at least:
    detect a set of defects in the retrieved time series data using at least one constraint based on any combination of:
      the proximity data;
      a statistical model;
      a model relating to the proximity data; or
      a characteristic of the time series data or proximity data;
    present, via a user interface (UI), information relating to the detected defects and receive, via the UI, feedback about the first set of defects;
    clean the time series data based on the feedback; and
    capture information to allow reversal of changes in whole or in part made to the time series data.
Dependent claims: 14, 15, 16, 17, 18, 19, 20

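The detection element recited in the independent claims, in which constraints drawn from proximity data are combined with constraints drawn from the time series itself, might look something like the sketch below. Everything here is an assumption made for illustration: `ProximityData` and its fields, the range check, and the step-size check are hypothetical and are not taken from the specification.

```python
from dataclasses import dataclass


@dataclass
class ProximityData:
    """Hypothetical container for sensor ID and sensor environment metadata."""
    sensor_id: str
    min_expected: float  # assumed lower bound implied by the sensor's environment
    max_expected: float  # assumed upper bound implied by the sensor's environment


def range_constraint(series, prox):
    """Indices of samples outside the bounds implied by the proximity data."""
    return {i for i, v in enumerate(series)
            if not (prox.min_expected <= v <= prox.max_expected)}


def jump_constraint(series, max_step):
    """Indices of samples that differ from the previous sample by more than max_step."""
    return {i for i in range(1, len(series))
            if abs(series[i] - series[i - 1]) > max_step}


def detect_defects(series, prox, max_step):
    """Union of all constraint violations, returned as a sorted list of indices."""
    return sorted(range_constraint(series, prox) | jump_constraint(series, max_step))


if __name__ == "__main__":
    prox = ProximityData(sensor_id="T-101", min_expected=0.0, max_expected=50.0)
    readings = [21.0, 21.2, -40.0, 21.1, 21.3, 27.9, 21.2]
    print(detect_defects(readings, prox, max_step=5.0))
```

Note that the step check also flags the sample that follows a spike, because the return to normal is itself a large jump. That is one reason candidate defects would be shown to a user for feedback rather than cleaned automatically.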
Specification