Interactive visualization for outlier identification
First Claim
Patent Images
1. A computer program product, the computer program product being tangibly embodied on a non-transitory computer-readable storage medium and comprising instructions that, when executed, are configured to cause at least one computing device to:
- receive a user selection of columns of data tables to select data values, the selected data values representing measurements taken over time from at least one sensor;
render a graphical representation of a distribution of the selected data values, in which frequencies of the selected data values at corresponding included values of an available range of values are represented by extensions of the graphical representation in proportion to the frequencies and in parallel with a first axis, wherein the available range of values is illustrated along a second axis perpendicular to the first axis;
render extensions of the graphical representation in a first value range of the graphical representation with a first visual designation;
render extensions of the graphical representation in a second value range of the graphical representation with a second visual designation, wherein the first value range and the second value range are included in a total value range; and
render an outlier limit defined by a user input with respect to the total value range, wherein extensions of the graphical representation in an outlier portion of the graphical representation, corresponding to at least one selected data value that is designated as an outlier value by virtue of being outside of the outlier limit, are rendered with a third visual designation selected by the user input,wherein the first value range, the second value range and the outlier portion are rendered with differing visual designations.
1 Assignment
0 Petitions
Accused Products
Abstract
Techniques are provided for processing, visualizing, interpreting, and otherwise utilizing collected data. More particularly, collected data may be visually represented in an interactive manner, which allows a user, for example, to define and identify outlier data within a very large dataset. These results may be obtained through the use of a customizable, interactive visualization of the data, in which outliers and other aspects of the data are visually apparent.
-
Citations
20 Claims
-
1. A computer program product, the computer program product being tangibly embodied on a non-transitory computer-readable storage medium and comprising instructions that, when executed, are configured to cause at least one computing device to:
-
receive a user selection of columns of data tables to select data values, the selected data values representing measurements taken over time from at least one sensor; render a graphical representation of a distribution of the selected data values, in which frequencies of the selected data values at corresponding included values of an available range of values are represented by extensions of the graphical representation in proportion to the frequencies and in parallel with a first axis, wherein the available range of values is illustrated along a second axis perpendicular to the first axis; render extensions of the graphical representation in a first value range of the graphical representation with a first visual designation; render extensions of the graphical representation in a second value range of the graphical representation with a second visual designation, wherein the first value range and the second value range are included in a total value range; and render an outlier limit defined by a user input with respect to the total value range, wherein extensions of the graphical representation in an outlier portion of the graphical representation, corresponding to at least one selected data value that is designated as an outlier value by virtue of being outside of the outlier limit, are rendered with a third visual designation selected by the user input, wherein the first value range, the second value range and the outlier portion are rendered with differing visual designations. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer-implemented method, comprising:
-
receiving a user selection of data values from a hierarchical user interface displaying data schemas, data tables, and data columns, the selected data values representing measurements taken over time from at least one sensor; rendering a graphical representation of a distribution of the selected data values, in which frequencies of the selected data values at corresponding included values of an available range of values are represented by extensions of the graphical representation in proportion to the frequencies and in parallel with a first axis, wherein the available range of values is illustrated along a second axis perpendicular to the first axis; rendering extensions of the graphical representation in a first value range of the graphical representation with a first visual designation; rendering extensions of the graphical representation in a second value range of the graphical representation with a second visual designation, wherein the first value range and the second value range are included in a total value range; and rendering an outlier limit defined by a user input with respect to the total value range, wherein extensions of the graphical representation in an outlier portion of the graphical representation, corresponding to at least one selected data value that is designated as an outlier value by virtue of being outside of the outlier limit, are rendered with a third visual designation selected by the user input, wherein the first value range, the second value range and the outlier portion are rendered with differing visual designations. - View Dependent Claims (12, 13, 14, 15, 16)
-
-
17. A computer program product, the computer program product being tangibly embodied on a non-transitory computer-readable storage medium and comprising instructions that, when executed, are configured to cause at least one computing device to:
-
render a user interface configured to receive user selections in a hierarchical fashion, the user selections including a data schema, a data table, and a data column to define a dataset; calculate a count of each of a plurality of data values within the dataset; render a graphical representation of a distribution of the dataset, in which each count for each data value of the plurality of data values is represented by a relatively proportional visual extent of the graphical representation; calculate a total value range within a central portion of the graphical representation; calculate an outlier limit defined as a multiple of the total value range; render visual extents within the total value range with a total value identifying visual designation; render a visual extent representing at least one outlier value of the plurality of data values, the outlier value being outside of the outlier limit, with an outlier identifying visual designation differing from the total value identifying visual designation; receive, from a user interface, an update to the multiple of the total value range; calculate an updated outlier limit, based on the updated multiple; and render an updated graphical representation, including the updated outlier limit. - View Dependent Claims (18, 19, 20)
-
Specification