Methods and systems for analyzing weirdness of variables
First Claim
1. A computer-based method of determining a weirdness score for variables within a data set, said method performed using a computer coupled to a database, said method comprising:
- receiving a selection of a first variable included within the data set, wherein the first variable is defined by a measure, a time period, and a plurality of entities;
calculating a plurality of parameters for the first variable, wherein each of the plurality of parameters is calculated based at least in part on a deviation of a measured value of the first variable from a predicted value of the first variable;
calculating a rank for each of the plurality of parameters for the first variable by;
identifying all other variables in the data set that have the same measure as the first variable and the same time period as the first variable;
identifying, for each of the other variables, a plurality of parameters that correspond to the plurality of parameters for the first variable; and
for each parameter of the plurality of parameters for the first variable, ranking the parameter for the first variable relative to the corresponding parameters for the other variables; and
calculating a weirdness score for the first variable, wherein the weirdness score is indicative of how unexpected the measured value of the first variable is, and wherein the weirdness score depends on the calculated rank of each of the plurality of parameters for the first variable.
1 Assignment
0 Petitions
Accused Products
Abstract
A computer-based method of determining a weirdness score for variables within a data set is provided. The method includes receiving a selection of a first variable, wherein the first variable is defined by a measure, a time period, and a plurality of entities, calculating a plurality of parameters for the first variable, wherein each of the plurality of parameters is calculated based at least in part on a deviation of a measured value from a predicted value, calculating a rank for each of the plurality of parameters for the first variable, wherein the rank of each parameter is calculated relative to corresponding parameters calculated for all other variables in the data set having the same measure and time period as the first variable, and calculating a weirdness score for the first variable based at least in part on the calculated rank of each of the plurality of parameters.
-
Citations
28 Claims
-
1. A computer-based method of determining a weirdness score for variables within a data set, said method performed using a computer coupled to a database, said method comprising:
-
receiving a selection of a first variable included within the data set, wherein the first variable is defined by a measure, a time period, and a plurality of entities; calculating a plurality of parameters for the first variable, wherein each of the plurality of parameters is calculated based at least in part on a deviation of a measured value of the first variable from a predicted value of the first variable; calculating a rank for each of the plurality of parameters for the first variable by; identifying all other variables in the data set that have the same measure as the first variable and the same time period as the first variable; identifying, for each of the other variables, a plurality of parameters that correspond to the plurality of parameters for the first variable; and for each parameter of the plurality of parameters for the first variable, ranking the parameter for the first variable relative to the corresponding parameters for the other variables; and calculating a weirdness score for the first variable, wherein the weirdness score is indicative of how unexpected the measured value of the first variable is, and wherein the weirdness score depends on the calculated rank of each of the plurality of parameters for the first variable. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer system for determining a weirdness score for variables within a data set, said computer system comprising:
-
a memory device for storing data; and a computing device comprising a processor, said computing device coupled to said memory device, said computing device configured to; receive a selection of a first variable included within the data set, wherein the first variable is defined by a measure, a time period, and a plurality of entities; calculate a plurality of parameters for the first variable, wherein each of the plurality of parameters is calculated based at least in part on a deviation of a measured value of the first variable from a predicted value of the first variable; calculate a rank for each of the plurality of parameters for the first variable by; identifying all other variables in the data set that have the same measure as the first variable and the same time period as the first variable; identifying, for each of the other variables, a plurality of parameters that correspond to the plurality of parameters for the first variable; and for each parameter of the plurality of parameters for the first variable, ranking the parameter for the first variable relative to the corresponding parameters for the other variables; and calculate a weirdness score for the first variable, wherein the weirdness score is indicative of how unexpected the measured value of the first variable is, and wherein the weirdness score depends on the calculated rank of each of the plurality of parameters for the first variable. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A computer program embodied on a non-transitory computer readable medium for determining a weirdness score for variables within a data set, said program comprises at least one code segment executable by a computer to instruct the computer to:
-
receive a selection of a first variable included within the data set, wherein the first variable is defined by a measure, a time period, and a plurality of entities; calculate a plurality of parameters for the first variable, wherein each of the plurality of parameters is calculated based at least in part on a deviation of a measured value of the first variable from a predicted value of the first variable; calculate a rank for each of the plurality of parameters for the first variable by; identifying all other variables in the data set that have the same measure as the first variable and the same time period as the first variable; identifying, for each of the other variables, a plurality of parameters that correspond to the plurality of parameters for the first variable; and for each parameter of the plurality of parameters for the first variable, ranking the parameter for the first variable relative to the corresponding parameters for the other variables; and calculate a weirdness score for the first variable, wherein the weirdness score is indicative of how unexpected the measured value of the first variable is, and wherein the weirdness score depends on the calculated rank of each of the plurality of parameters for the first variable. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27)
-
-
28. A network-based system for determining a weirdness score for variables within a data set, said system comprising:
-
a client computer system; a database; and a server system coupled to said client computer system and said database, said server system configured to; receive from said client computer system a selection of a first variable included within the data set, wherein the first variable is defined by a measure, a time period, and a plurality of entities; calculate a plurality of parameters for the first variable, wherein each of the plurality of parameters is calculated based at least in part on a deviation of a measured value of the first variable from a predicted value of the first variable; calculate a rank for each of the plurality of parameters for the first variable by; identifying all other variables in the data set that have the same measure as the first variable and the same time period as the first variable; identifying, for each of the other variables, a plurality of parameters that correspond to the plurality of parameters for the first variable; and for each parameter of the plurality of parameters for the first variable, ranking the parameter for the first variable relative to the corresponding parameters for the other variables; and calculate a weirdness score for the first variable, wherein the weirdness score is indicative of how unexpected the measured value of the first variable is, and wherein the weirdness score depends on the calculated rank of each of the plurality of parameters for the first variable.
-
Specification