Determining whether a variable is numeric or non-numeric
First Claim
Patent Images
1. A computer-implemented method comprising:
- inputting a variable having a plurality of values, each value having a count;
determining whether the variable is one of numeric and non-numeric by assessing closeness of counts for adjacent values of the variable; and
, outputting whether the variable is the one of numeric and non-numeric.
2 Assignments
0 Petitions
Accused Products
Abstract
Determination as to whether a variable is numeric or non-numeric. In one embodiment, a variable is input having a plurality of values, where each value has a count. The variable is determined to be numeric or non-numeric by assessing closeness of counts for adjacent values of the variable. Whether the variable is numeric or non-numeric is then output.
85 Citations
20 Claims
-
1. A computer-implemented method comprising:
-
inputting a variable having a plurality of values, each value having a count;
determining whether the variable is one of numeric and non-numeric by assessing closeness of counts for adjacent values of the variable; and
,outputting whether the variable is the one of numeric and non-numeric. - View Dependent Claims (2, 3, 4, 5, 6)
determining whether any value of the variable is non-numeric; and
,upon determining that any value of the variable is non-numeric, determining that the variable is non-numeric.
-
-
3. The method of claim 1, wherein determining whether the variable is one of numeric and non-numeric comprises:
-
determining whether no value of the variable has a non-numeric name;
upon determining that no value of the variable has a non-numeric name, determining whether a number of the plurality of values of the variable is greater than a predetermined threshold;
upon determining that the number of the plurality of values of the variable is greater than the predetermined threshold, determining that the variable is numeric.
-
-
4. The method of claim 1, wherein determining whether the variable is one of numeric and non-numeric comprises:
-
determining whether no value of the variable has a non-numeric name;
upon determining that no value of the variable has a non-numeric name, determining whether a number of the plurality of values of the variable is less than a predetermined threshold;
upon determining that the number of the plurality of values of the variable is less than the predetermined threshold, determining that the variable is non-numeric.
-
-
5. The method of claim 1, wherein determining whether the variable is one of numeric and non-numeric comprises:
-
determining whether no value of the variable has a non-numeric name;
upon determining that no value of the variable has a non-numeric name, determining whether a number of the plurality of values of the variable is greater than a first predetermined threshold and less than a second predetermined threshold;
upon determining that the number of the plurality of values of the variable is greater than the first predetermined threshold and less than the second predetermined threshold, determining whether the variable is the one of non-numeric and numeric by performing a statistical analysis of a histogram of the plurality of values.
-
-
6. The method of claim 5, wherein performing a statistical analysis comprises:
-
calculating a first sum of the counts of the values of the variable;
permutating randomly the counts of the values of the variable;
calculating a second sum of the counts of the values of the variable, as have been permutated randomly;
determining a mean and a standard deviation of the second sum; and
,determining whether the variable is the one of non-numeric and numeric based on the mean and the standard deviation of the second sum.
-
-
7. A machine-readable medium having instructions stored thereon for execution by a processor to perform a method comprising:
-
inputting a variable having a plurality of values, each value having a count;
determining whether the variable is one of numeric and non-numeric by assessing closeness of counts for adjacent values of the variable; and
,outputting whether the variable is the one of numeric and non-numeric. - View Dependent Claims (8, 9, 10, 11, 12)
determining whether any value of the variable has a non-numeric name; and
,upon determining that any value of the variable has a non-numeric name, determining that the variable is non-numeric.
-
-
9. The medium of claim 7, wherein determining whether the variable is one of numeric and non-numeric comprises:
-
determining whether no value of the variable has a non-numeric name;
upon determining that no value of the variable has a non-numeric name, determining whether a number of the plurality of values of the variable is greater than a predetermined threshold;
upon determining that the number of the plurality of values of the variable is greater than the predetermined threshold, determining that the variable is numeric.
-
-
10. The medium of claim 7, wherein determining whether the variable is one of numeric and non-numeric comprises:
-
determining whether no value of the variable has a non-numeric name;
upon determining that no value of the variable has a non-numeric name, determining whether a number of the plurality of values of the variable is less than a predetermined threshold;
upon determining that the number of the plurality of values of the variable is less than the predetermined threshold, determining that the variable is non-numeric.
-
-
11. The medium of claim 7, wherein determining whether the variable is one of numeric and non-numeric comprises:
-
determining whether no value of the variable has a non-numeric name;
upon determining that no value of the variable has a non-numeric name, determining whether a number of the plurality of values of the variable is greater than a first predetermined threshold and less than a second predetermined threshold;
upon determining that the number of the plurality of values of the variable is greater than the first predetermined threshold and less than the second predetermined threshold, determining whether the variable is the one of non-numeric and numeric by performing a statistical analysis of a histogram of the plurality of values.
-
-
12. The medium of claim 11, wherein performing a statistical analysis comprises:
-
calculating a first sum of the counts of the values of the variable;
permutating randomly the counts of the values of the variable;
calculating a second sum of the counts of the values of the variable, as have been permutated randomly;
determining a mean and a standard deviation of the second sum; and
,determining whether the variable is the one of non-numeric and numeric based on the mean and the standard deviation of the second sum.
-
-
13. A computerized system comprising:
-
a variable having a plurality of values, each value having a count;
a flag indicating whether the variable is one of numeric and non-numeric; and
,a determiner to set the flag based on the plurality of values of the variable, by assessing closeness of counts for adjacent values of the variable. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
determining whether any value of the variable has a non-numeric name; and
,upon determining that any value of the variable has a non-numeric name, determining that the variable is non-numeric.
-
-
15. The system of claim 14, wherein the method performed by the determiner further comprises:
-
determining whether no value of the variable has a non-numeric name;
upon determining that no value of the variable has a non-numeric name, determining whether a number of the plurality of values of the variable is greater than a predetermined threshold;
upon determining that the number of the plurality of values of the variable is greater than the predetermined threshold, determining that the variable is numeric.
-
-
16. The system of claim 14, wherein the method performed by the determiner further comprises:
-
determining whether no value of the variable has a non-numeric name;
upon determining that no value of the variable has a non-numeric name, determining whether a number of the plurality of values of the variable is less than a predetermined threshold;
upon determining that the number of the plurality of values of the variable is less than the predetermined threshold, determining that the variable is non-numeric.
-
-
17. The system of claim 14, wherein the method performed by the determiner further comprises:
-
determining whether no value of the variable has a non-numeric name;
upon determining that no value of the variable has a non-numeric name, determining whether a number of the plurality of values of the variable is greater than a first predetermined threshold and less than a second predetermined threshold;
upon determining that the number of the plurality of values of the variable is greater than the first predetermined threshold and less than the second predetermined threshold, determining whether the variable is the one of non-numeric and numeric by performing a statistical analysis of a histogram of the plurality of values.
-
-
18. The system of claim 17, wherein performing a statistical analysis comprises:
-
calculating a first sum of the counts of the values of the variable;
permutating randomly the counts of the values of the variable;
calculating a second sum of the counts of the values of the variable, as have been permutated randomly;
determining a mean and a standard deviation of the second sum; and
,determining whether the variable is the one of non-numeric and numeric based on the mean and the standard deviation of the second sum.
-
-
19. The system of claim 13, where each of the variable and the flag is stored as data on a computer-readable medium.
-
20. The system of claim 13, wherein the determiner comprises a computer program executable by a processor from a computer-readable medium.
Specification