Handling data sets
First Claim
1. A computer implemented method comprising:
- determining a first characteristic for a first data set of a plurality of data elements, wherein the first characteristic includes a first set of metric values indicating properties for the plurality of data elements within the first data set;
comparing said first data set with at least one of a second data set of a plurality of data elements and a single data value by at least one of the following;
determining a second characteristic for the second data set, wherein the second characteristic includes a second set of metric values indicating properties for the plurality of data elements within the second data set, and calculating a similarity of said first data set with said second data set based on a comparison of said first and second characteristics; and
determining a third characteristic for the single data value, wherein the third characteristic includes a third set of metric values indicating properties for the single data value, and calculating a similarity of said first data set with said single data value based on a comparison of said first characteristic and said third characteristic.
1 Assignment
0 Petitions
Accused Products
Abstract
A method, system and computer program product provides a first characteristic associated with a first data set and a single data value, and a second characteristic associated with a second data set; and calculates at least one of: 1) the similarity of the first data set with the second data set based on the first and second characteristics, 2) the similarity of the first data set with the single data value based on the first characteristic and the single data value, 3) confidence indicating how well the first characteristic reflects properties of the first data set based on the first characteristic, and 4) confidence indicating how well the similarity of the first data set with the single data value reflects properties of the single data value based on the first characteristic and the single data value.
101 Citations
22 Claims
-
1. A computer implemented method comprising:
-
determining a first characteristic for a first data set of a plurality of data elements, wherein the first characteristic includes a first set of metric values indicating properties for the plurality of data elements within the first data set; comparing said first data set with at least one of a second data set of a plurality of data elements and a single data value by at least one of the following; determining a second characteristic for the second data set, wherein the second characteristic includes a second set of metric values indicating properties for the plurality of data elements within the second data set, and calculating a similarity of said first data set with said second data set based on a comparison of said first and second characteristics; and determining a third characteristic for the single data value, wherein the third characteristic includes a third set of metric values indicating properties for the single data value, and calculating a similarity of said first data set with said single data value based on a comparison of said first characteristic and said third characteristic. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A tangible computer program product comprising a computer readable storage medium having computer readable program code embodied therewith, the computer readable program code configured to:
-
determine a first characteristic for a first data set of a plurality of data elements, wherein the first characteristic includes a first set of metric values indicating properties for the plurality of data elements within the first data set; compare said first data set with at least one of a second data set of a plurality of data elements and a single data value by at least one of the following; determining a second characteristic for the second data set, wherein the second characteristic includes a second set of metric values indicating properties for the plurality of data elements within the second data set, and calculating a similarity of said first data set with said second data set based on a comparison of said first and second characteristics; and determining a third characteristic for the single data value, wherein the third characteristic includes a third set of metric values indicating properties for the single data value, and calculating a similarity of said first data set with said single data value based on a comparison of said first characteristic and said third characteristic. - View Dependent Claims (14, 15, 16, 17)
-
-
18. A system comprising a computer system including at least one processor configured to:
-
determine a first characteristic for a first data set of a plurality of data elements, wherein the first characteristic includes a first set of metric values indicating properties for the plurality of data elements within the first data set; compare said first data set with at least one of a second data set of a plurality of data elements and a single data value by at least one of the following; determining a second characteristic for the second data set, wherein the second characteristic includes a second set of metric values indicating properties for the plurality of data elements within the second data set, and calculating a similarity of said first data set with said second data set based on a comparison of said first and second characteristics; and determining a third characteristic for the single data value, wherein the third characteristic includes a third set of metric values indicating properties for the single data value, and calculating a similarity of said first data set with said single data value based on a comparison of said first characteristic and said third characteristic. - View Dependent Claims (19, 20, 21, 22)
-
Specification