Fuzzy Data Operations
3 Assignments
0 Petitions
Accused Products
Abstract
A method for clustering data elements stored in a data storage system includes reading data elements from the data storage system. Clusters of data elements are formed with each data element being a member of at least one cluster. At least one data element is associated with two or more clusters. Membership of the data element belonging to respective ones of the two or more clusters is represented by a measure of ambiguity. Information is stored in the data storage system to represent the formed clusters.
16 Citations
21 Claims
-
1. (canceled)
-
2. A method for identifying one or more matches between a first data element, with a plurality of fields and with one or more values of one or more of the fields representing a key for the first data element, and each of one or more second data elements stored in a data storage system, the method including:
-
determining one or more variant matches between one or more variants of the key and one or more respective values of one or more search fields of the one or more second data elements, with a variant of the key being specified in accordance with a variant relation for the key; and corroborating the one or more variant matches based on a comparison of one or more respective values of one or more comparison fields of the one or more second data elements to one or more values of one or more comparison fields in the first data element, with a comparison field being different from a search field. - View Dependent Claims (3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A system for identifying one or more matches between a first data element, with a plurality of fields and with one or more values of one or more of the fields representing a key for the first data element, and each of one or more second data elements stored in a data storage system, the system including:
-
one or more processors; and one or more machine-readable hardware storage devices storing instructions that are executable by the one or more processors to perform operations comprising; determining one or more variant matches between one or more variants of the key and one or more respective values of one or more search fields of the one or more second data elements, with a variant of the key being specified in accordance with a variant relation for the key; and corroborating the one or more variant matches based on a comparison of one or more respective values of one or more comparison fields of the one or more second data elements to one or more values of one or more comparison fields in the first data element, with a comparison field being different from a search field. - View Dependent Claims (15, 16, 17)
-
-
18. One or more machine-readable hardware storage devices for identifying one or more matches between a first data element, with a plurality of fields and with one or more values of one or more of the fields representing a key for the first data element, and each of one or more second data elements stored in a data storage system, the one or more machine-readable hardware storage devices storing instructions that are executable by one or more processors to perform operations comprising including:
-
determining one or more variant matches between one or more variants of the key and one or more respective values of one or more search fields of the one or more second data elements, with a variant of the key being specified in accordance with a variant relation for the key; and corroborating the one or more variant matches based on a comparison of one or more respective values of one or more comparison fields of the one or more second data elements to one or more values of one or more comparison fields in the first data element, with a comparison field being different from a search field. - View Dependent Claims (19, 20, 21)
-
Specification