Attribute value information for a data extent
First Claim
1. A computer-implemented method for providing attribute value information for a data extent comprising a set of data entries, each data entry comprising an attribute string value of at least a first attribute, each attribute string value comprising a sequence of symbols, the method comprising:
determining for the first attribute at least one reference string value of a data-extent-specific reference point based on symbol frequencies at each sequence position of the attribute string values of the first attribute in a subset of the set of data entries, the resulting reference string value comprising a sequence of symbols;
determining for each of the attribute string values of the first attribute in the set of data entries an attribute-string-value-specific minimum distance for any reference string value of the data-extent-specific reference point resulting in a set of attribute-string-value-specific minimum distances for the set of data entries;
storing for the data extent the minimum distance and the maximum distance of the set of attribute-string-value-specific minimum distances as attribute value information for further use with query processing.
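The three steps of the claim can be illustrated with a minimal Python sketch. It is not the patented implementation. The claim does not name a distance metric, a tie-breaking rule, or a way of using the stored values, so the position-wise mismatch count in distance, the alphabetical tie-break in reference_string, and the may_contain pruning check are all assumptions made for illustration, as are the function names themselves.

```python
from collections import Counter


def reference_string(sample):
    """Build a reference string from the most frequent symbol at each position.

    Ties are broken alphabetically (an assumption; the claim does not say).
    """
    length = max(len(v) for v in sample)
    ref = []
    for pos in range(length):
        counts = Counter(v[pos] for v in sample if pos < len(v))
        ref.append(min(counts, key=lambda s: (-counts[s], s)))
    return "".join(ref)


def distance(a, b):
    """Assumed metric: mismatching positions plus the length difference."""
    mismatches = sum(1 for x, y in zip(a, b) if x != y)
    return mismatches + abs(len(a) - len(b))


def min_distance(value, refs):
    """Smallest distance from a value to any reference string."""
    return min(distance(value, r) for r in refs)


def extent_summary(values, sample=None):
    """Return the reference strings and the (min, max) of per-value minimum distances."""
    sample = values if sample is None else sample
    refs = [reference_string(sample)]
    dists = [min_distance(v, refs) for v in values]
    return refs, min(dists), max(dists)


def may_contain(query, refs, lo, hi):
    """Hypothetical equality-query check: a stored value equal to the query
    would have the same minimum distance, so a distance outside [lo, hi]
    rules the extent out."""
    return lo <= min_distance(query, refs) <= hi


values = ["apple", "apply", "ample", "angle"]
refs, lo, hi = extent_summary(values)
print(refs, lo, hi)
print(may_contain("zebra", refs, lo, hi))
```

For this sample the reference string is built column by column, and an equality lookup for a value whose distance falls outside the stored range can skip the extent without reading its entries. A value inside the range still requires a scan, since the range only excludes candidates.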
Abstract
The invention relates to a method, a computer program product, and a computer system for providing attribute value information for a data extent having a set of data entries. The method includes: determining a reference string value of a data-extent-specific reference point based on symbol frequencies at each sequence position of attribute string values in a subset of the set of data entries; calculating a distance between each of the attribute string values in the subset and the reference string value of the data-extent-specific reference point, resulting in a set of distances; determining for each of the attribute string values an attribute-string-value-specific minimum distance for any reference string value of the data-extent-specific reference point, resulting in a set of attribute-string-value-specific minimum distances for the set of data entries; and storing for the data extent the minimum distance and the maximum distance of the set of attribute-string-value-specific minimum distances as attribute value information.
53 Citations
18 Claims
Specification