Clustering
First Claim
1. A method comprising receiving a set of data containing values associated with respective data points, the values associated with each of the data points being characterized by a distribution, expressing the values for each of the data points in a form that includes information about the distribution of the values for each of the data points, and using the distribution information in clustering the set of data with at least one other set of data containing values associated with data points.
2 Assignments
0 Petitions
Accused Products
Abstract
A set of data is received containing values associated with respective data points, the values associated with each of the data points being characterized by a distribution. The values for each of the data points are expressed in a form that includes information about a distribution of the values for each of the data points. The distribution information is used in clustering the set of data with at least one other set of data containing values associated with data points.
64 Citations
17 Claims
-
1. A method comprising
receiving a set of data containing values associated with respective data points, the values associated with each of the data points being characterized by a distribution, expressing the values for each of the data points in a form that includes information about the distribution of the values for each of the data points, and using the distribution information in clustering the set of data with at least one other set of data containing values associated with data points.
-
15. A machine-accessible medium that when accessed results in a machine effecting actions comprising:
-
receiving a set of data containing values associated with respective data points, the values associated with each of the data points being characterized by a distribution, expressing the values for each of the data points in a form that includes information about a distribution of the values for each of the data points, and using the distribution information in clustering the set of data with at least one other set of data containing values associated with data points.
-
-
16. A method comprising
receiving sets of data, each of the sets containing values associated with respect data points, the values associated with each of the data points being characterized by a distribution, evaluating a distance function that characterizes the similarity or dissimilarity of at least two of the sets of data, the distance function including a factor based on the distributions of the values in the sets, and using the evaluation of the distance function as a basis for clustering of the sets of data.
-
17. A method comprising
receiving data that represents seasonality time-series for each of a set of retail items, the data also representing error information associated with data values for each of a series of time points for each of the items, and forming composite seasonality time-series based on respective clusters of the retail item seasonality time-series, the composites formed based in part on the error information.
Specification