Multi-pass data organization and automatic naming
First Claim
1. A computer-implemented method comprising:
- receiving a query from a client machine;
retrieving a data set based on the query.the data set including a plurality of data units;
a data unit of the plurality of data units being representative of an item,the data unit includes a value of an attribute of the item represented by the data unit;
performing a first operation that organizes the data set into a first plurality of clusters;
performing a second operation that partitions the first plurality of clusters into a second plurality of clusters and a third plurality of clusters, the second plurality of clusters being characterized by an attribute common to each cluster of the second plurality of clusters,a cluster of the second plurality of clusters including the data unit,the attribute common to each cluster of the second plurality of clusters being the attribute of the item,the third plurality of clusters being characterized by absence of the attribute from each cluster of the third plurality of clusters,the second operation being performed using a processor of a server machine communicatively coupled to the client machine and to a database; and
storing the second plurality of clusters in the database.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and a system to organize a data set into groups of data subsets in multiple passes using different parameters and to automatically name the groups is disclosed. For example, a data set is retrieved in accordance with a search query submitted by a user. The data set is organized into clusters based on a statistic(s) of the data set. The data set is then organized into groups of data subsets based on an attribute(s) indicated by the data set. Each of the groups are automatically named based on a property shared by data units of the group. The name(s) of a group may be mined from the data units of the group, retrieved from a structure that maps to attribute values indicated by the data units of the group, etc.
32 Citations
17 Claims
-
1. A computer-implemented method comprising:
receiving a query from a client machine; retrieving a data set based on the query. the data set including a plurality of data units; a data unit of the plurality of data units being representative of an item, the data unit includes a value of an attribute of the item represented by the data unit; performing a first operation that organizes the data set into a first plurality of clusters; performing a second operation that partitions the first plurality of clusters into a second plurality of clusters and a third plurality of clusters, the second plurality of clusters being characterized by an attribute common to each cluster of the second plurality of clusters, a cluster of the second plurality of clusters including the data unit, the attribute common to each cluster of the second plurality of clusters being the attribute of the item, the third plurality of clusters being characterized by absence of the attribute from each cluster of the third plurality of clusters, the second operation being performed using a processor of a server machine communicatively coupled to the client machine and to a database; and storing the second plurality of clusters in the database. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
12. A system comprising:
-
a memory to host a data set including plurality of data units; a navigation module configured to retrieve a plurality of data units responsive to a query received from a client machine; a data unit of the plurality of data units being representative of an item, the data unit including a value of an attribute of the item represented by the data unit; a hardware-implemented organizing module communicatively coupled to the navigation module, the organizing module being configured to; perform a first operation that organizes the data set into a first plurality of clusters; and perform a second operation that partitions the first plurality of clusters into a second plurality of clusters and a third plurality of clusters, the second plurality clusters being characterized by an attribute common to each cluster of the second plurality of clusters, a cluster of the second plurality of clusters includes the data unit; the attribute common to each cluster of the second plurality of clusters is the attribute of the item, the third plurality of clusters being characterized by absence of the attribute from each cluster of the third plurality of clusters; and a storage module communicatively coupled to the organizing module, the storage module being configured to store the second plurality of clusters in a database. - View Dependent Claims (13, 14, 15)
-
-
16. A machine-readable storage medium comprising instructions that, when executed by one or more processors of a machine, cause the machine to perform a method comprising:
-
receiving a query from a client machine; retrieving a data set based on the query; the data set including a plurality of data units; a data unit of the plurality of data units being representative of an item, the data unit includes a value of an attribute of the item represented by the data unit; performing a first operation that organizes the data set into a first plurality of clusters; performing a second operation that partitions the first plurality of clusters into a second plurality of clusters and a third plurality of clusters, the second plurality of clusters being characterized by an attribute common to each cluster of the second plurality of clusters, a cluster of the second plurality of clusters includes the data unit; the attribute common to each cluster of the second plurality of clusters is the attribute of the item, the third plurality clusters being characterized by absence of the attribute from each cluster of the third plurality clusters; and storing the second plurality of clusters in a database. - View Dependent Claims (17)
-
Specification