Multi-pass data organization and automatic naming
First Claim
1. A method comprising:
- identifying a first cluster of data items among a plurality of data items in response to a query,each of the plurality of data items including an attribute able to have one of a plurality of values of the attribute;
subdividing the first cluster of data items into a second cluster of data items and a third cluster of data items,the subdividing of the first cluster being performed by a processor of a machine and based on a common value of the attribute,the common value being present in each data item within the second cluster and absent from each data item within the third cluster;
storing the second cluster of data items as corresponding to the common value of the attribute,each data item within the second cluster representing one of a plurality of items; and
naming the second cluster based on a property shared by a majority of the second cluster.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and a system to organize a data set into groups of data subsets in multiple passes using different parameters and to automatically name the groups is disclosed. For example, a data set is retrieved in accordance with a search query submitted by a user. The data set is organized into clusters based on a statistic(s) of the data set. The data set is then organized into groups of data subsets based on an attribute(s) indicated by the data set. Each of the groups are automatically named based on a property shared by data units of the group. The name(s) of a group may be mined from the data units of the group, retrieved from a structure that maps to attribute values indicated by the data units of the group, etc.
-
Citations
19 Claims
-
1. A method comprising:
-
identifying a first cluster of data items among a plurality of data items in response to a query, each of the plurality of data items including an attribute able to have one of a plurality of values of the attribute; subdividing the first cluster of data items into a second cluster of data items and a third cluster of data items, the subdividing of the first cluster being performed by a processor of a machine and based on a common value of the attribute, the common value being present in each data item within the second cluster and absent from each data item within the third cluster; storing the second cluster of data items as corresponding to the common value of the attribute, each data item within the second cluster representing one of a plurality of items; and naming the second cluster based on a property shared by a majority of the second cluster. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A system comprising:
-
a navigation module configured to access a plurality of data items, each of the plurality of data items including an attribute able to have one of a plurality of values of the attribute; and a processor configured by an organizing module communicatively coupled to the navigation module, the organizing module configured to; identify a first cluster of data items among the plurality of data items in response to a query; subdivide the first cluster of data items into a second cluster of data items and a third cluster of data items, the subdividing of the first cluster being based on a common value of the attribute, the common value being present in each data item within the second cluster and absent from each data item within the third cluster; store the second cluster of data items as corresponding to the common value of the attribute, each data item within the second cluster representing one of a plurality of items; and name the second cluster based on a property shared by a majority of the second cluster. - View Dependent Claims (14, 15, 16, 17)
-
-
18. A non-transitory machine-readable storage medium comprising instructions that, when executed by one or more processors of a machine, cause the machine to perform operations comprising:
-
identifying a first cluster of data items among a plurality of data items in response to a query, each of the plurality of data items including an attribute able to have one of a plurality of values of the attribute; subdividing the first cluster of data items into a second cluster of data items and a third cluster of data items, the subdividing of the first cluster being based on a common value of the attribute, the common value being present in each data item within the second cluster and absent from each data item within the third cluster; storing the second cluster of data items as corresponding to the common value of the attribute, each data item within the second cluster representing one of a plurality of items; and naming the second cluster based on a property shared by a majority of the second cluster.
-
-
19. A system comprising:
-
means for accessing a plurality of data items, each of the plurality of data items including an attribute able to have one of a plurality of values of the attribute; and means for; identifying a first cluster of data items among the plurality of data items in response to a query; subdividing the first cluster of data items into a second cluster of data items and a third cluster of data items, the subdividing of the first cluster being based on a common value of the attribute, the common value being present in each data item within the second cluster and absent from each data item within the third cluster; storing the second cluster of data items as corresponding to the common value of the attribute, each data item within the second cluster representing one of a plurality of items; and naming the second cluster based on a property shared by a majority of the second cluster.
-
Specification