MULTI-PASS DATA ORGANIZATION AND AUTOMATIC NAMING
First Claim
1. A method comprising:
- identifying a first cluster of data items among a plurality of data items in response to a query,each of the plurality of data items including an attribute able to have one of a plurality of values of the attribute;
subdividing the first cluster of data items into a second duster of data items and a third cluster of data items,the subdividing of the first cluster being performed by a processor ofa machine and based on a common value of the attribute,the common value being present in each data item within the second cluster and absent from each data item within the third cluster; and
storing the second cluster of data items as corresponding to the common attribute.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and a system to organize a data set into groups of data subsets in multiple passes using different parameters and to automatically name the groups is disclosed. For example, a data set is retrieved in accordance with a search query submitted by a user. The data set is organized into clusters based on a statistic(s) of the data set. The data set is then organized into groups of data subsets based on an attribute(s) indicated by the data set. Each of the groups are automatically named based on a property shared by data units of the group. The name(s) of a group may be mined from the data units of the group, retrieved from a structure that maps to attribute values indicated by the data units of the group, etc.
-
Citations
21 Claims
-
1. A method comprising:
-
identifying a first cluster of data items among a plurality of data items in response to a query, each of the plurality of data items including an attribute able to have one of a plurality of values of the attribute; subdividing the first cluster of data items into a second duster of data items and a third cluster of data items, the subdividing of the first cluster being performed by a processor of a machine and based on a common value of the attribute, the common value being present in each data item within the second cluster and absent from each data item within the third cluster; and storing the second cluster of data items as corresponding to the common attribute. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A system comprising:
-
a navigation module configured to access a plurality of data items, each of the plurality of data items including an attribute able to have one of a plurality of values of the attribute; and a processor configured by an organizing module communicatively coupled to the navigation module, the organizing module configured to; identify a first cluster of data items among the plurality of data items in response to a query; subdivide the first cluster of data items into a second cluster of data items and a third cluster of data items, the subdividing of the first cluster being based on a common value of the attribute, the common value being present in each data item within the second cluster and absent from each data item within the third cluster; and store the second cluster of data items as corresponding to the common attribute. - View Dependent Claims (16, 17, 18, 19)
-
-
20. A non-transitory machine-readable storage medium comprising instructions that, when executed by one or more processors of a machine, cause the machine to perform operations comprising:
-
identifying a first cluster of data items among a plurality of data items in response to a query, each of the plurality of data items including an attribute able to have one of a plurality of values of the attribute; subdividing the first cluster of data items into a second cluster of data items and a third cluster of data items, the subdividing of the first cluster being based on a common value of the attribute, the common value being present in each data item within the second cluster and absent from each data item within the third cluster; and storing the second cluster of data items as corresponding to the common attribute.
-
-
21. A system comprising:
-
means for accessing a plurality of data items, each of the plurality of data items including an attribute able to have one of a plurality of values of the attribute; and means for; identifying a first cluster of data items among the plurality of data items in response to a query; subdividing the first cluster of data items into a second cluster of data items and a third cluster of data items, the subdividing of the first cluster being based on a common value of the attribute, the common value being present in each data item within the second cluster and absent from each data item within the third cluster; and storing the second cluster of data items as corresponding to the common attribute.
-
Specification