SEARCH CLUSTERING
First Claim
Patent Images
1. A method comprising:
- retrieving item data from a plurality of listings, the item data filtered from noise data;
constructing at least one base cluster having at least one document with common item data stored in a suffix ordering;
compacting the at least one base cluster to create a compacted cluster representation having a reduced duplicate suffix ordering amongst the clusters; and
merging the compact cluster representation to generate a merged cluster, the merging based upon a first overlap value applied to the at least one document with common item data.
2 Assignments
0 Petitions
Accused Products
Abstract
In one example embodiment, a method is illustrated as including retrieving item data from a plurality of listings, the item data filtered from noise data, constructing at least one base cluster having at least one document with common item data stored in a suffix ordering, compacting the at least one base cluster to create a compacted cluster representation having a reduced duplicate suffix ordering amongst the clusters, and merging the compact cluster representation to generate a merged cluster, the merging based upon a first overlap value applied to the at least one document with common item data.
-
Citations
24 Claims
-
1. A method comprising:
-
retrieving item data from a plurality of listings, the item data filtered from noise data; constructing at least one base cluster having at least one document with common item data stored in a suffix ordering; compacting the at least one base cluster to create a compacted cluster representation having a reduced duplicate suffix ordering amongst the clusters; and merging the compact cluster representation to generate a merged cluster, the merging based upon a first overlap value applied to the at least one document with common item data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A computer system comprising:
-
a retrieving engine to retrieve item data from a plurality of listings, the item data filtered from noise data; a cluster generator to construct at least one base cluster having at least one document with common item data stored in a suffix ordering; a compacting engine to compact the at least one base cluster to create a compacted cluster representation having a reduced duplicate suffix ordering amongst the clusters; and a first merging engine to merge the compact cluster representation to generate a merged cluster, the merging based upon a first overlap value applied to the at least one document with common item data. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. An apparatus comprising:
-
means for retrieving item data from a plurality of listings, the item data filtered from noise data; means for constructing at least one base cluster having at least one document with common item data stored in a suffix ordering; means for compacting the at least one base cluster to create a compacted cluster representation having a reduced duplicate suffix ordering amongst the clusters; and means for merging the compact cluster representation to generate a merged cluster, the merging based upon a first overlap value applied to the at least one document with common item data.
-
-
24. A machine-readable medium comprising instructions, which, when implemented by one or more machines that cause the one or more machines to perform the following operations:
-
retrieving item data from a plurality of listings, the item data filtered from noise data; constructing at least one base cluster having at least one document with common item data stored in a suffix ordering; compacting the at least one base cluster to create a compacted cluster representation having a reduced duplicate suffix ordering amongst the clusters; and merging the compact cluster representation to generate a merged cluster, the merging based upon a first overlap value applied to the at least one document with common item data.
-
Specification