Identifying related objects using quantum clustering
First Claim
Patent Images
1. A method of clustering objects, implemented on a computing device, the method comprising:
- constructing a feature-object database of multiple objects, wherein the constructing comprises, for each of the multiple objects,obtaining the object;
extracting features from the object to obtain extracted features;
calculating feature values for the extracted features, wherein at least some of the features values are continuous;
selecting extracted features to obtain selected features and corresponding selected feature values;
quantizing the selected feature values to obtain quantized selected feature values, wherein quantizing comprises identifying the selected feature values as discrete or continuous and transforming the identified continuous feature values into discrete feature values; and
building the feature-object database having keys comprising quantized selected feature values;
building a connected objects database;
building a directed graph of connected objects from the connected objects database, wherein the connected objects database is built from the feature-object database;
identifying clusters of connected objects; and
evaluating the clusters of identified objects to designate groups of related objects.
11 Assignments
0 Petitions
Accused Products
Abstract
Techniques for grouping related objects such as documents and files using quantum clustering are disclosed. A method may include constructing a feature-object database of multiple objects. The feature-object database may have quantized selected features as keys. A connected objects database maybe built. Clusters of connected objects may be identified in the connected objects database. The clusters of identified objects may be evaluated to determine groups of related objects. The method may be implemented on a computing device.
57 Citations
22 Claims
-
1. A method of clustering objects, implemented on a computing device, the method comprising:
-
constructing a feature-object database of multiple objects, wherein the constructing comprises, for each of the multiple objects, obtaining the object; extracting features from the object to obtain extracted features; calculating feature values for the extracted features, wherein at least some of the features values are continuous; selecting extracted features to obtain selected features and corresponding selected feature values; quantizing the selected feature values to obtain quantized selected feature values, wherein quantizing comprises identifying the selected feature values as discrete or continuous and transforming the identified continuous feature values into discrete feature values; and building the feature-object database having keys comprising quantized selected feature values; building a connected objects database; building a directed graph of connected objects from the connected objects database, wherein the connected objects database is built from the feature-object database; identifying clusters of connected objects; and evaluating the clusters of identified objects to designate groups of related objects. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A storage medium having instructions stored thereon which when executed by a processor cause the processor to perform actions comprising:
-
constructing a feature-object database of multiple objects, wherein the constructing comprises, for each of the multiple objects, obtaining the object; extracting features from the object to obtain extracted features; calculating feature values for the extracted features, wherein at least some of the features values are continuous; selecting extracted features to obtain selected features and corresponding selected feature values; quantizing the selected feature values to obtain quantized selected feature values, wherein quantizing comprises identifying the selected feature values as discrete or continuous and transforming the identified continuous selected feature values into discrete selected feature values; and building the feature-object database having keys comprising quantized selected feature values; building a connected objects database; building a directed graph of connected objects from the connected objects database, wherein the connected objects database is built from the feature-object database; identifying clusters of connected objects; and evaluating the clusters of identified objects to designate groups of related objects. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A computing device comprising:
- a processor, a memory, and a storage device, wherein the storage device includes a storage medium having instructions thereon which when executed cause the computing device to perform operations comprising;
constructing a feature-object database of multiple objects, wherein the constructing comprises, for each of the multiple objects, obtaining the object; extracting features from the object; calculating feature values for the extracted features, wherein at least some of the features values are continuous; selecting features to obtain selected features and corresponding selected feature values; quantizing the selected feature values to obtain quantized selected feature values, wherein quantizing comprises identifying the selected feature values as discrete or continuous and transforming the identified continuous selected feature values into discrete selected feature values; and building the feature-object database having keys comprising quantized selected feature values; building a connected objects database; building a directed graph of connected objects from the connected objects database, wherein the connected objects database is built from the feature-object database; identifying clusters of connected objects; and evaluating the clusters of identified objects to determine groups of related objects. - View Dependent Claims (12, 13, 14, 15)
- a processor, a memory, and a storage device, wherein the storage device includes a storage medium having instructions thereon which when executed cause the computing device to perform operations comprising;
-
16. A method, implemented on a computing device, of identifying related documents by clustering a plurality of documents, the method comprising:
-
constructing a feature-document database of a plurality of documents, wherein the constructing comprises, for each of the documents, obtaining one of the plurality of documents as a current document; extracting words from the current document to obtain extracted words; forming features from the extracted words; calculating feature values for the features, wherein at least some of the features values are continuous; selecting some of the features to obtain selected features and corresponding feature values; quantizing the selected feature value to obtain quantized selected feature values, wherein quantizing comprises identifying the selected feature values as discrete or continuous and transforming the identified continuous selected feature values into discrete selected feature values; and building the feature-document database having keys comprising quantized selected features; building a connected documents database; building a directed graph of connected objects from the connected objects database, wherein the connected objects database is built from the feature-object database; identifying clusters of connected documents; and evaluating the clusters of identified documents to determine groups of related documents. - View Dependent Claims (17, 18, 19, 20, 21, 22)
-
Specification