Data search system and method using mutual subsethood measures
Abstract
A non-textual data searching system according to the invention is capable of searching non-textual data at semantic levels above the fundamental symbolic level. The general approach begins by indexing the non-textual data corpus in such a way as to facilitate searching. The indexing process results in a number of “keytroids” that represent clusters of fuzzy attribute vectors, where each fuzzy attribute vector represents a data event associated with one or more non-textual data points. The actual searching process is analogous to a conventional text-based search engine: a query vector, which identifies a number of fuzzy attributes of the desired data, is processed to retrieve and rank a number of keytroids. The keytroids can be inverse-mapped to obtain data events and/or non-textual data points that satisfy the query.
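The indexing and inverse-mapping steps described in the abstract can be illustrated with a minimal sketch. The abstract does not say how a keytroid is derived from its cluster, so the component-wise mean used here is an assumption, and the event names, cluster assignments, and the helpers `build_keytroid` and `inverse_map` are hypothetical.

```python
from typing import Dict, List, Sequence


def build_keytroid(cluster: List[Sequence[float]]) -> List[float]:
    """Summarize a cluster of fuzzy attribute vectors as one vector.

    Assumption: the keytroid is the component-wise mean of its members.
    The source only says a keytroid "represents" a cluster.
    """
    n = len(cluster)
    return [sum(column) / n for column in zip(*cluster)]


def inverse_map(keytroid_ids: List[str],
                clusters: Dict[str, List[str]]) -> List[str]:
    """Return the data events behind the given keytroids, in keytroid order."""
    events: List[str] = []
    for kid in keytroid_ids:
        events.extend(clusters[kid])
    return events


# Hypothetical data events, each described by fuzzy attribute values in [0, 1].
events = {
    "event_a": [1.0, 0.5],
    "event_b": [0.5, 0.0],
    "event_c": [0.0, 1.0],
}

# Hypothetical clustering result: keytroid id -> member data events.
clusters = {"k1": ["event_a", "event_b"], "k2": ["event_c"]}

# The index maps each keytroid id to its representative vector.
keytroids = {
    kid: build_keytroid([events[e] for e in members])
    for kid, members in clusters.items()
}

print(keytroids)
print(inverse_map(["k2", "k1"], clusters))
```

Keeping the cluster membership alongside the keytroid vectors is what makes the inverse mapping a simple lookup once a search has selected keytroids.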
21 Claims
1. A data search method comprising:
receiving a query vector specifying a searching set of fuzzy attribute values for a collection of data;
calculating mutual subsethood measures between said query vector and a plurality of keytroids in a keytroid database, each keytroid in said keytroid database specifying a respective set of fuzzy attribute values for said collection of data; and
retrieving a subset of keytroids from said keytroid database, each keytroid in said subset of keytroids satisfying a threshold mutual subsethood measure.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A data search system comprising:
a query input component configured to receive a query vector specifying a searching set of fuzzy attribute values for a collection of data;
a keytroid database containing keytroids, each specifying a respective set of fuzzy attribute values for said collection of data; and
a query processing component configured to calculate mutual subsethood measures between said query vector and a plurality of keytroids in said keytroid database, and to retrieve a subset of keytroids from said keytroid database, each keytroid in said subset of keytroids satisfying a threshold mutual subsethood measure.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19, 20
21. A computer program for searching non-textual data, said computer program being embodied on a computer-readable medium, said computer program having computer-executable instructions for carrying out a method comprising:
receiving a query vector specifying a searching set of fuzzy attribute values for a collection of data;
calculating mutual subsethood measures between said query vector and a plurality of keytroids in a keytroid database, each keytroid in said keytroid database specifying a respective set of fuzzy attribute values for said collection of data; and
retrieving a subset of keytroids from said keytroid database, each keytroid in said subset of keytroids satisfying a threshold mutual subsethood measure.
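The three steps recited in the independent claims (receive a query vector, calculate mutual subsethood measures, retrieve keytroids that satisfy a threshold) can be sketched as follows. The claims do not define the measure, so this sketch assumes the common fuzzy-set form, the sum of component-wise minima divided by the sum of component-wise maxima, and it assumes that "satisfying a threshold" means scoring at or above it. The keytroid ids, vectors, threshold value, and the function names `mutual_subsethood` and `search` are hypothetical.

```python
from typing import Dict, List, Sequence, Tuple


def mutual_subsethood(a: Sequence[float], b: Sequence[float]) -> float:
    """Assumed measure: size of the fuzzy intersection over the fuzzy union.

    Intersection uses component-wise min, union uses component-wise max,
    and "size" is the sum of membership values.
    """
    if len(a) != len(b):
        raise ValueError("vectors must have the same number of attributes")
    intersection = sum(min(x, y) for x, y in zip(a, b))
    union = sum(max(x, y) for x, y in zip(a, b))
    # Two all-zero fuzzy sets are identical, so treat them as a full match.
    return 1.0 if union == 0 else intersection / union


def search(query: Sequence[float],
           keytroids: Dict[str, Sequence[float]],
           threshold: float) -> List[Tuple[str, float]]:
    """Score every keytroid, keep those at or above the threshold, rank them."""
    scored = ((kid, mutual_subsethood(query, vec))
              for kid, vec in keytroids.items())
    hits = [(kid, score) for kid, score in scored if score >= threshold]
    return sorted(hits, key=lambda hit: hit[1], reverse=True)


# Hypothetical keytroid database with three fuzzy attributes per keytroid.
keytroids = {
    "k1": [0.5, 0.5, 0.0],
    "k2": [1.0, 0.0, 0.0],
    "k3": [0.0, 0.0, 1.0],
}
query = [0.5, 0.5, 0.0]

results = search(query, keytroids, threshold=0.3)
for kid, score in results:
    print(kid, round(score, 3))
```

The measure is symmetric and ranges from 0 (no overlap) to 1 (identical fuzzy sets), which is why a single threshold can serve both to filter and to rank the retrieved keytroids.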
Specification