Apparatus and method for removing non-discriminatory indices of an indexed dataset
First Claim
1. A data analyzer for use with a pattern classifier to compress a set of indexed data, comprising a data removal module for identifying and removing portions of the set of indexed data having insufficient discriminatory power based on the ensemble statistics of the set of indexed data.
1 Assignment
0 Petitions
Accused Products
Abstract
The present invention provides a device and method for removing non-discriminatory indices of an indexed dataset using ensemble statistics analysis. The device may include a data removal module (320) for removing non-discriminatory indices. For example, the data removal module (320) may comprise a common characteristic removal module and/or a noise removal module. In addition, the data analyzer (300) may comprise a normalization means (310) for normalizing the indexed data. The method of the present invention comprises the steps of identifying and removing portions of the set of data having insufficient discriminatory power based on ensemble statistics of the set of indexed data. For example, the method may include the steps of identifying and removing common characteristics and/or noise portions of the set of indexed data. In addition, the method may comprise the step of normalizing the indexed data either prior to or after the step of removing portions of the set of data.
18 Citations
53 Claims
- 1. A data analyzer for use with a pattern classifier to compress a set of indexed data, comprising a data removal module for identifying and removing portions of the set of indexed data having insufficient discriminatory power based on the ensemble statistics of the set of indexed data.
-
22. A data analyzer for use with a pattern classifier to compress a set of indexed data having common characteristics and noise, comprising:
-
a. means for determining a common characteristic threshold for the indexed data set;
b. means for removing indices having ensemble statistics higher than the common characteristic threshold value to provide a retained dataset;
c. means for calculating ensemble statistics of each retained index in the retained dataset;
d. means for determining a noise threshold;
e. means for removing indices from the retained dataset having an ensemble statistic lower than a noise threshold value; and
f. means for normalizing the indexed data. - View Dependent Claims (23, 24, 25, 26)
-
-
27. A method for filtering spectral data from a set of spectra to remove common characteristics and noise, comprising:
-
identifying and removing the common characteristics from each spectrum within the set of spectra based on the ensemble statistics of the set of spectra, and identifying and removing the noise portions of each spectrum based on ensemble statistics of the set of spectra, whereby a filtered spectra is provided in which the common characteristic and noise portions have been removed. - View Dependent Claims (50)
-
- 28. A method for analyzing a set of indexed data to compress the set of data, comprising the steps of identifying and removing portions of the set of data having insufficient discriminatory power based on ensemble statistics of the set of indexed data, thereby providing a set of compressed indexed data.
-
51. A method for classifying a set of indexed data which include a set of control spectra, comprising the steps of:
-
a. calculating an ensemble statistic at each index in the control spectra;
b. identifying those indices at which the ensemble statistic exceeds a first selected threshold;
c. removing the identified indices from all spectra in the set of indexed data to provide a set of compressed indexed data;
d. calculating an ensemble statistic at each index of the compressed indexed data;
e. removing all indices from each compressed spectrum that have an ensemble statistic that is lower than a second selected threshold value to provide a set of reduced indexed data;
f. extracting a feature portion of each of the reduced indexed data to provide a set of feature spectra; and
g. classifying the set of feature spectra into clusters. - View Dependent Claims (52, 53)
-
Specification