Efficient imagery exploitation employing wavelet-based feature indices
First Claim
1. An efficient method of data mining to facilitate ready identification of desired features within imagery data dispersed among multiple spectral bands, comprising:
- (a) selecting a wavelet type for use in said efficient method of data mining;
(b) providing means for manipulating said data, said means at least further capable of implementing the algorithm,
whereGDFI2n(i, t) is a wavelet-based generalized difference feature index,i refers to wavelength band i of a data collector,t is a specified lag between wavelength bands,h0, h1 . . . h2n−
1 are high frequency coefficientsg0, g1 . . . g2n−
1 are low frequency coefficients, wherein, a number of said high and low frequency coefficients is determined upon establishing an order of a wavelet of said selected wavelet type,n is a specified number of vanishing moments of said selected wavelet type, andzi, zi+t. . . zi +(2n−
1)t are data necessary to yield at least one said wavelet-based generalized difference feature index from a spectral signature of an image;
(c) establishing a set of wavelet-based generalized difference feature indices that may be generated later in said efficient method of data mining;
(d) initiating at least one said means for manipulating data by setting a maximum wavelet order limit, selecting wavelength bands and setting K=0 and setting T=1, whereK is a specified wavelet array index, andT is an incremented specified lag, defined as a specified number of said wavelength bands skipped between ones of said selected wavelength bands;
(e) setting a lag limit defined as
where m is a specified number of wavelength bands in a specified dataset;
(f) reading at least one said data set comprising said wavelength in said specified dataset bands into said means for manipulating;
(g) identifying and discarding said specified wavelength bands having compromised data;
(h) incrementing said K;
(i) incrementing said T by 1;
(j) computing a reduced set of difference-sum wavelength band ratios;
(k) generating at least one said established wavelet-based generalized difference feature index;
(l) generating a cube of each said established wavelet-based generalized difference feature index;
(m) selecting at least one of said established wavelet-based generalized difference feature indices;
(n) thresholding said selected pre-specified established wavelet-based generalized difference feature indices, wherein said thresholding results in only said selected pre-specified established wavelet-based generalized difference feature indices being used henceforth;
(o) saving said thresholded selected pre-specified established wavelet-based generalized difference feature indices;
(p) determining if said lag limit has been met;
(q) if said lag limit has been met, determining if said maximum wavelet order limit has been met;
(r) if said lag limit has not been met, performing another iteration of steps (h) through (r) until said lag limit has been met;
(s) if said maximum wavelet order limit has been met, stopping; and
(t) if said maximum wavelet order limit has not been met, setting said T=1 and performing another iteration of steps (h) through (t) until said maximum wavelet order limit has been met,wherein, if both said lag limit and said maximum wavelet order limit have been met, said efficient method of data mining is ended, resulting in an efficient identification of said desired features in said imagery data.
1 Assignment
0 Petitions
Accused Products
Abstract
A wavelet-based band difference-sum ratio method reduces the computation cost of classification and feature extraction (identification) tasks. A Generalized Difference Feature Index (GDFI), computed using wavelets such as Daubechies wavelets, is employed in a method to automatically generate a large sequence of generalized band ratio images. In select embodiments of the present invention, judicious data mining of the large set of GDFI bands produces a small subset of GDFI bands suitable to identify specific Terrain Category/Classification (TERCAT) features. Other wavelets, such as Vaidyanathan, Coiflet, Beylkin, and Symmlet and the like may be employed in select embodiments. The classification and feature extraction (identification) performance of the band ratio method of the present invention is comparable to that obtained with the same or similar data sets using much more sophisticated methods such as discriminants, neural net classification, endmember Gibbs-based partitioning, and genetic algorithms.
20 Citations
11 Claims
-
1. An efficient method of data mining to facilitate ready identification of desired features within imagery data dispersed among multiple spectral bands, comprising:
-
(a) selecting a wavelet type for use in said efficient method of data mining; (b) providing means for manipulating said data, said means at least further capable of implementing the algorithm,
whereGDFI2n(i, t) is a wavelet-based generalized difference feature index, i refers to wavelength band i of a data collector, t is a specified lag between wavelength bands, h0, h1 . . . h2n−
1 are high frequency coefficientsg0, g1 . . . g2n−
1 are low frequency coefficients,wherein, a number of said high and low frequency coefficients is determined upon establishing an order of a wavelet of said selected wavelet type, n is a specified number of vanishing moments of said selected wavelet type, and zi, zi+t. . . zi +(2n−
1)t are data necessary to yield at least one said wavelet-based generalized difference feature index from a spectral signature of an image;(c) establishing a set of wavelet-based generalized difference feature indices that may be generated later in said efficient method of data mining; (d) initiating at least one said means for manipulating data by setting a maximum wavelet order limit, selecting wavelength bands and setting K=0 and setting T=1, where K is a specified wavelet array index, and T is an incremented specified lag, defined as a specified number of said wavelength bands skipped between ones of said selected wavelength bands; (e) setting a lag limit defined as
where m is a specified number of wavelength bands in a specified dataset;(f) reading at least one said data set comprising said wavelength in said specified dataset bands into said means for manipulating; (g) identifying and discarding said specified wavelength bands having compromised data; (h) incrementing said K; (i) incrementing said T by 1; (j) computing a reduced set of difference-sum wavelength band ratios; (k) generating at least one said established wavelet-based generalized difference feature index; (l) generating a cube of each said established wavelet-based generalized difference feature index; (m) selecting at least one of said established wavelet-based generalized difference feature indices; (n) thresholding said selected pre-specified established wavelet-based generalized difference feature indices, wherein said thresholding results in only said selected pre-specified established wavelet-based generalized difference feature indices being used henceforth; (o) saving said thresholded selected pre-specified established wavelet-based generalized difference feature indices; (p) determining if said lag limit has been met; (q) if said lag limit has been met, determining if said maximum wavelet order limit has been met; (r) if said lag limit has not been met, performing another iteration of steps (h) through (r) until said lag limit has been met; (s) if said maximum wavelet order limit has been met, stopping; and (t) if said maximum wavelet order limit has not been met, setting said T=1 and performing another iteration of steps (h) through (t) until said maximum wavelet order limit has been met, wherein, if both said lag limit and said maximum wavelet order limit have been met, said efficient method of data mining is ended, resulting in an efficient identification of said desired features in said imagery data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. An efficient method of data mining to facilitate ready identification of desired features within a multi-band data set, comprising:
-
(a) selecting a wavelet type for use in said efficient method of data mining; (b) providing means for manipulating said data, said means configured to perform the algorithm,
whereGDFI2n(i, t) is a wavelet-based generalized difference feature index, i refers to band i of a data collector, t is a specified lag between bands, h0, h1. . . h2n−
1 are high frequency coefficientsg0, g1. . . g2n−
1 are low frequency coefficients,wherein, a number of said high and low frequency coefficients is determined upon establishing an order of a wavelet of said selected wavelet type, n is a specified number of vanishing moments of said selected wavelet, and zi, zi+t. . . zi+(2n−
1)t are data necessary to yield at least one said wavelet-based generalized difference feature index from a spectral signature;(c) establishing a set of wavelet-based generalized difference feature indices that may be generated later in said efficient method of data mining; (d) initiating at least one said means for manipulating data by setting a maximum wavelet order limit, selecting bands and setting K=0 and setting T=1, where K is a specified wavelet array index, and T is an incremented a specified lag, defined as a specified number of said bands skipped between ones of said selected bands; (e) setting a lag limit defined as
where m is a specified number of bands in a specified dataset;(f) reading at least one said data set comprising said bands in said specified dataset into said means for manipulating; (g) identifying and discarding said specified bands having compromised data; (h) incrementing said K; (i) incrementing said T by 1; (j) computing a reduced set of difference-sum band ratios; (k) generating at least one said established wavelet-based generalized difference feature index; (l) generating a cube of each said established wavelet-based generalized difference feature index; (m) selecting at least one of said established wavelet-based generalized difference feature indices; (n) thresholding said selected pre-specified established wavelet-based generalized difference feature indices, wherein said thresholding results in only said selected pre-specified established wavelet-based generalized difference feature indices being used henceforth; (o) saving said thresholded selected pre-specified established wavelet-based generalized difference feature indices; (p) determining if said lag limit has been met; (q) if said lag limit has been met, determining if said maximum wavelet order limit has been met; (r) if said lag limit has not been met, performing another iteration of steps (h) through (r) until said lag limit has been met; (s) if said maximum wavelet order limit has been met, stopping; and (t) if said maximum wavelet order limit has not been met, setting said T=1 and performing another iteration of steps (h) through (t) until said maximum wavelet order limit has been met, wherein, if both said lag limit and said maximum wavelet order limit have been met, said efficient method of data mining is ended, resulting in an efficient identification of said desired features in said multi-band data set.
-
-
10. An efficient method of data mining to facilitate ready categorization of a data set dispersed over multiple bands, comprising:
-
selecting at least one wavelet type for use in said method, wherein said wavelet is selected to achieve optimum computational efficiency; providing software for at least implementing an algorithm to calculate at least one wavelet-based generalized difference feature index (GDFI); specifying a set of generalized difference feature indices to be calculated using said efficient method of data mining; providing a software routine to select and process a reduced set of bands of said data set dispersed over multiple bands; iterating a sub-routine of said routine while applying a lag limit and a maximum wavelet order limit to establish a number of iterations, said sub-routine to at least; compute a reduced set of difference-sum band ratios; calculate said generalized difference feature indices; generate the cube of each said calculated generalized difference feature index; select pre-specified ones of said calculated generalized difference feature indices; threshold said selected calculated generalized difference feature indices, wherein said thresholding results in only said selected calculated generalized difference feature indices being used henceforth; and save said thresholded selected calculated generalized difference feature indices; wherein, if both said lag limit and said maximum wavelet order limit have been met, said efficient method of data mining to facilitate ready categorization of a data set dispersed over multiple bands is ended, resulting in an efficient identification of said desired features in said data set.
-
-
11. A method that samples all band ratio combinations in hyperspectral data for use with rapid combinatorial computations that integrate wavelet and wavelet-variogram techniques for improved data anomaly filtering, detection and classification of imagery, comprising:
-
selecting at least one wavelet type for use in said method, wherein said at least one wavelet type is selected to achieve optimum computational efficiency; and providing software that displays results in a form that facilitates classification and feature extraction tasks while employing a least-ordered said wavelet that enables select features to be readily identified, wherein executing said software yields band ratios that provide useful information in support of said classification and feature extraction tasks, and wherein said method yields select said imagery with specific features highlighted by employing at least one generalized difference feature index (GDFI) band ratio and multiplying said generalized difference feature index (GDFI) band ratio by constants associated with coefficients of said at least one wavelet type, said GDFI band ratio defined by; where GDFI2n(i, t) is a wavelet-based generalized difference feature index, i refers to band i of a data collector, t is a lag between bands, h0, h1. . . h2n−
1 are high frequency coefficientsg0, g1. . . g2n−
1 are low frequency coefficients,wherein, the number of said high and low frequency coefficients is determined upon establishing the order of said wavelet type, n is the number of vanishing moments of said selected wavelet type, and zi, zi+t . . . zi+(2n−
1)t are data used to yield at least one said generalized difference feature index; andwherein at least one said feature appears in a resultant display as a distinct color or shade lighter than the remainder of said imagery.
-
Specification