Automatic knowledge database generation for classifying objects and systems therefor
First Claim
1. An object classification system having a computer, a digital image data source, a machine readable pattern recognition code encoded in the computer for interpreting digital image data and for characterizing images represented by the digital image data from said digital image source as a multi-dimensional descriptor vector for classification of objects using a knowledge database, and a machine readable code for automatically generating the knowledge database, the classifier comprising:
- a) a memory which stores in computer readable form a plurality of descriptor vectors derived from a plurality of training images, each respective one of the training images including a predetermined classification code to permit separating the descriptor vectors into a plurality of class clusters;
b) a similarity matrix stored in a computer memory, said similarity matrix containing similarity values as between all of said descriptor vectors to permit identifying a first least similar descriptor vector and a most similar descriptor vector in each class cluster as compared to all other class clusters to thereby provide at least two inter-class extreme points for each class cluster, and to permit identifying a second least similar descriptor vector from a selected extreme point within a class cluster to thereby permit selecting said least similar descriptor vector as an intra-class extreme point;
c) an extreme point balancer which eliminates redundant extreme points from a class cluster.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for automatically generating a knowledge database in an object classification system having a digital image data source, and a computer, includes the steps of inputting digital image data corresponding to a plurality of training images, and characterizing the digital image data according to pre-defined variables, or descriptors, to thereby provide a plurality of descriptor vectors corresponding to the training images. Predetermined classification codes are inputted for the plurality of training images, to thereby define object class clusters comprising descriptor vector points having the same classification codes in N-dimensional Euclidean space. The descriptor vectors, or points, are reduced using a similarity matrix indicating proximity in N-dimensional Euclidean space, to select those descriptors vectors, called extreme points, which lie on the boundary surface of their respective class cluster. The non-selected points interior to the class cluster are not included in the knowledge database. The extreme points are balanced by eliminating functionally redundant extreme points from each class cluster to provide a preliminary knowledge database. Fine tuning of the preliminary knowledge database is performed by either deleting extreme points that tend to reduce the accuracy of the database, or adding new rules which enhance the accuracy of the data base. Alternately, the fine tuning step may be skipped.
-
Citations
16 Claims
-
1. An object classification system having a computer, a digital image data source, a machine readable pattern recognition code encoded in the computer for interpreting digital image data and for characterizing images represented by the digital image data from said digital image source as a multi-dimensional descriptor vector for classification of objects using a knowledge database, and a machine readable code for automatically generating the knowledge database, the classifier comprising:
-
a) a memory which stores in computer readable form a plurality of descriptor vectors derived from a plurality of training images, each respective one of the training images including a predetermined classification code to permit separating the descriptor vectors into a plurality of class clusters; b) a similarity matrix stored in a computer memory, said similarity matrix containing similarity values as between all of said descriptor vectors to permit identifying a first least similar descriptor vector and a most similar descriptor vector in each class cluster as compared to all other class clusters to thereby provide at least two inter-class extreme points for each class cluster, and to permit identifying a second least similar descriptor vector from a selected extreme point within a class cluster to thereby permit selecting said least similar descriptor vector as an intra-class extreme point; c) an extreme point balancer which eliminates redundant extreme points from a class cluster. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method for automatically generating a knowledge database in a classification system having a computer, a computer memory, a digital image data source, and a machine readable code for automatically generating a knowledge database, the method comprising the steps of:
-
a) characterizing digital image data from the digital image data source representing a plurality of training images and having a predetermined classification code for each respective one of the plurality of images, to provide a plurality of descriptor vectors separated by said predetermined classification codes into a plurality of class clusters stored in the computer memory; b) calculating a similarity matrix across all the descriptor vectors to permit identifying a most similar descriptor vector and a first least similar descriptor vector in each class cluster as compared to descriptor vectors in all other class clusters thereby providing at least two inter-class extreme points for each class cluster; c) identifying a second least similar descriptor vector in a class cluster as compared to a selected extreme point in the same class cluster to permit selecting said second least similar descriptor vector as an intra-class extreme point; d) balancing said extreme points to eliminate redundant extreme points and to thereby define a preliminary knowledge database; and e) fine tuning the preliminary knowledge database to improve the accuracy of the preliminary knowledge database to provide a knowledge database, said knowledge database being stored in a computer memory. - View Dependent Claims (7, 8, 9)
-
-
10. An object classification system, comprising a computer useable medium having a computer readable program code means embodied therein for causing the object classification system to automatically generate a knowledge database for classifying unknown objects, the computer readable program code means in the object classification system comprising:
-
a) computer readable code means for causing a computer to retrieve a plurality of descriptor vectors and predetermined classification codes from a computer memory, each of said descriptor vectors representing a training image, each of said descriptor vectors further comprising a plurality of numerical values corresponding to physical parameters used to characterize an object image, said descriptor vectors separated by said pre-determined classification codes into a plurality of class clusters; b) computer readable program code means for causing the computer to calculate a similarity matrix across all said descriptor vectors to permit selecting as extreme points a most similar descriptor vector and a first least similar descriptor vector in each class cluster as compared to descriptor vectors in all other class clusters; c) computer readable program code means for causing the computer to select a second least similar descriptor vector in a class cluster as compared to a selected extreme point in the same class cluster to thereby provide an intra-class extreme point; and d) computer readable program code means for causing the computer to delete at least one of a first extreme point most similar to a second extreme point according to a predetermined similarity threshold to thereby provide a preliminary knowledge database. - View Dependent Claims (11, 12)
-
-
13. A program storage device readable by a machine tangibly embodying a program of instructions executable by the machine to perform method steps for automatically generating a knowledge database for object classification, said method steps comprising:
-
a) characterizing digital image data from a digital image data source representing a plurality of training images and having a predetermined classification code for each respective one of the plurality of images, to provide a plurality of descriptor vectors separated by said predetermined classification codes into a plurality of class clusters stored in a computer memory; b) calculating a similarity matrix across all the descriptor vectors to permit identifying a most similar descriptor vector and a first least similar descriptor vector in each class cluster as compared to descriptor vectors in all other class clusters thereby providing at least two inter-class extreme points for each class cluster; c) identifying a second least similar descriptor vector in a class cluster as compared to a selected extreme point in the same class cluster to permit selecting said second least similar descriptor vector as an intra-class extreme point; d) balancing said extreme points to eliminate redundant extreme points and to thereby define a preliminary knowledge database; and e) fine tuning the preliminary knowledge database to improve the accuracy of the preliminary knowledge database to provide a knowledge database, said knowledge database being stored in a computer memory. - View Dependent Claims (14, 15, 16)
-
Specification