System and methods for providing adaptive media property classification
First Claim
1. A method of classifying data according to perceptual properties of the data, the method being suited for searching and sorting large databases of media entities, including music, video and image databases, the method comprising:
- assigning to each media entity of a plurality of media entities in a data set to at least one class, each class of said at least one class corresponding to a subset of perceptual properties pre-defined for the data set;
processing each media entity of said data set to extract at least one digital signal processing characteristic for each media entity;
generating a plurality of feature vectors for said plurality of media entities, wherein each vector includes said at least one class and said at least one digital signal processing characteristic; and
forming a classification chain based upon said plurality of feature vectors, wherein a human adds a new subset of perceptual properties to the classification chain defined by the plurality of feature vectors.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and methods are provided for automatically classifying data according to perceptual properties of the data to form a classification chain that is suited to the searching and sorting of large databases of media entities. During classification, experts assign each media entity in the training data set to one or more classes, with each class corresponding to a given subset of perceptual properties of the data. In conjunction with digital signal processing properties of the data corresponding to the perceptual properties, the classified data is then used to construct an initial classification chain. During operation, when presented with an unclassified entry, the classification chain returns an estimate of the class of the entry, as well as a confidence measure that is proportional to the level of confidence of the class assignment. Over time, as the classification chain evolves, the classification chain becomes more and more effective for quickly characterizing media entities.
109 Citations
35 Claims
-
1. A method of classifying data according to perceptual properties of the data, the method being suited for searching and sorting large databases of media entities, including music, video and image databases, the method comprising:
-
assigning to each media entity of a plurality of media entities in a data set to at least one class, each class of said at least one class corresponding to a subset of perceptual properties pre-defined for the data set; processing each media entity of said data set to extract at least one digital signal processing characteristic for each media entity; generating a plurality of feature vectors for said plurality of media entities, wherein each vector includes said at least one class and said at least one digital signal processing characteristic; and forming a classification chain based upon said plurality of feature vectors, wherein a human adds a new subset of perceptual properties to the classification chain defined by the plurality of feature vectors. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A computer-readable storage medium having stored thereon instructions for classifying data according to perceptual properties of the data, the instructions being suited for searching and sorting large databases of media entities, including music, video and image databases, the instructions when executed, perform the steps of:
-
assigning to each media entity of a plurality of media entities in a data set to at least one class, each class of said at least one class corresponding to a subset of perceptual properties pre-defined for the data set; processing each media entity of said data set to extract at least one digital signal processing characteristic for each media entity; generating a plurality of feature vectors for said plurality of media entities, wherein each vector includes said at least one class and said at least one digital signal processing characteristic; and forming a classification chain based upon said plurality of feature vectors, wherein a human adds a new subset of perceptual properties to the classification chain defined by the plurality of feature vectors.
-
-
16. A computing system, comprising:
-
a processor including; a classification chain data structure stored thereon having a plurality of classification vectors, wherein each vector includes data representative of at least one perceptual class as classified by humans and digital signal processing data as classified by at least one computing device;
wherein;the classification chain data structure comprises a subset of perceptual properties defined by the plurality of feature vectors newly added thereto; and a human adds a new subset of perceptual properties to the classification chain; and processing means for comparing an unclassified media entity to the classification chain data structure to determine at least one perceptual class of said unclassified media entity. - View Dependent Claims (17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27)
-
-
28. A method of generating a classification chain for searching and sorting databases of media entities including music, video, and image databases, the classification chain having a plurality of vectors describing a plurality of media entities, the method comprising:
-
assigning, by an expert, a first value to a media entity according to a pre-defined perceptual characteristic of media entities; assigning, by a computing system, a second value to the media entity according to a pre-defined digital signal processing characteristic; generating a vector based on at least said first value and said second value; adding said vector to a classification chain data structure, wherein a human adds a new subset of perceptual properties to the classification chain data structure defined by the new unclassified data structure; calculating a neighborhood distance within the vector space of said classification chain for each of said at least one perceptual class, wherein;
said neighborhood distance defines a distance within which two vectors in the classification chain space are in the same neighborhood;said calculating of a neighborhood distance for each of said at least one perceptual class includes determining a distance within which two vectors of the classification chain possess the same class given a threshold degree of error; said threshold degree of error places a maximum limit on the distance that may be used for determining neighborhoods; and said threshold degree of error is determined by a human. - View Dependent Claims (29, 30, 31, 32, 33, 34, 35)
-
Specification