System and methods for providing automatic classification of media entities according to consonance properties
First Claim
1. A method of classifying data according to consonance of the data, the method comprising:
- determining an initial classification for a data set and assigning each media entity of a plurality of media entities in the data set to at least one consonance class comprising a perceived harmony or agreement of media entities as identified by a trained human classifier based on at least one of a song-level attribute or a voice-level attribute as defined by a human user;
processing each media entity of said data set to extract at least one consonance characteristic based on digital signal processing of each media entity, wherein said at least one consonance characteristic relates to a correspondence or a recurrence of sounds in each of said plurality of media entities;
generating a plurality of consonance vectors for said plurality of media entities, wherein each consonance vector includes (1) said at least one consonance class based on the at least one of the song-level attribute or the voice-level attribute as identified by the trained human classifier and (2) said at least one consonance characteristic based on said digital signal processing, and wherein said consonance vectors include a mean energy of a ratio between peaks for all frames in said plurality of media entities and wherein each consonance vector contains the consonance characteristic and the consonance class attributes assigned to the media entity being classified;
forming a classification chain based upon said plurality of consonance vectors;
creating a simple rule when a plurality of classification chains are created that each meet a certain criteria;
testing the simple rule against a pre-defined set of identified media entities to create a general rule which is subjected to analysis by a trained human classifier to determine a classification accuracy of the general rule;
utilizing feedback from the trained human classifier regarding the classification accuracy of the general rule to identify at least one consonance class to create a relational rule; and
storing each created relational rule in a computer memory for later retrieval and use in classification actions.
2 Assignments
0 Petitions
Accused Products
Abstract
In connection with a classification system for classifying media entities that merges perceptual classification techniques and digital signal processing classification techniques for improved classification of media entities, a system and methods are provided for automatically classifying and characterizing musical consonance properties of media entities. Such a system and methods may be useful for the indexing of a database or other storage collection of media entities, such as media entities that are audio files, or have portions that are audio files. The methods also help to determine media entities that have similar consonance by utilizing classification chain techniques that test distances between media entities in terms of their properties. For example, a neighborhood of songs may be determined within which each song has a similar consonance.
-
Citations
8 Claims
-
1. A method of classifying data according to consonance of the data, the method comprising:
-
determining an initial classification for a data set and assigning each media entity of a plurality of media entities in the data set to at least one consonance class comprising a perceived harmony or agreement of media entities as identified by a trained human classifier based on at least one of a song-level attribute or a voice-level attribute as defined by a human user; processing each media entity of said data set to extract at least one consonance characteristic based on digital signal processing of each media entity, wherein said at least one consonance characteristic relates to a correspondence or a recurrence of sounds in each of said plurality of media entities; generating a plurality of consonance vectors for said plurality of media entities, wherein each consonance vector includes (1) said at least one consonance class based on the at least one of the song-level attribute or the voice-level attribute as identified by the trained human classifier and (2) said at least one consonance characteristic based on said digital signal processing, and wherein said consonance vectors include a mean energy of a ratio between peaks for all frames in said plurality of media entities and wherein each consonance vector contains the consonance characteristic and the consonance class attributes assigned to the media entity being classified; forming a classification chain based upon said plurality of consonance vectors; creating a simple rule when a plurality of classification chains are created that each meet a certain criteria; testing the simple rule against a pre-defined set of identified media entities to create a general rule which is subjected to analysis by a trained human classifier to determine a classification accuracy of the general rule; utilizing feedback from the trained human classifier regarding the classification accuracy of the general rule to identify at least one consonance class to create a relational rule; and storing each created relational rule in a computer memory for later retrieval and use in classification actions. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer readable storage medium bearing computer executable instructions implemented on a computer, the computer readable storage medium comprising:
-
an instruction that determines an initial classification and assigns to each media entity of a plurality of media entities in a data set to at least one consonance class comprising a perceived harmony or agreement of media entities as identified by a trained human classifier based on at least one of a song-level attribute or a voice-level attribute defined by a human user; an instruction that processes each media entity of said data set to extract at least one consonance characteristic based on digital signal processing of each media entity, wherein said at least one consonance characteristic relates to a correspondence or recurrence of sounds in each of said plurality of media entities; an instruction that generates a plurality of consonance vectors for said plurality of media entities, wherein each consonance vector includes (1) said at least one consonance class based on the at least one of the song-level attribute or the voice-level attribute as identified by the trained human classifier and (2) said at least one consonance characteristic based on said digital signal processing, and wherein said consonance vectors include a mean energy of a ratio between peaks for all frames in said plurality of media entities and wherein each consonance vector contains the consonance characteristic and the consonance class attributes assigned to the media entity being classified; an instruction that forms a classification chain based upon said plurality of consonance vectors; an instruction that creates a simple rule when a plurality of classification chains are created that each meet a certain criteria; an instruction that tests the simple rule against a pre-defined set of identified media entities to create a general rule which is subjected to analysis by a trained human classifier to determine a classification accuracy of the general rule; an instruction to utilize feedback from the trained human classifier regarding the classification accuracy of the general rule to identify at least one consonance class to create a relational rule; and an instruction that stores each created relational rule in a computer memory.
-
-
8. At least one computing device comprising:
-
means for determining an initial classification and assigning to each media entity of a plurality of media entities in a data set to at least one consonance class comprising a perceived harmony or agreement of media entities as identified by a trained human classifier based on at least one of a song-level attribute or a voice-level attribute as defined by a human user; means for processing each media entity of said data set to extract at least one consonance characteristic based on digital signal processing of each media entity, wherein said at least one consonance characteristic relates to a correspondence or a recurrence of sounds in each of said plurality of media entities; means for generating a plurality of consonance vectors for said plurality of media entities, wherein each consonance vector includes (1) said at least one consonance class based on the at least one of the song-level attribute or the voice-level attribute as identified by the trained human classifier and (2) said at least one consonance characteristic based on digital signal processing, and wherein said consonance vectors include a mean energy of a ratio between peaks for all frames in said plurality of media entities and wherein each consonance vector contains the consonance characteristic and the consonance class attributes assigned to the media entity being classified; means for forming a classification chain based upon said plurality of consonance vectors; means for creating a simple rule when a plurality of classification chains are created that each meet a certain criteria; means for testing the simple rule against a pre-defined set of identified media entities to create a general rule which is subjected to analysis by a trained human classifier to determine a classification accuracy of the general rule; means for utilizing feedback from the trained human classifier regarding the classification accuracy of the general rule to identify at least one consonance class to create a relational rule; and means for storing each created relation rule in a computer memory.
-
Specification