Method and apparatus for classifying mood of music at high speed
First Claim
Patent Images
1. A method of classifying a mood of a music file, comprising:
- extracting a Modified Discrete Cosine Transformation-based timbre feature from a compressed domain of a music file;
extracting a Modified Discrete Cosine Transformation-based tempo feature from the compressed domain of the music file; and
classifying a mood of the music file based on the extracted timbre feature and the extracted tempo feature,wherein the extracting the Modified Discrete Cosine Transformation-based timbre feature from the compressed domain of the music file comprises;
extracting Modified Discrete Cosine Transformation coefficients by decoding a part of the music file;
selecting the Modified Discrete Cosine Transformation coefficients of a predetermined number of sub-bands from the extracted Modified Discrete Cosine Transformation coefficients; and
extracting a spectral centroid, a bandwidth, a rolloff, and a flux from the selected Modified Discrete Cosine Transformation coefficients,wherein the classifying the mood of the music file comprises;
classifying a genre of the music file based on the extracted timbre feature; and
reclassifying a category of the music file of the genre when uncertainty of a genre classification result is greater than a predetermined value; and
wherein, in the reclassifying a category of the music file of the genre, the category of the music file of the genre is reclassified based on the extracted tempo feature.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and apparatus for classifying mood of music at high speed. The method includes: extracting a Modified Discrete Cosine Transformation-based timbre feature from a compressed domain of a music file; extracting a Modified Discrete Cosine Transformation-based tempo feature from the compressed domain of the music file; and classifying the mood of the music file based on the extracted timbre feature and the extracted tempo feature.
113 Citations
13 Claims
-
1. A method of classifying a mood of a music file, comprising:
-
extracting a Modified Discrete Cosine Transformation-based timbre feature from a compressed domain of a music file; extracting a Modified Discrete Cosine Transformation-based tempo feature from the compressed domain of the music file; and classifying a mood of the music file based on the extracted timbre feature and the extracted tempo feature, wherein the extracting the Modified Discrete Cosine Transformation-based timbre feature from the compressed domain of the music file comprises; extracting Modified Discrete Cosine Transformation coefficients by decoding a part of the music file; selecting the Modified Discrete Cosine Transformation coefficients of a predetermined number of sub-bands from the extracted Modified Discrete Cosine Transformation coefficients; and extracting a spectral centroid, a bandwidth, a rolloff, and a flux from the selected Modified Discrete Cosine Transformation coefficients, wherein the classifying the mood of the music file comprises; classifying a genre of the music file based on the extracted timbre feature; and reclassifying a category of the music file of the genre when uncertainty of a genre classification result is greater than a predetermined value; and wherein, in the reclassifying a category of the music file of the genre, the category of the music file of the genre is reclassified based on the extracted tempo feature. - View Dependent Claims (2, 3, 4, 5, 6, 7, 9)
-
-
8. A method of classifying a mood of a music file, comprising:
-
extracting a Modified Discrete Cosine Transformation-based timbre feature from a compressed domain of a music file; extracting a Modified Discrete Cosine Transformation-based tempo feature from the compressed domain of the music file; and classifying a mood of the music file based on the extracted timbre feature and the extracted tempo feature, wherein the extracting the Modified Discrete Cosine Transformation-based timbre feature from the compressed domain of the music file comprises; extracting Modified Discrete Cosine Transformation coefficients by decoding a part of the music file; selecting the Modified Discrete Cosine Transformation coefficients of a predetermined number of sub-bands from the extracted Modified Discrete Cosine Transformation coefficients; and extracting a spectral centroid, a bandwidth, a rolloff, and a flux from the selected Modified Discrete Cosine Transformation coefficients, wherein the classifying the mood of the music file comprises; classifying a genre of the music file based on the extracted timbre feature; and reclassifying a category of the music file of the genre based on the extracted tempo feature when uncertainty of a genre classification result is greater than a predetermined value; wherein, in the classifying a genre of the music file, the genre of the music file is classified into one of a sad, an exciting, a calm in classic, a calm in pop, a pleasant in pop, a pleasant in classic, and a pleasant in jazz genre, and wherein, in the reclassifying a category of the music file of the genre, the category of the music file classified into the pleasant in classic genre is reclassified into the calm and the pleasant in classic genres according to the extracted tempo feature, and the category of the music file classified into the pleasant in jazz genre is reclassified into the sad and the pleasant in jazz genres according to the extracted tempo feature.
-
-
10. An apparatus for classifying a mood of a music file, comprising:
-
a timbre extraction unit to extract a Modified Discrete Cosine Transformation-based timbre feature from a compressed domain of a music file; a tempo extraction unit to extract a Modified Discrete Cosine Transformation-based tempo feature from the compressed domain of the music file; and a mood classification unit to classify the mood of the music file based on the extracted timbre feature and the extracted tempo feature, wherein the timbre extraction unit extracts the Modified Discrete Cosine Transformation-based timbre feature from the compressed domain of the music file by extracting Modified Discrete Cosine Transformation coefficients by decoding a part of the music file;
selecting the Modified Discrete Cosine Transformation coefficients of a predetermined number of sub-bands from the extracted Modified Discrete Cosine Transformation coefficients; and
extracting a spectral centroid, a bandwidth, a rolloff, and a flux from the selected Modified Discrete Cosine Transformation coefficients, andwherein the mood classification unit comprises; a first classification unit classifying a genre of the music file based on the extracted timbre feature; and a second classification unit reclassifying a category of the music file based on the extracted tempo feature when uncertainty of a genre classification result is greater than a predetermined value. - View Dependent Claims (11)
-
-
12. A method of improving a reliability of music mood classification, comprising:
-
extracting Modified Discrete Cosine Transformation-based timbre and tempo features from a compressed portion of a music file; and classifying a mood of the music file by classifying a genre of the music file based on the extracted timbre feature, ascertaining whether the genre resulting from the genre classification has an uncertainty in excess of a threshold amount, and reclassifying a category of the music file of the genre when the uncertainty of the genre classification exceeds the threshold amount, wherein the extracting the Modified Discrete Cosine Transformation-based timbre feature from the compressed domain of the music file comprises; extracting Modified Discrete Cosine Transformation coefficients by decoding a part of the music file; selecting the Modified Discrete Cosine Transformation coefficients of a predetermined number of sub-bands from the extracted Modified Discrete Cosine Transformation coefficients; and extracting a spectral centroid, a bandwidth, a rolloff, and a flux from the selected Modified Discrete Cosine Transformation coefficients, and wherein, in the reclassifying a category of the music file of the genre, the category of the music file of the genre is reclassified based on the extracted tempo feature when the genre classification exceeds the threshold amount. - View Dependent Claims (13)
-
Specification