System and methods for providing automatic classification of media entities according to tempo properties
Abstract
In connection with a classification system that merges perceptual classification techniques and digital signal processing classification techniques for improved classification of media entities, a system and methods are provided for automatically classifying and characterizing tempo properties of media entities. Such a system and methods may be useful for indexing a database or other storage collection of media entities, such as media entities that are audio files or that have portions that are audio files. The methods also help to identify media entities that have similar tempo(s), or dissimilar tempo(s) if a request so indicates, by using classification chain techniques that test distances between media entities in terms of their properties. For example, a neighborhood of songs may be determined within which each song has similar tempo characteristics.
35 Claims
-
1. A method for automatically classifying tempo properties of audio data, comprising:
-
reducing input audio data to a lower resolution;
applying an accentuation process to the input audio data to accentuate the tempo properties of the data and applying a smoothing process to smooth at least one of an aberration, bias and trend of the tempo properties of the input audio data;
generating a plurality of bins of data that correspond to at least one tempo property of the input audio data; and
generating a tempo property feature vector from said at least one tempo property.
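The steps of claim 1 can be illustrated with a minimal sketch. The claim does not name specific algorithms, so every concrete choice here is an assumption made for illustration only: block averaging for the resolution reduction, a half-wave rectified difference for the accentuation, mean removal for the smoothing of bias, and autocorrelation lags as the bins. All function names are hypothetical.

```python
from typing import List


def downsample(samples: List[float], factor: int) -> List[float]:
    """Reduce resolution: mean absolute value over non-overlapping blocks (assumed choice)."""
    return [
        sum(abs(s) for s in samples[i:i + factor]) / factor
        for i in range(0, len(samples) - factor + 1, factor)
    ]


def accentuate(envelope: List[float]) -> List[float]:
    """Accentuate onsets: keep only positive jumps between neighbours (assumed choice)."""
    return [max(0.0, envelope[i] - envelope[i - 1]) for i in range(1, len(envelope))]


def smooth(signal: List[float]) -> List[float]:
    """Remove a constant bias by subtracting the mean (assumed choice)."""
    mean = sum(signal) / len(signal)
    return [x - mean for x in signal]


def tempo_bins(signal: List[float], min_lag: int, max_lag: int) -> List[float]:
    """One bin per lag: autocorrelation of the signal at that lag (assumed choice)."""
    return [
        sum(signal[i] * signal[i + lag] for i in range(len(signal) - lag))
        for lag in range(min_lag, max_lag + 1)
    ]


def tempo_feature_vector(samples: List[float], factor: int,
                         min_lag: int, max_lag: int) -> List[float]:
    """Chain the steps and scale the bins so the strongest bin equals 1."""
    bins = tempo_bins(smooth(accentuate(downsample(samples, factor))), min_lag, max_lag)
    peak = max(bins)
    return [b / peak for b in bins] if peak > 0 else [0.0] * len(bins)


# Usage: a synthetic click every 32 raw samples, i.e. every 8 reduced samples.
raw = [1.0 if i % 32 == 0 else 0.0 for i in range(640)]
vector = tempo_feature_vector(raw, factor=4, min_lag=2, max_lag=20)
strongest_lag = vector.index(max(vector)) + 2  # offset by min_lag
```

In this sketch the position of the strongest bin reflects the dominant beat period of the reduced signal; a real system would likely use more elaborate filtering at each stage.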
-
-
19. A method of classifying data according to tempo properties of the data, comprising:
-
assigning each media entity of a plurality of media entities in a data set to at least one tempo properties class;
processing each media entity of said data set to extract at least one tempo properties characteristic based on digital signal processing of each media entity;
generating pluralities of tempo properties bins of data for said plurality of media entities, wherein each plurality of tempo properties bins includes said at least one tempo properties class and at least one tempo properties characteristic based on digital signal processing; and
forming a classification chain based upon said pluralities of tempo properties bins of data.
-
20. A method according to claim 19, further including:
-
processing an unclassified media entity to extract at least one tempo properties characteristic based on digital signal processing of the unclassified media entity;
presenting said at least one digital signal processing tempo properties characteristic for the unclassified media entity to the classification chain;
calculating an estimate of the tempo properties class via a comparison process between said at least one digital signal processing tempo properties characteristic and the classification chain; and
classifying the unclassified entry with the estimate of the tempo properties class.
-
-
21. A method according to claim 20, wherein said processing includes cycling through each of the points in the classification chain and generating a table of distance values and tempo values.
-
22. A method according to claim 21, wherein cycling through a point in the classification chain includes computing a plurality of distances, with each distance being computed between a dimension represented by the at least one digital signal processing tempo properties characteristic and a dimension represented by a point in the classification chain, and taking the square root of the result of summing the squares of the plurality of distances, to produce a distance for the point.
-
23. A method according to claim 22, wherein if said distance for the point is less than a threshold value, the distance for the point and the tempo value associated with the point is added to the table.
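Claims 21 to 23 describe a walk over the classification chain that can be sketched as follows. The representation of a chain point as a (feature list, tempo value) pair and the function names are assumptions made for this example; the distance itself follows claim 22 (square root of the sum of squared per-dimension differences), and the threshold test follows claim 23.

```python
import math
from typing import List, Sequence, Tuple

# Hypothetical representation: each chain point pairs DSP features with a tempo value.
ChainPoint = Tuple[Sequence[float], float]


def euclidean_distance(query: Sequence[float], point: Sequence[float]) -> float:
    """Square root of the sum of squared differences, one difference per dimension."""
    return math.sqrt(sum((q - p) ** 2 for q, p in zip(query, point)))


def build_distance_table(query: Sequence[float], chain: List[ChainPoint],
                         threshold: float) -> List[Tuple[float, float]]:
    """Cycle through the chain and keep (distance, tempo) for points under the threshold."""
    table = []
    for features, tempo in chain:
        distance = euclidean_distance(query, features)
        if distance < threshold:
            table.append((distance, tempo))
    return table


# Usage with made-up numbers: only the two nearest points pass the threshold.
chain = [([0.0, 0.0], 120.0), ([3.0, 4.0], 90.0), ([10.0, 10.0], 60.0)]
table = build_distance_table([0.0, 0.0], chain, threshold=6.0)
```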
-
24. A method according to claim 21, wherein confidence values for the distance values in the table are calculated in proportion to the distance values, respectively.
-
25. A method according to claim 24, wherein a confidence value for a distance value is computed in exponential proportion to the distance value.
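One possible reading of claims 24 and 25 is sketched below. The claims do not give a formula, so the decaying exponential exp(-distance / scale), the assumption that closer points deserve higher confidence, and the `scale` parameter are all illustrative choices rather than the claimed computation.

```python
import math
from typing import List, Tuple


def confidence(distance: float, scale: float = 1.0) -> float:
    """Assumed form: confidence decays exponentially as distance grows."""
    return math.exp(-distance / scale)


def add_confidences(table: List[Tuple[float, float]],
                    scale: float = 1.0) -> List[Tuple[float, float, float]]:
    """Extend each (distance, tempo) row with its confidence value."""
    return [(d, tempo, confidence(d, scale)) for d, tempo in table]


# Usage: a zero-distance match gets confidence 1, farther matches get less.
rows = add_confidences([(0.0, 120.0), (5.0, 90.0)], scale=2.0)
```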
-
26. A method according to claim 24, wherein a tempo component and a harmonic component are computed for each of said tempo values in the table.
-
27. A method according to claim 26, wherein for each of said tempo values, the log base two of the tempo value divided by 10 is computed, and the integer portion of the result is the harmonic component and the fractional portion of the result is the tempo component.
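The split in claim 27 can be sketched as below. The wording is ambiguous between log2(tempo / 10) and log2(tempo) / 10; this sketch assumes the former, since the latter would give an integer portion of zero for any ordinary tempo. The function name is hypothetical.

```python
import math
from typing import Tuple


def split_tempo(tempo: float) -> Tuple[int, float]:
    """Return (harmonic component, tempo component) from log2(tempo / 10).

    Assumes tempo >= 10 so that the integer portion is non-negative.
    """
    value = math.log2(tempo / 10.0)
    harmonic = math.floor(value)        # integer portion
    return harmonic, value - harmonic   # fractional portion


# Usage: doubling the tempo raises the harmonic component by one
# and leaves the tempo component unchanged.
h1, t1 = split_tempo(120.0)
h2, t2 = split_tempo(240.0)
```

Under this reading, tempos related by a factor of two share a tempo component and differ only in the harmonic component, which is what makes the circular averaging of tempo components meaningful.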
-
28. A method according to claim 26, further including an averaging process wherein said tempo components are averaged to generate an intermediate estimated tempo class.
-
29. A method according to claim 28, wherein calculating the intermediate estimated tempo class includes mapping the tempo components to angular representations between 0 and 2π, calculating the sine and cosine of the angular representations and multiplying the sine values and cosine values by the corresponding confidence values, calculating the inverse tangent of the division of the sum of the multiplied sine values by the sum of the multiplied cosine values and mapping the resulting angle to a value between zero and one.
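The averaging in claim 29 is a confidence-weighted circular mean, which can be sketched as follows. The function name is hypothetical, and `math.atan2` is used here as a quadrant-aware substitute for the inverse tangent of the quotient named in the claim; that substitution is an assumption of this sketch.

```python
import math
from typing import Sequence


def intermediate_tempo_class(components: Sequence[float],
                             confidences: Sequence[float]) -> float:
    """Confidence-weighted circular mean of tempo components, mapped back to [0, 1)."""
    two_pi = 2.0 * math.pi
    # Map each tempo component in [0, 1) to an angle in [0, 2*pi).
    sin_sum = sum(w * math.sin(two_pi * c) for c, w in zip(components, confidences))
    cos_sum = sum(w * math.cos(two_pi * c) for c, w in zip(components, confidences))
    # atan2 returns an angle in (-pi, pi]; wrap it into [0, 2*pi) and rescale.
    angle = math.atan2(sin_sum, cos_sum) % two_pi
    return angle / two_pi


# Usage: components 0.95 and 0.05 sit on either side of the wrap-around point,
# so their circular mean lies at the wrap point rather than at 0.5.
wrapped = intermediate_tempo_class([0.95, 0.05], [1.0, 1.0])
weighted = intermediate_tempo_class([0.2, 0.4], [3.0, 1.0])
```

An ordinary arithmetic mean of 0.95 and 0.05 would give 0.5, the opposite side of the circle, which is presumably why the claim works with angles.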
-
30. A method according to claim 28, further including generating a harmonic component estimate of the unclassified media entity based on the intermediate estimated tempo class.
-
31. A method according to claim 30, wherein said estimated tempo class of the unclassified media entity is calculated based on said harmonic component estimate and said intermediate estimated tempo class.
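Claims 30 and 31 do not state how the harmonic component estimate and the intermediate estimated tempo class are combined. One plausible sketch, assuming the combination simply inverts the log2(tempo / 10) decomposition of claim 27, is shown below; the function name and the inversion itself are assumptions.

```python
import math


def combine_tempo(harmonic: int, tempo_component: float) -> float:
    """Assumed inverse of the claim 27 split: tempo = 10 * 2 ** (harmonic + component)."""
    return 10.0 * 2.0 ** (harmonic + tempo_component)


# Usage: rebuilding a tempo from its two components.
component = math.log2(12.0) - 3     # tempo component of 120 under the claim 27 split
estimate = combine_tempo(3, component)
```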
-
32. A method according to claim 20, wherein said classifying of the unclassified entry with the estimate of the tempo properties class includes returning at least one number indicating an overall confidence of the estimate of the tempo properties class.
-
33. A computer readable medium bearing computer executable instructions for carrying out the method of claim 19.
-
34. A modulated data signal carrying computer executable instructions for performing the method of claim 19.
-
35. At least one computing device comprising means for performing the method of claim 19.
Specification