Audio processing techniques for semantic audio recognition and report generation
First Claim
1. An apparatus to determine semantic audio information for audio, the apparatus comprising:
- memory including computer readable instructions; and
a processor to execute the computer readable instructions to;
extract a plurality of audio features from the audio, at least one of the plurality of audio features including at least one of a temporal feature, a spectral feature, a harmonic feature, or a rhythmic feature;
compare the plurality of audio features to a plurality of stored audio feature ranges having tags associated therewith; and
determine a set of ranges of the plurality of stored audio feature ranges having closest matches to the plurality of audio features, a tag associated with the set of ranges having the closest matches to be used to determine the semantic audio information for the audio, wherein the tag is associated with at least one of an audio timbre range, a beat range, a loudness range or a spectral histogram range.
8 Assignments
0 Petitions
Accused Products
Abstract
Example apparatus, articles of manufacture and methods to determine semantic audio information for audio are disclosed. Example methods include extracting a plurality of audio features from the audio, at least one of the plurality of audio features including at least one of a temporal feature, a spectral feature, a harmonic feature, or a rhythmic feature. Example methods also include comparing the plurality of audio features to a plurality of stored audio feature ranges having tags associated therewith. Example methods further include determining a set of ranges of the plurality of stored audio feature ranges having closest matches to the plurality of audio features, a tag associated with the set of ranges having the closest matches to be used to determine the semantic audio information for the audio.
87 Citations
15 Claims
-
1. An apparatus to determine semantic audio information for audio, the apparatus comprising:
-
memory including computer readable instructions; and a processor to execute the computer readable instructions to; extract a plurality of audio features from the audio, at least one of the plurality of audio features including at least one of a temporal feature, a spectral feature, a harmonic feature, or a rhythmic feature; compare the plurality of audio features to a plurality of stored audio feature ranges having tags associated therewith; and determine a set of ranges of the plurality of stored audio feature ranges having closest matches to the plurality of audio features, a tag associated with the set of ranges having the closest matches to be used to determine the semantic audio information for the audio, wherein the tag is associated with at least one of an audio timbre range, a beat range, a loudness range or a spectral histogram range. - View Dependent Claims (2, 3, 4, 5)
-
-
6. An article of manufacture comprising computer readable instructions that, when executed, cause a computing device to at least:
-
extract a plurality of audio features from the audio, at least one of the plurality of audio features including at least one of a temporal feature, a spectral feature, a harmonic feature, or a rhythmic feature; compare the plurality of audio features to a plurality of stored audio feature ranges having tags associated therewith; and determine a set of ranges of the plurality of stored audio feature ranges having closest matches to the plurality of audio features, a tag associated with the set of ranges having the closest matches to be used to determine the semantic audio information for the audio, wherein the tag is associated with at least one of an audio timbre range, a beat range, a loudness range or a spectral histogram range. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A method to determine semantic audio information for audio, the method comprising:
-
extracting, by executing an instruction with a processor, a plurality of audio features from the audio, at least one of the plurality of audio features including at least one of a temporal feature, a spectral feature, a harmonic feature, or a rhythmic feature; comparing, by executing an instruction with the processor, the plurality of audio features to a plurality of stored audio feature ranges having tags associated therewith; and determining, by executing an instruction with the processor, a set of ranges of the plurality of stored audio feature ranges having closest matches to the plurality of audio features, a tag associated with the set of ranges having the closest matches to be used to determine the semantic audio information for the audio, wherein the tag is associated with at least one of an audio timbre range, a beat range, a loudness range or a spectral histogram range. - View Dependent Claims (12, 13, 14, 15)
-
Specification