Audio Processing Techniques for Semantic Audio Recognition and Report Generation
First Claim
1. A apparatus to determine semantic audio information for audio, the apparatus comprising:
- memory including computer readable instructions; and
a processor to execute the computer readable instructions to;
extract a plurality of audio features from the audio, at least one of the plurality of audio features including at least one of a temporal feature, a spectral feature, a harmonic feature, or a rhythmic feature;
compare the plurality of audio features to a plurality of stored audio feature ranges having tags associated therewith; and
determine a set of ranges of the plurality of stored audio feature ranges having closest matches to the plurality of audio features, a tag associated with the set of ranges having the closest matches to be used to determine the semantic audio information for the audio.
8 Assignments
0 Petitions
Accused Products
Abstract
Example apparatus, articles of manufacture and methods to determine semantic audio information for audio are disclosed. Example methods include extracting a plurality of audio features from the audio, at least one of the plurality of audio features including at least one of a temporal feature, a spectral feature, a harmonic feature, or a rhythmic feature. Example methods also include comparing the plurality of audio features to a plurality of stored audio feature ranges having tags associated therewith. Example methods further include determining a set of ranges of the plurality of stored audio feature ranges having closest matches to the plurality of audio features, a tag associated with the set of ranges having the closest matches to be used to determine the semantic audio information for the audio.
11 Citations
20 Claims
-
1. A apparatus to determine semantic audio information for audio, the apparatus comprising:
-
memory including computer readable instructions; and a processor to execute the computer readable instructions to; extract a plurality of audio features from the audio, at least one of the plurality of audio features including at least one of a temporal feature, a spectral feature, a harmonic feature, or a rhythmic feature; compare the plurality of audio features to a plurality of stored audio feature ranges having tags associated therewith; and determine a set of ranges of the plurality of stored audio feature ranges having closest matches to the plurality of audio features, a tag associated with the set of ranges having the closest matches to be used to determine the semantic audio information for the audio. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. An article of manufacture comprising computer readable instructions that, when executed, cause a computing device to at least:
-
extract a plurality of audio features from the audio, at least one of the plurality of audio features including at least one of a temporal feature, a spectral feature, a harmonic feature, or a rhythmic feature, compare the plurality of audio features to a plurality of stored audio feature ranges having tags associated therewith; and determine a set of ranges of the plurality of stored audio feature ranges having closest matches to the plurality of audio features, a tag associated with the set of ranges having the closest matches to be used to determine the semantic audio information for the audio. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A method to determine semantic audio information for audio, the method comprising:
-
extracting, by executing an instruction with a processor, a plurality of audio features from the audio, at least one of the plurality of audio features including at least one of a temporal feature, a spectral feature, a harmonic feature, or a rhythmic feature; comparing, by executing an instruction with the processor, the plurality of audio features to a plurality of stored audio feature ranges having tags associated therewith; and determining, by executing an instruction with the processor, a set of ranges of the plurality of stored audio feature ranges having closest matches to the plurality of audio features, a tag associated with the set of ranges having the closest matches to be used to determine the semantic audio information for the audio. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification