Audio matching with semantic audio recognition and report generation
First Claim
1. A processor-based method for producing supplemental information for audio signature data, comprising:
- obtaining the audio signature data during a first time period, the audio signature data including data relating to at least one of time or frequency representing a first characteristic of media;
obtaining first semantic audio signature data for the first time period, the first semantic audio signature data being a measure of at least one of a temporal feature, a spectral feature, a harmonic feature, or a rhythmic feature relating to a second characteristic of the media; and
storing in a memory, the audio signature data of the first time period in association with a second time period when the processor determines that second semantic audio signature data for the second time period substantially matches the first semantic audio signature data for the first time period.
10 Assignments
0 Petitions
Accused Products
Abstract
System, apparatus and method for determining semantic information from audio, where incoming audio is sampled and processed to extract audio features, including temporal, spectral, harmonic and rhythmic features. The extracted audio features are compared to stored audio templates that include ranges and/or values for certain features and are tagged for specific ranges and/or values. The semantic information may be associated with audio signature data
Extracted audio features that are most similar to one or more templates from the comparison are identified according to the tagged information. The tags are used to determine the semantic audio data that includes genre, instrumentation, style, acoustical dynamics, and emotive descriptor for the audio signal.
-
Citations
39 Claims
-
1. A processor-based method for producing supplemental information for audio signature data, comprising:
-
obtaining the audio signature data during a first time period, the audio signature data including data relating to at least one of time or frequency representing a first characteristic of media; obtaining first semantic audio signature data for the first time period, the first semantic audio signature data being a measure of at least one of a temporal feature, a spectral feature, a harmonic feature, or a rhythmic feature relating to a second characteristic of the media; and storing in a memory, the audio signature data of the first time period in association with a second time period when the processor determines that second semantic audio signature data for the second time period substantially matches the first semantic audio signature data for the first time period. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. An apparatus for producing supplemental information for audio signature data, the apparatus including:
-
a processor to; obtain the audio signature data during a first time period, the audio signature data including data relating to at least one of time or frequency representing a first characteristic of media; obtain first semantic audio signature data for the first time period, the first semantic audio signature data being a measure of at least one of a temporal feature, a spectral feature, a harmonic feature, or a rhythmic feature relating to a second characteristic of the media; and a memory to store the audio signature data of the first time period in association with a second time period when the processor determines that second semantic audio signature data for the second time period substantially matches the first semantic audio signature data for the first time period. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A processor-based method for producing supplemental information for audio signature data, comprising:
-
obtaining the audio signature data at an input from a data network, the audio signature data received from a device, the audio signature data including data relating to at least one of time or frequency representing a first characteristic of media; obtaining semantic audio signature data at the input from the data network, the semantic audio signature data received from the device, the semantic audio signature data being a measure of at least one of a temporal feature, a spectral feature, a harmonic feature, or a rhythmic feature relating to a second characteristic of the media; associating the semantic audio signature data to the audio signature data using a processor; and processing the associated semantic audio signature data and audio signature data to determine a change in the second characteristic relative to the first characteristic. - View Dependent Claims (16, 17, 18, 19, 20)
-
-
21. An article of manufacture comprising instructions that, when executed, cause a processor to at least:
-
obtain audio signature data during a first time period, the audio signature data including data relating to at least one of time or frequency representing a first characteristic of media; obtain first semantic audio signature data for the first time period, the first semantic audio signature data being a measure of at least one of a temporal feature, a spectral feature, a harmonic feature, or a rhythmic feature relating to a second characteristic of the media; and store in a memory, the audio signature data of the first time period in association with a second time period when the processor determines that second semantic audio signature data for the second time period substantially matches the first semantic audio signature data for the first time period. - View Dependent Claims (22, 23, 24, 25, 26, 27)
-
-
28. An apparatus for producing supplemental information for audio signature data, the apparatus including:
a processor to; obtain the audio signature data at an input from a data network, the audio signature data received from a device, the audio signature data including data relating to at least one of time or frequency representing a first characteristic of media; obtain semantic audio signature data at the input from the data network, the semantic audio signature data received from the device, the semantic audio signature data being a measure of at least one of a temporal feature, a spectral feature, a harmonic feature, or a rhythmic feature relating to a second characteristic of the media; associate the semantic audio signature data to the audio signature data; and process the associated semantic audio signature data and audio signature data to determine a change in the second characteristic relative to the first characteristic. - View Dependent Claims (29, 30, 31, 32, 33)
-
34. An article of manufacture comprising instructions that, when executed, cause a processor to at least:
-
obtain the audio signature data at an input from a data network, the audio signature data received from a device, the audio signature data including data relating to at least one of time or frequency representing a first characteristic of media; obtain semantic audio signature data at the input from the data network, the semantic audio signature data received from the device, the semantic audio signature data being a measure of at least one of a temporal feature, a spectral feature, a harmonic feature, or a rhythmic feature relating to a second characteristic of the media; associate the semantic audio signature data to the audio signature data; and process the associated semantic audio signature data and audio signature data to determine a change in the second characteristic relative to the first characteristic. - View Dependent Claims (35, 36, 37, 38, 39)
-
Specification