AUTOMATIC LABELING AND CONTROL OF AUDIO ALGORITHMS BY AUDIO RECOGNITION
First Claim
1. A method for multi-stage audio signal analysis, the method comprising:
- performing a first-stage processing operation on an audio signal, the first stage processing operation including a windowed signal analysis that derives a raw feature vector;
performing a second stage statistical processing operation on the raw feature vector to derive a reduced feature vector;
performing a third stage processing operation on the reduced feature vector to derive at least one sound object label that refers to the original audio signal; and
mapping the at least one sound object label into a stream of control events sent to a sound-object-driven, multimedia-aware software application, wherein any of the processing operations of the first through third stages are configurable and scriptable.
6 Assignments
0 Petitions
Accused Products
Abstract
Controlling a multimedia software application using high-level metadata features and symbolic object labels derived from an audio source, wherein a first-pass of low-level signal analysis is performed, followed by a stage of statistical and perceptual processing, followed by a symbolic machine-learning or data-mining processing component is disclosed. This multi-stage analysis system delivers high-level metadata features, sound object identifiers, stream labels or other symbolic metadata to the application scripts or programs, which use the data to configure processing chains, or map it to other media. Embodiments of the invention can be incorporated into multimedia content players, musical instruments, recording studio equipment, installed and live sound equipment, broadcast equipment, metadata-generation applications, software-as-a-service applications, search engines, and mobile devices.
-
Citations
6 Claims
-
1. A method for multi-stage audio signal analysis, the method comprising:
-
performing a first-stage processing operation on an audio signal, the first stage processing operation including a windowed signal analysis that derives a raw feature vector; performing a second stage statistical processing operation on the raw feature vector to derive a reduced feature vector; performing a third stage processing operation on the reduced feature vector to derive at least one sound object label that refers to the original audio signal; and mapping the at least one sound object label into a stream of control events sent to a sound-object-driven, multimedia-aware software application, wherein any of the processing operations of the first through third stages are configurable and scriptable. - View Dependent Claims (2, 3, 4, 5, 6)
-
Specification