AUTOMATIC LABELING AND CONTROL OF AUDIO ALGORITHMS BY AUDIO RECOGNITION

US 20110075851A1
Filed: 09/28/2010
Published: 03/31/2011
Est. Priority Date: 09/28/2009
Status: Active Grant

Abstract

Disclosed is a technique for controlling a multimedia software application using high-level metadata features and symbolic object labels derived from an audio source. A first pass of low-level signal analysis is followed by a stage of statistical and perceptual processing, and then by a symbolic machine-learning or data-mining processing component. This multi-stage analysis system delivers high-level metadata features, sound object identifiers, stream labels, or other symbolic metadata to the application scripts or programs, which use the data to configure processing chains or map it to other media. Embodiments of the invention can be incorporated into multimedia content players, musical instruments, recording studio equipment, installed and live sound equipment, broadcast equipment, metadata-generation applications, software-as-a-service applications, search engines, and mobile devices.
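As an illustration of how an application script might use delivered labels to configure a processing chain, here is a minimal Python sketch. It is not taken from the patent: the `ProcessingChain` class, the `PRESETS` table, the effect names, and the label names are all hypothetical, and the label stream is assumed to arrive as `(time_in_seconds, label)` pairs.

```python
from dataclasses import dataclass, field


@dataclass
class ProcessingChain:
    """Hypothetical stand-in for an application's effect chain."""
    effects: list = field(default_factory=list)

    def configure(self, effects):
        # Replace the current chain with a copy of the requested effects.
        self.effects = list(effects)


# Hypothetical mapping from sound object labels to effect chains.
PRESETS = {
    "speech": ["high_pass", "compressor"],
    "music": ["limiter"],
}


def apply_labels(chain, labels, presets=PRESETS):
    """Reconfigure the chain whenever the incoming label changes.

    Labels without a preset are ignored. Returns the (time, label)
    pairs at which a reconfiguration actually happened.
    """
    changes = []
    current = None
    for time_s, label in labels:
        if label != current and label in presets:
            chain.configure(presets[label])
            changes.append((time_s, label))
            current = label
    return changes
```

Reconfiguring only when the label changes keeps the chain stable while the analysis stage keeps reporting the same sound object; whether a real application would want that behavior is a design choice, not something the abstract specifies.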

6 Claims

1. A method for multi-stage audio signal analysis, the method comprising:
  - performing a first-stage processing operation on an audio signal, the first stage processing operation including a windowed signal analysis that derives a raw feature vector;
  - performing a second stage statistical processing operation on the raw feature vector to derive a reduced feature vector;
  - performing a third stage processing operation on the reduced feature vector to derive at least one sound object label that refers to the original audio signal; and
  - mapping the at least one sound object label into a stream of control events sent to a sound-object-driven, multimedia-aware software application, wherein any of the processing operations of the first through third stages are configurable and scriptable.
2. The method of claim 1, wherein the audio signal is a file.
3. The method of claim 1, wherein the audio signal is a stream.
4. The method of claim 1, wherein the first stage processing operation is selected from the group consisting of amplitude-detection, FFT, MFCC, LPC, wavelet analysis, spectral measures, and stereo/spatial feature extraction.
5. The method of claim 1, wherein the second stage processing operation is selected from the group consisting of statistical averaging, mean/variance calculation, statistical moments, Gaussian mixture models, principal component analysis (PCA), independent subspace analysis (ISA), hidden Markov models, tempo-tracking, pitch-tracking, peak/partial-tracking, onset detection, segmentation, and/or bark/sone mapping.
6. The method of claim 1, wherein the third stage processing operation is selected from the group consisting of support vector machines (SVM), neural networks (NN), partitioning/clustering, constraint satisfaction, stream labeling, rule-based expert systems, classification according to instrument, genre, artist, etc., musical transcription, and/or sound object source separation.
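
The three stages and the final mapping recited in claim 1 can be sketched end to end in Python. This is a hedged illustration only, not the patented implementation. It assumes amplitude detection plus a zero-crossing rate for the first stage, mean/variance calculation for the second, and a small rule-based labeler for the third. The function names, thresholds, labels, and control events are all hypothetical, and the signal is assumed to be a list of floats at least one frame long.

```python
import math
from statistics import fmean, pvariance


def first_stage(signal, frame_size=256, hop=128):
    """Windowed analysis: one (rms, zero_crossing_rate) pair per frame."""
    raw = []
    for start in range(0, len(signal) - frame_size + 1, hop):
        frame = signal[start:start + frame_size]
        rms = math.sqrt(sum(x * x for x in frame) / frame_size)
        crossings = sum(
            1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0)
        )
        raw.append((rms, crossings / (frame_size - 1)))
    return raw


def second_stage(raw):
    """Statistical reduction: mean and variance of each raw feature."""
    rms_vals = [rms for rms, _ in raw]
    zcr_vals = [zcr for _, zcr in raw]
    return {
        "rms_mean": fmean(rms_vals), "rms_var": pvariance(rms_vals),
        "zcr_mean": fmean(zcr_vals), "zcr_var": pvariance(zcr_vals),
    }


def third_stage(reduced, silence_rms=0.01, noisy_zcr=0.25):
    """Rule-based labeling; both thresholds are illustrative guesses."""
    if reduced["rms_mean"] < silence_rms:
        return "silence"
    return "noisy" if reduced["zcr_mean"] > noisy_zcr else "tonal"


# Hypothetical control events for a downstream application.
EVENTS = {
    "silence": [("mute", True)],
    "tonal": [("mute", False), ("eq_preset", "warm")],
    "noisy": [("mute", False), ("eq_preset", "de-ess")],
}


def analyze(signal, stages=(first_stage, second_stage, third_stage)):
    """Run the stages in order, then map the label to control events."""
    data = signal
    for stage in stages:
        data = stage(data)
    return EVENTS[data]
```

Passing the stages as a tuple is one simple way to make the pipeline configurable in the sense of the claim: a caller can swap any stage for another callable with the same input and output shape.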

Current Assignee: Native Instruments
Original Assignee: iZotope, Inc. (Soundwide GmbH)
Inventors: LeBoeuf, Jay; Pope, Stephen
Granted Patent: US 9,031,243 B2
US Class Current: 381/56
CPC Class Codes:
- G10L 25/51 for comparison or discrimin...
- H04R 29/00 Monitoring arrangements; Te...
