Apparatus and method for generating audio output signals using object based metadata
First Claim
1. Apparatus for generating at least one audio output signal representing a superposition of at least two different audio objects, the at least two different audio objects comprising a first audio object and a second audio object, the apparatus comprising:
- a processor arranged to process an audio input signal, the audio input signal being an object downmix comprising the first audio object and the second audio object, to provide an object representation of the audio input signal, in which the first audio object and the second audio object are separated from each other, in which the first audio object and the second audio object are available as a first audio object signal and a separate second audio object signal, and in which the first audio object signal and the second audio object signal are manipulatable independently from each other;
an object manipulator arranged to manipulate the first audio object signal based on audio object based metadata referring to the first audio object to obtain a manipulated first audio object signal for the first audio object; and
an object mixer arranged to combine the manipulated first audio object signal with the second audio object signal, the second audio object signal not being manipulated by the object manipulator or arranged to combine the manipulated first audio object signal with a manipulated second audio object signal, the manipulated second audio object signal being manipulated by the object manipulator based on audio object based metadata referring to the second audio object in a different way as compared to the manipulated first audio object signal.
1 Assignment
0 Petitions
Accused Products
Abstract
An apparatus for generating at least one audio output signal representing a superposition of at least two different audio objects comprises a processor for processing an audio input signal to provide an object representation of the audio input signal, where this object representation can be generated by a parametrically guided approximation of original objects using an object downmix signal. An object manipulator individually manipulates objects using audio object based metadata referring to the individual audio objects to obtain manipulated audio objects. The manipulated audio objects are mixed using an object mixer for finally obtaining an audio output signal having one or several channel signals depending on a specific rendering setup.
-
Citations
8 Claims
-
1. Apparatus for generating at least one audio output signal representing a superposition of at least two different audio objects, the at least two different audio objects comprising a first audio object and a second audio object, the apparatus comprising:
-
a processor arranged to process an audio input signal, the audio input signal being an object downmix comprising the first audio object and the second audio object, to provide an object representation of the audio input signal, in which the first audio object and the second audio object are separated from each other, in which the first audio object and the second audio object are available as a first audio object signal and a separate second audio object signal, and in which the first audio object signal and the second audio object signal are manipulatable independently from each other; an object manipulator arranged to manipulate the first audio object signal based on audio object based metadata referring to the first audio object to obtain a manipulated first audio object signal for the first audio object; and an object mixer arranged to combine the manipulated first audio object signal with the second audio object signal, the second audio object signal not being manipulated by the object manipulator or arranged to combine the manipulated first audio object signal with a manipulated second audio object signal, the manipulated second audio object signal being manipulated by the object manipulator based on audio object based metadata referring to the second audio object in a different way as compared to the manipulated first audio object signal. - View Dependent Claims (2, 3)
-
-
4. Method of generating at least one audio output signal representing a superposition of at least two different audio objects, the at least two different audio objects comprising a first audio object and a second audio object, the method comprising:
-
processing an audio input signal, the audio input signal being an object downmix comprising the first audio object and the second audio object, to provide an object representation of the audio input signal, in which the first audio object and the second audio object are separated from each other, in which the first audio object and the second audio object are available as a first audio object signal and a separate second audio object signal, and in which the first audio object signal and the second audio object signal are manipulatable independently from each other; manipulating the first audio object signal based on audio object based metadata referring to the first audio object to obtain a manipulated first audio object signal for the first audio object; and combining the manipulated first audio object signal with the second audio object signal, the second audio object signal not being manipulated by the manipulating;
orcombining the manipulated first audio object signal with a manipulated second audio object signal, the manipulated second audio object signal being manipulated by the manipulating based on audio object based metadata referring to the second audio object in a different way compared to the manipulated first audio object signal. - View Dependent Claims (5)
-
-
6. Apparatus for generating at least one audio output signal representing a superposition of at least two different audio objects, the at least two different audio objects comprising a first audio object and a second audio object, the apparatus comprising:
-
a processor arranged to process an audio input signal to provide an object representation of the audio input signal, in which the first audio object and the second audio object are separated from each other, in which the first audio object and the second audio object are available as a first audio object signal and a separate second audio object signal, and in which the first audio object signal and the second audio object signal are manipulatable independently from each other; a first object downmixer arranged to distribute the first audio object signal into output channels using rendering information to obtain a first plurality of first object component signals; a second object downmixer arranged to distribute the second audio object signal into the output channels using the rendering information to obtain a second plurality of second object component signals; an object manipulator arranged to manipulate each of the first object component signals of the first plurality in a same manner based on audio object based metadata referring to the first audio object to obtain manipulated first object component signals for the first audio object, and an object mixer arranged to add, per output channel, the manipulated first object component signals with the second object component signals not being manipulated by the object manipulator or arranged to add, per output channel, the manipulated first object component signals with manipulated second object component signals, the manipulated second object component signals being manipulated by the object manipulator in a same manner based on audio object based metadata referring to the second audio object, the manipulated second object component signals being manipulated in a different way compared to the manipulated first object component signals.
-
-
7. Method of generating at least one audio output signal representing a superposition of at least two different audio objects, the at least two different audio objects comprising a first audio object and a second audio object, the method comprising:
-
processing an audio input signal to provide an object representation of the audio input signal, in which the first audio object and the second audio object are separated from each other, in which the first audio object and the second audio object are available as a first audio object signal and a separate second audio object signal, and in which the first audio object signal and the second audio object signal are manipulatable independently from each other; distributing the first audio object signal into output channels using rendering information to obtain a first plurality of first object component signals; distributing the second audio object signal into the output channels using the rendering information to obtain a second plurality of second object component signals; manipulating each of the first object component signals of the first plurality in a same manner based on audio object based metadata referring to the first audio object to obtain manipulated first object component signals for the first audio object; and adding, per output channel, the manipulated first object component signals with the second object component signals not being manipulated by the manipulating, or adding, per output channel, the manipulated first object component signals with manipulated second object component signals, the manipulated second object component signals being manipulated by the manipulating in a same manner based on audio object based metadata referring to the second audio object, the manipulated second object component signals being manipulated in a different way compared to the manipulated first object component signals. - View Dependent Claims (8)
-
Specification