Converting multi-microphone captured signals to shifted signals useful for binaural signal processing and use thereof
First Claim
1. A method comprising:
- estimating directional information based on multiple input channel signals representing at least one arriving sound from a sound source captured by respective multiple microphones that have respective known locations relative to each other;
deriving a mid-signal and a side signal on basis of a first input channel signal, a second input channel signal and said estimated directional information; and
generating an output signal comprising a plurality of output channels using said mid-signal, said side signal and said estimated directional information such that the output signal retains a spatial representation of the captured at least one arriving sound, wherein said generating comprises processing the mid-signal and the side signal using said estimated directional information, and combining the processed mid-signal and the processed side signal to determine at least a left channel signal and a right channel signal of said output signal that retains the spatial representation of the captured at least one arriving sound.
2 Assignments
0 Petitions
Accused Products
Abstract
A method includes, for each of a number of subbands of a frequency range and for at least first and second frequency-domain signals that are frequency-domain representations of corresponding first and second audio signals: determining a time delay of the first frequency-domain signal that removes a time difference between the first and second frequency-domain signals in the subband. The method includes forming a first resultant signal including, for each of the number of subbands, a sum of one of the first or second frequency-domain signals shifted by the time delay and of the other of the first or second frequency-domain signals; and forming a second resultant signal including, for each of the number of subbands, a difference between the shifted one of the first or second frequency-domain signals and the other of the first or second frequency-domain signals. Apparatus and program products are also disclosed.
-
Citations
20 Claims
-
1. A method comprising:
-
estimating directional information based on multiple input channel signals representing at least one arriving sound from a sound source captured by respective multiple microphones that have respective known locations relative to each other; deriving a mid-signal and a side signal on basis of a first input channel signal, a second input channel signal and said estimated directional information; and generating an output signal comprising a plurality of output channels using said mid-signal, said side signal and said estimated directional information such that the output signal retains a spatial representation of the captured at least one arriving sound, wherein said generating comprises processing the mid-signal and the side signal using said estimated directional information, and combining the processed mid-signal and the processed side signal to determine at least a left channel signal and a right channel signal of said output signal that retains the spatial representation of the captured at least one arriving sound. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 19)
-
-
14. An apparatus, comprising
at least one processor, and at least one non-transitory computer readable medium including computer program code, the at least one non-transitory computer readable medium and the computer program code configured to, with the at least one processor, cause the apparatus at least to perform: -
estimating directional information based on multiple input channel signals representing at least one arriving sound from a sound source captured by respective multiple microphones that have respective known locations relative to each other; deriving a mid-signal and a side signal on basis of a first input channel signal, a second input channel signal and said estimated directional information and generating an output signal comprising a plurality of output channels using said mid-signal, said side signal and said estimated directional information such that the output signal retains a spatial representation of the captured at least one arriving sound, wherein said generating comprises processing the mid-signal and the side signal using said estimated directional information, and combining the processed mid-signal and the processed side signal to determine at least a left channel signal and a right channel signal of said output signal that retains the spatial representation of the captured at least one arriving sound. - View Dependent Claims (15, 16, 17, 18, 20)
-
Specification