Method for controlling a speaker array to provide spatialized, localized, and binaural virtual surround sound
First Claim
Patent Images
1. A method for producing multi-dimensional sound from a speaker array, comprising:
- receiving a plurality of audio signals from a plurality of sources;
filtering each audio signal through each of a left Head-Related Transfer Function (HRTF) and a right HRTF to generate HRTF-filtered left and HTRF-filtered right audio signals, wherein the left HRTF is calculated based on an angle at which the plurality of audio signals will be transmitted to a left ear of a user, and wherein the right HRTF is calculated based on an angle at which the plurality of audio signals will be transmitted to a right ear of a user;
filtering each of the HRTF-filtered left and HRTF-filtered right audio signals with a Psychoacoustic Bandwidth Extension Processor (PBEP);
merging the PBEP HRTF-filtered left audio signals into a left total binaural signal;
merging the PBEP HRTF-filtered right audio signals into a right total binaural signal;
filtering the left total binaural signal through a set of left spatialization filters, wherein a separate left spatialization filter is provided for each speaker in the speaker array;
filtering the right total binaural signal through a set of right spatialization filters, wherein a separate right spatialization filter is provided for each speaker in the speaker array;
summing the filtered left total binaural signal and filtered right total binaural signal for each respective speaker into a speaker signal;
feeding the speaker signal to the respective speaker in the speaker array; and
transmitting the speaker signal through the respective speaker to the user.
5 Assignments
0 Petitions
Accused Products
Abstract
A signal processing method and system are provided for delivering spatialized sound using highly optimized inverse filters to deliver narrow localized beams of sound from the included speaker array. The inventive method can be used to provide private listening areas in a public space and provide spatialization of source material for individual users to create a virtual 3D audio effect. In a binaural mode, a speaker array provides two targeted beams aimed towards the primary user'"'"'s ears—one discrete beam for the left ear and one discrete beam for the right ear.
47 Citations
41 Claims
-
1. A method for producing multi-dimensional sound from a speaker array, comprising:
-
receiving a plurality of audio signals from a plurality of sources; filtering each audio signal through each of a left Head-Related Transfer Function (HRTF) and a right HRTF to generate HRTF-filtered left and HTRF-filtered right audio signals, wherein the left HRTF is calculated based on an angle at which the plurality of audio signals will be transmitted to a left ear of a user, and wherein the right HRTF is calculated based on an angle at which the plurality of audio signals will be transmitted to a right ear of a user; filtering each of the HRTF-filtered left and HRTF-filtered right audio signals with a Psychoacoustic Bandwidth Extension Processor (PBEP); merging the PBEP HRTF-filtered left audio signals into a left total binaural signal; merging the PBEP HRTF-filtered right audio signals into a right total binaural signal; filtering the left total binaural signal through a set of left spatialization filters, wherein a separate left spatialization filter is provided for each speaker in the speaker array; filtering the right total binaural signal through a set of right spatialization filters, wherein a separate right spatialization filter is provided for each speaker in the speaker array; summing the filtered left total binaural signal and filtered right total binaural signal for each respective speaker into a speaker signal; feeding the speaker signal to the respective speaker in the speaker array; and transmitting the speaker signal through the respective speaker to the user. - View Dependent Claims (2, 3, 4, 5, 6, 21, 22)
-
-
7. A method for producing a localized sound from a speaker array comprising a plurality of speakers, comprising:
-
receiving at least one audio signal; pre-filtering the at least one audio signal with a Psychoacoustic Bandwidth Extension Processor (PBEP); filtering the at least one audio signal through a set of finite impulse response (FIR) filters, wherein a separate FIR filter is provided for each speaker in the speaker array, wherein each FIR filter has filter coefficients a(f) optimized in a frequency domain by minimizing a cost function J for each frequency f according to the relationship
J(f)=∥
H(f)a(f)−
p(f )∥
2 +β
∥
a(f)∥
2,where H(f) is a M×
N matrix of electro-acoustical transfer functions computed for N speakers and M virtual control points, p(f) is a vector representing a target sound field at the M virtual control points as a function of frequency, ∥
. . . ∥
indicates L2 norm of a vector, and β
is a regularization parameter;summing the filtered audio signals for each respective speaker into a speaker signal; transmitting each speaker signal to the respective speaker in the speaker array; and delivering each speaker signal to one or more regions of space occupied by one or more users. - View Dependent Claims (8, 9, 10, 11, 12, 23)
-
-
13. A speaker array system for producing localized sound, comprising:
-
an input which receives a plurality of audio signals from at least one source; a processor in communication with a non-transitory computer-readable medium containing instructions configured for causing the processor to determine whether the plurality of audio signals should be processed by a binaural processing system or a beamforming processing system; and a speaker array comprising a plurality of loudspeakers; wherein the binaural processing system comprises; at least one filter which filters each audio signal through a left Head-Related Transfer Function (HRTF) and a right HRTF, wherein the left HRTF is calculated based on an angle at which the plurality of audio signals will be transmitted to a left ear of a user; and
wherein the right HRTF is calculated based on an angle at which the plurality of audio signals will be transmitted to a right ear of a user;a left combiner which combines all of the audio signals from the left HRTF into a left total binaural signal; a right combiner which combines all of the audio signals from the right HRTF into a right total binaural signal; at least one left spatialization filter which filters the left total binaural signal, wherein a separate left spatialization filter is provided for each loudspeaker in a speaker array; at least one right spatialization filter which filters the right total binaural signal, wherein a separate right spatialization filter is provided for each loudspeaker in the speaker array; and a binaural combiner which sums the filtered left total binaural signal and filtered right total binaural signal into a binaural speaker signal for each respective loudspeaker and transmits each binaural speaker signal to the respective loudspeaker; wherein the beamforming processing system comprises; a plurality of beamforming spatialization filters which filters each audio signal, wherein a separate spatialization filter is provided for each loudspeaker in the speaker array; and a beamforming combiner which sums the filtered audio signals for each respective loudspeaker into a beamforming speaker signal and transmits each beamforming speaker signal to the respective speaker in the speaker array; wherein the speaker array delivers the respective binaural speaker signal or the beamforming speaker signal through the plurality of loudspeakers to one or more users. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
-
24. A method for producing multidimensional sound from a speaker array, comprising:
-
receiving a plurality of audio signals, each audio signal comprising a plurality of frequencies, from a plurality of sources; filtering each audio signal through each of a left Head-Related Transfer Function (HRTF) and a right HRTF to generate HRTF-filtered left and HTRF-filtered right audio signals, wherein the left HRTF is calculated based on an angle at which the plurality of audio signals will be transmitted to a left ear of a user, and wherein the right HRTF is calculated based on an angle at which the plurality of audio signals will be transmitted to a right ear of a user; merging the HRTF-filtered left audio signals into a left total binaural signal; merging the HRTF-filtered right audio signals into a right total binaural signal; filtering the left total binaural signal through a set of left finite impulse response (FIR) filters, wherein a separate left FIR filter is provided for each speaker in the speaker array; filtering the right total binaural signal through a set of right FIR filters, wherein a separate right FIR filter is provided for each speaker in the speaker array; wherein each FIR filter has filter coefficients optimized in a frequency domain by minimizing a cost function J for each frequency according to the relationship J(f)=∥
H(f) a(f)−
p(f)∥
2 +∥
a(f) ∥
2, where H(f) is a M×
N matrix of electro-acoustical transfer functions computed for N speakers and M virtual control points, p(f) is a vector representing a target sound field at the M virtual control points as a function of frequency, ∥
. . . ∥
indicates L2 norm of a vector, and β
is a regularization parameter;summing the filtered left total binaural signal and filtered right total binaural signal for each respective speaker into a speaker signal; feeding the speaker signal to the respective speaker in the speaker array; and transmitting the speaker signal through the respective speaker to the user. - View Dependent Claims (25, 26, 27, 28, 29, 30, 31, 32)
-
-
33. A method for producing a localized sound from a speaker array comprising a plurality of speakers, comprising:
-
receiving at least one audio signal comprising a plurality of frequencies; filtering the at least one audio signal through a set of finite impulse response (FIR) filters, wherein a separate FIR filter is provided for each speaker in the speaker array, wherein each FIR filter has filter coefficients a(f) optimized in a frequency domain by minimizing a cost function J for each frequency f according to the relationship
J(f)=∥
H(f)a(f)−
p(f)∥
2 +β
∥
a(f)∥
2,where H(f) is a M×
N matrix of electro-acoustical transfer functions computed for N speakers and M virtual control points, p(f) is a vector representing a target sound field at the M virtual control points as a function of frequency, ∥
. . . ∥
indicates L2 norm of a vector, and β
is a regularization parameter;summing the filtered audio signals for each respective speaker into a speaker signal; transmitting each speaker signal to the respective speaker in the speaker array; and delivering each speaker signal to one or more regions of space occupied by one or more users. - View Dependent Claims (34, 35, 36, 37, 38, 39, 40, 41)
-
Specification