Synthesis of signals for immersive audio playback
First Claim
Patent Images
1. A method for synthesizing sound, comprising:
- receiving one or more first inputs, each first input comprising a respective monaural audio track;
receiving one or more second inputs indicating respective three-dimensional (3D) source locations having azimuth and elevation coordinates to be associated with the first inputs;
assigning to each of the first inputs respective left and right filter responses based on filter response functions that depend upon the azimuth and elevation coordinates of the respective 3D source locations; and
synthesizing left and right stereo output signals by applying the respective left and right filter responses to the first inputs,wherein the one or more first inputs comprise a first plurality of audio input tracks, and wherein synthesizing the left and right stereo output signals comprises;
spatially upsampling the first plurality of the input audio tracks in order to generate a second plurality of synthesized inputs, having synthesized 3D source locations with respective coordinates different from the respective 3D source locations associated with the first inputs;
filtering the synthesized inputs using the filter response functions computed at the azimuth and elevation coordinates of the synthesized 3D source locations; and
after filtering the first inputs using the respective left and right filter responses, summing the filtered synthesized inputs with the filtered first inputs to produce the stereo output signals.
3 Assignments
0 Petitions
Accused Products
Abstract
A method for synthesizing sound includes receiving one or more first inputs (80), each including a respective monaural audio track (82). One or more second inputs are received, indicating respective three-dimensional (3D) source locations having azimuth and elevation coordinates to be associated with the first inputs. Each of the first inputs is assigned respective left and right filter responses based on filter response functions that depend upon the azimuth and elevation coordinates of the respective 3D source locations. Left and right stereo output signals (94) are synthesized by applying the respective left and right filter responses to the first inputs.
-
Citations
34 Claims
-
1. A method for synthesizing sound, comprising:
-
receiving one or more first inputs, each first input comprising a respective monaural audio track; receiving one or more second inputs indicating respective three-dimensional (3D) source locations having azimuth and elevation coordinates to be associated with the first inputs; assigning to each of the first inputs respective left and right filter responses based on filter response functions that depend upon the azimuth and elevation coordinates of the respective 3D source locations; and synthesizing left and right stereo output signals by applying the respective left and right filter responses to the first inputs, wherein the one or more first inputs comprise a first plurality of audio input tracks, and wherein synthesizing the left and right stereo output signals comprises; spatially upsampling the first plurality of the input audio tracks in order to generate a second plurality of synthesized inputs, having synthesized 3D source locations with respective coordinates different from the respective 3D source locations associated with the first inputs; filtering the synthesized inputs using the filter response functions computed at the azimuth and elevation coordinates of the synthesized 3D source locations; and after filtering the first inputs using the respective left and right filter responses, summing the filtered synthesized inputs with the filtered first inputs to produce the stereo output signals. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 11)
-
-
10. A method for synthesizing sound, comprising:
-
receiving one or more first inputs, each first input comprising a respective monaural audio track; receiving one or more second inputs indicating respective three-dimensional (3D) source locations having azimuth and elevation coordinates to be associated with the first inputs; assigning to each of the first inputs respective left and right filter responses based on filter response functions that depend upon the azimuth and elevation coordinates of the respective 3D source locations; and synthesizing left and right stereo output signals by applying the respective left and right filter responses to the first inputs, wherein at least one of the second inputs specifies a 3D trajectory in space, and wherein assigning the left and right filter responses comprises specifying, at each of a plurality of points along the 3D trajectory, filter responses that vary over the trajectory responsively to the azimuth and elevation coordinates of the points, and wherein synthesizing the left and right stereo output signals comprises sequentially applying to the first input that is associated with the at least one of the second inputs the filter responses that are specified for the points along the 3D trajectory, and wherein receiving the one or more second inputs comprises; receiving a start point and a start time of the trajectory; receiving an end point and an end time of the trajectory; and automatically computing the 3D trajectory between the start point and the end point such that the trajectory is traversed from the start time to the end time.
-
-
12. Apparatus for synthesizing sound, comprising:
-
an input interface configured to receive one or more first inputs, each first input comprising a respective monaural audio track, and to receive one or more second inputs indicating respective three-dimensional (3D) source locations having azimuth and elevation coordinates to be associated with the first inputs; and a processor, which is configured to assign to each of the first inputs respective left and right filter responses based on filter response functions that depend upon the azimuth and elevation coordinates of the respective 3D source locations, and to synthesize left and right stereo output signals by applying the respective left and right filter responses to the first inputs, wherein the one or more first inputs comprise a first plurality of audio input tracks, and wherein the processor is configured to spatially upsample the first plurality of the input audio tracks in order to generate a second plurality of synthesized inputs, having synthesized 3D source locations with respective coordinates different from the respective 3D source locations associated with the first inputs, to filter the synthesized inputs using the filter response functions computed at the azimuth and elevation coordinates of the synthesized 3D source locations, and to sum the filtered synthesized inputs with the filtered first inputs to produce the stereo output signals. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21)
-
-
22. Apparatus for synthesizing sound, comprising:
-
an input interface configured to receive one or more first inputs, each first input comprising a respective monaural audio track, and to receive one or more second inputs indicating respective three-dimensional (3D) source locations having azimuth and elevation coordinates to be associated with the first inputs; and a processor, which is configured to assign to each of the first inputs respective left and right filter responses based on filter response functions that depend upon the azimuth and elevation coordinates of the respective 3D source locations, and to synthesize left and right stereo output signals by applying the respective left and right filter responses to the first inputs, wherein at least one of the second inputs specifies a 3D trajectory in space, and wherein the processor is configured to specify, at each of a plurality of points along the 3D trajectory, filter responses that vary over the trajectory responsively to the azimuth and elevation coordinates of the points, and to sequentially apply to the first input that is associated with the at least one of the second inputs the filter responses that are specified for the points along the 3D trajectory, and wherein the processor is configured to receive a start point and a start time of the trajectory and an end point and an end time of the trajectory, and to automatically compute the 3D trajectory between the start point and the end point such that the trajectory is traversed from the start time to the end time. - View Dependent Claims (23)
-
-
24. A computer software product, comprising a non-transitory computer-readable medium in which program instructions are stored, which instructions, when read by a computer, cause the computer to receive one or more first inputs, each first input comprising a respective monaural audio track, and to receive one or more second inputs indicating respective three-dimensional (3D) source locations having azimuth and elevation coordinates to be associated with the first inputs,
wherein the instructions cause the computer to assign to each of the first inputs respective left and right filter responses based on filter response functions that depend upon the azimuth and elevation coordinates of the respective 3D source locations, and to synthesize left and right stereo output signals by applying the respective left and right filter responses to the first inputs, and wherein the one or more first inputs comprise a first plurality of audio input tracks, and wherein the instructions cause the computer to spatially upsample the first plurality of the input audio tracks in order to generate a second plurality of synthesized inputs, having synthesized 3D source locations with respective coordinates different from the respective 3D source locations associated with the first inputs, to filter the synthesized inputs using the filter response functions computed at the azimuth and elevation coordinates of the synthesized 3D source locations, and to sum the filtered synthesized inputs with the filtered first inputs to produce the stereo output signals.
-
33. A computer software product, comprising a non-transitory computer-readable medium in which program instructions are stored, which instructions, when read by a computer, cause the computer to receive one or more first inputs, each first input comprising a respective monaural audio track, and to receive one or more second inputs indicating respective three-dimensional (3D) source locations having azimuth and elevation coordinates to be associated with the first inputs,
wherein the instructions cause the computer to assign to each of the first inputs respective left and right filter responses based on filter response functions that depend upon the azimuth and elevation coordinates of the respective 3D source locations, and to synthesize left and right stereo output signals by applying the respective left and right filter responses to the first inputs, and wherein at least one of the second inputs specifies a 3D trajectory in space, and wherein the instructions cause the computer to specify, at each of a plurality of points along the 3D trajectory, filter responses that vary over the trajectory responsively to the azimuth and elevation coordinates of the points, and to sequentially apply to the first input that is associated with the at least one of the second inputs the filter responses that are specified for the points along the 3D trajectory, and wherein the instructions cause the computer to receive a start point and a start time of the trajectory and an end point and an end time of the trajectory, and to automatically compute the 3D trajectory between the start point and the end point such that the trajectory is traversed from the start time to the end time.
Specification