Ambisonic depth extraction
Abstract
The systems and methods described herein can be configured to identify, manipulate, and render different audio source components from encoded 3D audio mixes, such as can include content mixed for azimuth, elevation, and/or depth relative to a listener. The systems and methods can be configured to decouple depth encoding and decoding to permit spatial performance to be tailored to a particular playback environment or platform. In an example, the systems and methods improve rendering in applications that involve listener tracking, including tracking over six degrees of freedom (e.g., yaw, pitch, roll orientation, and x, y, z position).
20 Claims
1. A method for positioning a virtual source to be rendered at an intended depth relative to a listener position, the virtual source including information from two or more spatial audio submix signals configured to be spatially rendered together relative to a first listener position, and each of the spatial audio submix signals corresponds to a respective different reference depth relative to a reference position, the method comprising:
- identifying, in each of the spatial audio submix signals, respective candidate components of the virtual source;
- determining a first relatedness metric for the identified candidate components of the virtual source from the spatial audio submix signals; and
- using the first relatedness metric, determining depths other than the respective reference depths of the spatial audio submix signals at which to render the candidate components from the spatial audio submix signals for a listener at the first listener position such that the listener at the first listener position perceives the virtual source substantially at the intended depth.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14
15. A system for processing audio information to position a virtual audio source to be rendered at an intended depth relative to a listener position, the virtual source including information from two or more spatial audio submix signals configured to be spatially rendered together relative to a first listener position, and each of the spatial audio submix signals corresponds to a respective different reference depth relative to a reference position, the system comprising:
an audio signal depth processor circuit configured to:
- identify, in each of the spatial audio submix signals, respective candidate components of the virtual source;
- determine a first relatedness metric for the identified candidate components of the virtual source from the spatial audio submix signals; and
- using the first relatedness metric, determine depths other than the respective reference depths of the spatial audio submix signals at which to render the candidate components from the spatial audio submix signals for a listener at the first listener position such that the listener at the first listener position perceives the virtual source substantially at the intended depth.
Dependent claims: 16, 17
18. A method for positioning a virtual source to be rendered at an intended depth relative to a listener position, the virtual source based on information from one or more spatial audio signals and each of the spatial audio signals corresponds to a respective different reference depth relative to a reference position, the method comprising:
- identifying, in each of multiple spatial audio signals, respective candidate components of the virtual source;
- determining a first relatedness metric for the identified candidate components of the virtual source from the spatial audio signals; and
- determining a confidence for the first relatedness metric, the confidence indicating a belongingness of the one or more candidate components to the virtual source; and
- when the confidence for the first metric indicates a correspondence in content and/or location between the identified candidate components, determining first depths at which to render the candidate components for a listener at the first listener position such that the listener perceives the virtual source substantially at the intended depth, wherein at least one of the determined first depths is other than its corresponding reference depth; and
- when the confidence for the first relatedness metric indicates a non-correspondence in content or location between the identified candidate components, determining second depths at which to render the candidate components for the listener at the first listener position such that the listener perceives the virtual source substantially at the intended depth, wherein the determined second depths correspond to the reference depths.
Dependent claims: 19, 20
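The decision flow recited in the independent claims can be illustrated with a minimal sketch. It is not the patented implementation: the claims do not say how the relatedness metric, the confidence, or the rendering depths are computed. For illustration only, the sketch assumes that relatedness is a normalized correlation between two candidate components, that confidence is a simple threshold on that value, and that the intended depth is an amplitude-weighted interpolation between the two reference depths. The names `relatedness`, `render_depths`, and `threshold` are hypothetical.

```python
import math


def relatedness(a, b):
    """Assumed metric: magnitude of the normalized correlation, in [0, 1],
    between two equal-length candidate components (lists of samples)."""
    dot = sum(x * y for x, y in zip(a, b))
    energy_a = sum(x * x for x in a)
    energy_b = sum(y * y for y in b)
    if energy_a == 0.0 or energy_b == 0.0:
        return 0.0  # a silent component cannot be related to anything
    return abs(dot) / math.sqrt(energy_a * energy_b)


def render_depths(components, reference_depths, threshold=0.8):
    """Return one rendering depth per candidate component.

    components       -- two sample lists, one from each submix
    reference_depths -- the reference depth of each submix
    threshold        -- hypothetical confidence cutoff on the metric
    """
    a, b = components
    near, far = reference_depths
    if relatedness(a, b) < threshold:
        # Non-correspondence: keep each component at its own reference depth.
        return [near, far]
    # Correspondence: treat both components as parts of one virtual source and
    # place them at a shared depth between the reference depths, weighted by
    # RMS-like amplitude (an assumption made for this sketch).
    weight_a = math.sqrt(sum(x * x for x in a))
    weight_b = math.sqrt(sum(y * y for y in b))
    intended = (weight_a * near + weight_b * far) / (weight_a + weight_b)
    return [intended, intended]


if __name__ == "__main__":
    same = [1.0, -1.0, 1.0, -1.0]
    print(render_depths((same, same), (0.5, 2.0)))
    print(render_depths(([1.0, 0.0, -1.0, 0.0], [0.0, 1.0, 0.0, -1.0]), (0.5, 2.0)))
```

In this sketch, two identical components found in submixes at different reference depths are both moved to a single intermediate depth, while uncorrelated components stay at their reference depths, mirroring the two branches of claim 18.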
Specification