PERFORMING SPATIAL MASKING WITH RESPECT TO SPHERICAL HARMONIC COEFFICIENTS
Abstract
In general, techniques are described by which to perform spatial masking with respect to spherical harmonic coefficients. As one example, an audio encoding device comprising a processor may perform various aspects of the techniques. The processor may be configured to perform spatial analysis based on the spherical harmonic coefficients describing a three-dimensional sound field to identify a spatial masking threshold. The processor may further be configured to render the multi-channel audio data from the plurality of spherical harmonic coefficients, and compress the multi-channel audio data based on the identified spatial masking threshold to generate a bitstream.
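The encoder-side flow summarized above (spatial analysis, rendering, compression) can be sketched as a toy pipeline. This is a minimal illustration under stated assumptions, not the method of the specification: the first-order basis `[1, x, y, z]`, the cube speaker layout, the spreading function in `spatial_analysis` (with its `offset_db` and `slope_db_per_deg` parameters), and the uniform quantizer standing in for a real audio codec are all hypothetical choices made for this example.

```python
import math

S = 1 / math.sqrt(3)
# Hypothetical layout: eight speakers at the corners of a cube (unit vectors).
SPEAKERS = [(sx * S, sy * S, sz * S)
            for sx in (1, -1) for sy in (1, -1) for sz in (1, -1)]


def basis(direction):
    """Unnormalized first-order basis [1, x, y, z] (an assumed convention)."""
    x, y, z = direction
    return [1.0, x, y, z]


def render(shc_frame, speakers):
    """Project each 4-coefficient sample onto every speaker direction."""
    weights = [1.0, 3.0, 3.0, 3.0]  # sampling-decoder weights for this basis
    feeds = []
    for d in speakers:
        row = [w * b / len(speakers) for w, b in zip(weights, basis(d))]
        feeds.append([sum(r * a for r, a in zip(row, sample))
                      for sample in shc_frame])
    return feeds


def spatial_analysis(shc_frame, speakers, offset_db=20.0, slope_db_per_deg=0.1):
    """Toy spatial masking: each direction's threshold is the strongest
    directional energy, attenuated by a fixed offset plus a per-degree slope.
    The spreading function is invented for illustration."""
    feeds = render(shc_frame, speakers)
    energies = [sum(v * v for v in ch) / len(ch) for ch in feeds]
    thresholds = []
    for di in speakers:
        best = 0.0
        for dj, ej in zip(speakers, energies):
            cos_angle = max(-1.0, min(1.0, sum(a * b for a, b in zip(di, dj))))
            angle = math.degrees(math.acos(cos_angle))
            atten_db = offset_db + slope_db_per_deg * angle
            best = max(best, ej * 10 ** (-atten_db / 10))
        thresholds.append(best)
    return thresholds


def compress(feeds, thresholds, floor=1e-12):
    """Uniform quantizer per channel; the step is chosen so the idealized
    quantization noise power step**2 / 12 equals the channel's threshold."""
    bitstream = []
    for ch, thr in zip(feeds, thresholds):
        step = math.sqrt(12.0 * max(thr, floor))
        bitstream.append((step, [round(v / step) for v in ch]))
    return bitstream


def encode(shc_frame, speakers=SPEAKERS):
    thresholds = spatial_analysis(shc_frame, speakers)
    feeds = render(shc_frame, speakers)
    return compress(feeds, thresholds)


# Usage: one frame of a plane wave arriving from the first speaker direction.
signal = [math.sin(2 * math.pi * 5 * n / 64) for n in range(64)]
shc_frame = [[s * b for b in basis(SPEAKERS[0])] for s in signal]
bitstream = encode(shc_frame)
```

In this sketch a higher threshold yields a coarser quantizer step, so channels that are masked by a loud neighbouring direction spend fewer quantization levels. A practical encoder would combine such thresholds with a full psychoacoustic codec rather than a bare quantizer.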
255 Citations
48 Claims
1. A method of compressing multi-channel audio data comprising:
performing spatial analysis based on a plurality of spherical harmonic coefficients that describe a three-dimensional sound field to identify a spatial masking threshold;
rendering the multi-channel audio data from the plurality of spherical harmonic coefficients; and
compressing the multi-channel audio data based on the identified spatial masking threshold to generate a bitstream.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. An audio encoding device comprising:
one or more processors configured to perform spatial analysis based on a plurality of spherical harmonic coefficients that describe a three-dimensional sound field to identify spatial masking thresholds, render the multi-channel audio data from the plurality of spherical harmonic coefficients, and compress the multi-channel audio data based on the identified spatial masking thresholds to generate a bitstream.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19, 20
21. An audio encoding device comprising:
means for performing spatial analysis based on a plurality of spherical harmonic coefficients that describe a three-dimensional sound field to identify a spatial masking threshold;
means for rendering the multi-channel audio data from the plurality of spherical harmonic coefficients; and
means for compressing the multi-channel audio data based on the identified spatial masking threshold to generate a bitstream.
22. A non-transitory computer-readable storage medium having stored thereon instructions that, when executed, cause one or more processors of an audio encoding device to:
perform spatial analysis based on a plurality of spherical harmonic coefficients that describe a three-dimensional sound field to identify a spatial masking threshold;
render the multi-channel audio data from the plurality of spherical harmonic coefficients; and
compress the multi-channel audio data based on the identified spatial masking thresholds to generate a bitstream.
23. A method comprising:
decoding a bitstream to generate first multi-channel audio data having a plurality of channels corresponding to speakers arranged in a defined speaker geometry;
performing an inverse rendering process with respect to the generated multi-channel audio data to generate a plurality of spherical harmonic coefficients; and
rendering second multi-channel audio data having a plurality of channels corresponding to speakers arranged in a local speaker geometry based on the plurality of spherical harmonic coefficients.
Dependent claims: 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34
35. An audio decoding device comprising:
one or more processors configured to decode a bitstream to generate first multi-channel audio data having a plurality of channels corresponding to speakers arranged in a first speaker geometry, perform an inverse rendering process with respect to the generated multi-channel audio data to generate a plurality of spherical harmonic coefficients, and render second multi-channel audio data having a plurality of channels corresponding to speakers arranged in a local speaker geometry based on the plurality of spherical harmonic coefficients.
Dependent claims: 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46
47. An audio decoding device comprising:
means for decoding a bitstream to generate first multi-channel audio data having a plurality of channels corresponding to speakers arranged in a first speaker geometry;
means for performing an inverse rendering process with respect to the generated multi-channel audio data to generate a plurality of spherical harmonic coefficients; and
means for rendering second multi-channel audio data having a plurality of channels corresponding to speakers arranged in a local speaker geometry based on the plurality of spherical harmonic coefficients.
48. A non-transitory computer-readable storage medium having stored thereon instructions that, when executed, cause one or more processors of an audio decoding device to:
decode a bitstream to generate first multi-channel audio data having a plurality of channels corresponding to speakers arranged in a first speaker geometry;
perform an inverse rendering process with respect to the generated multi-channel audio data to generate a plurality of spherical harmonic coefficients; and
render second multi-channel audio data having a plurality of channels corresponding to speakers arranged in a local speaker geometry based on the plurality of spherical harmonic coefficients.
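The decoder-side steps recited in claims 23, 35, 47 and 48 (decode, inverse render to spherical harmonic coefficients, render to a local speaker geometry) can be illustrated for a single sample. This is a sketch under assumptions, not the claimed implementation: the first-order basis `[1, x, y, z]`, the tetrahedral "first" geometry, the cube "local" geometry, and the `(step, index)` bitstream format are hypothetical. For these two regular layouts, summing each feed times its basis vector happens to be the exact inverse of `render`; an arbitrary layout would generally need a pseudo-inverse of the rendering matrix instead.

```python
import math

S = 1 / math.sqrt(3)
# Hypothetical first geometry: four speakers at the vertices of a tetrahedron.
TETRA = [(S, S, S), (S, -S, -S), (-S, S, -S), (-S, -S, S)]
# Hypothetical local geometry: eight speakers at the corners of a cube.
CUBE = [(sx * S, sy * S, sz * S)
        for sx in (1, -1) for sy in (1, -1) for sz in (1, -1)]
WEIGHTS = [1.0, 3.0, 3.0, 3.0]  # sampling-decoder weights for this basis


def basis(direction):
    """Unnormalized first-order basis [1, x, y, z] (an assumed convention)."""
    x, y, z = direction
    return [1.0, x, y, z]


def decode_bitstream(bitstream):
    """Placeholder decoder: each entry is a (step, index) pair for one channel."""
    return [step * index for step, index in bitstream]


def render(shc, speakers):
    """Turn four coefficients into one feed value per speaker."""
    n = len(speakers)
    return [sum(w * b * a for w, b, a in zip(WEIGHTS, basis(d), shc)) / n
            for d in speakers]


def inverse_render(feeds, speakers):
    """Treat each feed as a plane wave from its speaker direction and sum
    the resulting coefficients (exact inverse of render for TETRA and CUBE)."""
    shc = [0.0, 0.0, 0.0, 0.0]
    for gain, d in zip(feeds, speakers):
        for k, b in enumerate(basis(d)):
            shc[k] += gain * b
    return shc


# Usage: decode feeds for the first geometry, recover the coefficients,
# then render them for the local geometry.
bitstream = [(0.125, 6), (0.125, -1), (0.125, 3), (0.125, 0)]
first_feeds = decode_bitstream(bitstream)
shc = inverse_render(first_feeds, TETRA)
local_feeds = render(shc, CUBE)
```

Routing through the coefficients is what decouples the transmitted channel layout from the playback layout: `local_feeds` has eight channels even though only four were decoded.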
Specification