Low bit-rate spatial coding method and system
First Claim
1. A low bit-rate spatial coding system for encoding a plurality of audio streams representing a soundfield into an encoded signal and decoding said encoded signal, said system including an encoder and a decoder, said encoder comprisingmeans for generating a plurality of subband signals in response to said plurality of audio streams, each subband signal representing a respective frequency subband of a respective one of said audio streams,means for generating a composite signal representing the combination of subband signals in respective frequency subbands,means for generating a steering control signal for said composite signal indicating the principal direction of said soundfield in respective subbands,means for generating encoded information by allocating bits to said composite signal and said steering control signal, andmeans for assembling said encoded information into an encoded signal, andsaid decoder comprisingmeans for deriving the composite signal and steering control signal from said encoded signal,means for deriving subband signals in response to said composite signal and said steering control signal,means for supplying reproduction information describing the number of output channels of said decoder and the location or virtual location of sound transducers connected to the respective output channels, wherein there are three or more output channels, andmeans for generating an audio stream in no more than two output channels at any instant in response to said subband signals and reproduction information.
2 Assignments
0 Petitions
Accused Products
Abstract
A spatial audio coding system, including an encoder and a decoder, operates at very low bit-rates and is useful for audio via the Internet. The listener or listeners preferably are located within a predictable listening area, for example, users of a personal computer or television viewers. An encoder produces a composite audio-information signal representing the soundfield to be reproduced and a directional vector or "steering control signal." The composite audio-information signal has its frequency spectrum broken into a number of subbands, preferably commensurate with the critical bands of the human ear. The steering control signal has a component relating to the dominant direction of the soundfield in each of the subbands. Because the system is based on the premise that only sound from a single direction is heard at any instant, the decoder need not apply a signal to more than two sound transducers at any instant.
103 Citations
48 Claims
-
1. A low bit-rate spatial coding system for encoding a plurality of audio streams representing a soundfield into an encoded signal and decoding said encoded signal, said system including an encoder and a decoder, said encoder comprising
means for generating a plurality of subband signals in response to said plurality of audio streams, each subband signal representing a respective frequency subband of a respective one of said audio streams, means for generating a composite signal representing the combination of subband signals in respective frequency subbands, means for generating a steering control signal for said composite signal indicating the principal direction of said soundfield in respective subbands, means for generating encoded information by allocating bits to said composite signal and said steering control signal, and means for assembling said encoded information into an encoded signal, and said decoder comprising means for deriving the composite signal and steering control signal from said encoded signal, means for deriving subband signals in response to said composite signal and said steering control signal, means for supplying reproduction information describing the number of output channels of said decoder and the location or virtual location of sound transducers connected to the respective output channels, wherein there are three or more output channels, and means for generating an audio stream in no more than two output channels at any instant in response to said subband signals and reproduction information.
-
2. A low bit-rate spatial coding system for encoding a plurality of audio streams representing a soundfield into an encoded signal, decoding said encoded signal, and reproducing an auditory likeness of said soundfield, said system including an encoder and a decoder, said encoder comprising
means for generating a plurality of subband signals in response to said plurality of audio streams, each subband signal representing a respective frequency subband of a respective one of said audio streams, means for generating a composite signal representing the combination of subband signals in each frequency subband, means for generating a steering control signal for said composite signal indicating the principal direction of said soundfield in each subband, means for generating encoded information by allocating bits to said composite signal and said steering control signal, and means for assembling said encoded information into an encoded signal, said decoder comprising means for deriving the composite signal and steering control signal from said encoded signal, means for deriving subband signals in response to said composite signal and said steering control signal, means for supplying reproduction information describing the number of output channels of said decoder and the location or virtual location of sound transducers connected to the respective output channels, and means for generating an audio stream in one or more output channels in response to said subband signals and reproduction information, and further comprising a plurality of sound transducers coupled to the output channels of said decoder and arranged so as to generate an auditory likeness of said soundfield to a listener or listeners within a spatial coding sweet-spot listening area.
-
9. A decoder for use in a low bit-rate spatial coding system for decoding an encoded signal derived from a plurality of audio streams representing a soundfield by generating a plurality of subband signals in response to the plurality of audio streams, each subband signal representing a respective frequency subband of a respective one of said audio streams, generating a composite signal representing the combination of subband signals in respective frequency subbands, generating a steering control signal for the composite signal indicating the principal direction of said soundfield in respective subbands, generating encoded information by allocating bits to the composite signal and the steering control signal, and assembling the encoded information into an encoded signal, comprising
means for deriving the composite signal and steering control signal from said encoded signal, means for deriving subband signals in response to said composite signal and said steering control signal, means for supplying reproduction information describing the number of output channels of said decoder and the location or virtual location of sound transducers connected to the respective output channels, wherein there are three or more output channels, and means for generating an audio stream in no more than two output channels at any instant in response to said subband signals and reproduction information.
-
10. A decoder and reproduction system for use in a low bit-rate spatial coding system for decoding and reproducing an encoded signal derived from a plurality of audio streams representing a soundfield by generating a plurality of subband signals in response to the plurality of audio streams, each subband signal representing a respective frequency subband of a respective one of said audio streams, generating a composite signal representing the combination of subband signals in respective frequency subbands, generating a steering control signal for the composite signal indicating the principal direction of said soundfield in respective subbands, generating encoded information by allocating bits to the composite signal and the steering control signal, and assembling the encoded information into an encoded signal, comprising
means for deriving the composite signal and steering control signal from said encoded signal, means for deriving subband signals in response to said composite signal and said steering control signal, means for supplying reproduction information describing the number of output channels of said decoder and the location or virtual location of sound transducers connected to the respective output channels, and means for generating an audio stream in one or more output channels in response to said subband signals and reproduction information, and a plurality of sound transducers coupled to the output channels of said decoder and arranged so as to generate an auditory likeness of said soundfield to a listener or listeners within a spatial coding sweet-spot listening area.
-
17. A low bit-rate spatial coding system for encoding a plurality of audio streams representing a soundfield into an encoded signal and decoding said encoded signal, said system including an encoder and a decoder, said encoder comprising
a subband signal generator generating a plurality of subband signals in response to said plurality of audio streams, each subband signal representing a respective frequency subband of a respective one of said audio streams, a signal combiner generating a composite signal representing the combination of subband signals in respective frequency subbands, a soundfield direction detector generating a steering control signal for said composite signal indicating the principal direction of said soundfield in respective subbands, an encoder and bit allocator generating encoded information by allocating bits to said composite signal and said steering control signal, and a formatter assembling said encoded information into an encoded signal, and said decoder comprising a deformatter deriving the composite signal and steering control signal from said encoded signal, an inverse subband generator deriving subband signals in response to said composite signal and said steering control signal, an information input describing the number of output channels of said decoder and the location or virtual location of sound transducers connected to the respective output channels, wherein there are three or more output channels, and a signal generator generating an audio stream in no more than two output channels at any instant in response to said subband signals and reproduction information.
-
18. A low bit-rate spatial coding system for encoding a plurality of audio streams representing a soundfield into an encoded signal, decoding said encoded signal, and reproducing an auditory likeness of said soundfield, said system including an encoder and a decoder, said encoder comprising
a subband signal generator generating a plurality of subband signals in response to said plurality of audio streams, each subband signal representing a respective frequency subband of a respective one of said audio streams, a signal combiner generating a composite signal representing the combination of subband signals in each frequency subband, a soundfield direction detector generating a steering control signal for said composite signal indicating the principal direction of said soundfield in each subband, an encoder and bit allocator generating encoded information by allocating bits to said composite signal and said steering control signal, and a formatter assembling said encoded information into an encoded signal, said decoder comprising a deformatter deriving the composite signal and steering control signal from said encoded signal, an inverse subband generator deriving subband signals in response to said composite signal and said steering control signal, an information input describing the number of output channels of said decoder and the location or virtual location of sound transducers connected to the respective output channels, and a signal generator generating an audio stream in one or more output channels in response to said subband signals and reproduction information, and further comprising a plurality of sound transducers coupled to the output channels of said decoder and arranged so as to generate an auditory likeness of said soundfield to a listener or listeners within a spatial coding sweet-spot listening area.
-
25. A decoder for use in a low bit-rate spatial coding system for decoding an encoded signal derived from a plurality of audio streams representing a soundfield by generating a plurality of subband signals in response to the plurality of audio streams, each subband signal representing a respective frequency subband of a respective one of said audio streams, generating a composite signal representing the combination of subband signals in respective frequency subbands, generating a steering control signal for the composite signal indicating the principal direction of said soundfield in respective subbands, generating encoded information by allocating bits to the composite signal and the steering control signal, and assembling the encoded information into an encoded signal, comprising
a deformatter deriving the composite signal and steering control signal from said encoded signal, an inverse subband generator deriving subband signals in response to said composite signal and said steering control signal, an information input describing the number of output channels of said decoder and the location or virtual location of sound transducers connected to the respective output channels, wherein there are three or more output channels, and a signal generator generating an audio stream in no more than two output channels at any instant in response to said subband signals and reproduction information.
-
26. A decoder and reproduction system for use in a low bit-rate spatial coding system for decoding and reproducing an encoded signal derived from a plurality of audio streams representing a soundfield by generating a plurality of subband signals in response to the plurality of audio streams, each subband signal representing a respective frequency subband of a respective one of said audio streams, generating a composite signal representing the combination of subband signals in respective frequency subbands, generating a steering control signal for the composite signal indicating the principal direction of said soundfield in respective subbands, generating encoded information by allocating bits to the composite signal and the steering control signal, and assembling the encoded information into an encoded signal, comprising
a deformatter deriving the composite signal and steering control signal from said encoded signal, an inverse subband generator deriving subband signals in response to said composite signal and said steering control signal, an information input describing the number of output channels of said decoder and the location or virtual location of sound transducers connected to the respective output channels, and a signal generator generating an audio stream in one or more output channels in response to said subband signals and reproduction information, and a plurality of sound transducers coupled to the output channels of said decoder and arranged so as to generate an auditory likeness of said soundfield to a listener or listeners within a spatial coding sweet-spot listening area.
-
33. A low bit-rate spatial coding method for encoding a plurality of audio streams representing a soundfield into an encoded signal and decoding said encoded signal, said method including encoding and decoding, said encoding comprising
generating a plurality of subband signals in response to said plurality of audio streams, each subband signal representing a respective frequency subband of a respective one of said audio streams, generating a composite signal representing the combination of subband signals in respective frequency subbands, generating a steering control signal for said composite signal indicating the principal direction of said soundfield in respective subbands, generating encoded information by allocating bits to said composite signal and said steering control signal, and assembling said encoded information into an encoded signal, and said decoding comprising deriving the composite signal and steering control signal from said encoded signal, deriving subband signals in response to said composite signal and said steering control signal, supplying reproduction information describing the number of output channels of said decoder and the location or virtual location of sound transducers connected to the respective output channels, wherein there are three or more output channels, and generating an audio stream in no more than two output channels at any instant in response to said subband signals and reproduction information.
-
34. A low bit-rate spatial coding method for encoding a plurality of audio streams representing a soundfield into an encoded signal, decoding said encoded signal, and reproducing an auditory likeness of said soundfield, said method including an encoder and a decoder, said encoding comprising
generating a plurality of subband signals in response to said plurality of audio streams, each subband signal representing a respective frequency subband of a respective one of said audio streams, generating a composite signal representing the combination of subband signals in each frequency subband, generating a steering control signal for said composite signal indicating the principal direction of said soundfield in each subband, generating encoded information by allocating bits to said composite signal and said steering control signal, and assembling said encoded information into an encoded signal, said decoding comprising deriving the composite signal and steering control signal from said encoded signal, deriving subband signals in response to said composite signal and said steering control signal, supplying reproduction information describing the number of output channels of said decoder and the location or virtual location of sound transducers connected to the respective output channels, and generating an audio stream in one or more output channels in response to said subband signals and reproduction information, and further comprising coupling said output channels to a plurality of sound transducers arranged so as to generate an auditory likeness of said soundfield to a listener or listeners within a spatial coding sweet-spot listening area.
-
41. A low bit-rate spatial coding decoding method for decoding an encoded signal derived from a plurality of audio streams representing a soundfield by generating a plurality of subband signals in response to the plurality of audio streams, each subband signal representing a respective frequency subband of a respective one of said audio streams, generating a composite signal representing the combination of subband signals in respective frequency subbands, generating a steering control signal for the composite signal indicating the principal direction of said soundfield in respective subbands, generating encoded information by allocating bits to the composite signal and the steering control signal, and assembling the encoded information into an encoded signal, comprising
deriving the composite signal and steering control signal from said encoded signal, deriving subband signals in response to said composite signal and said steering control signal, supplying reproduction information describing the number of output channels of said decoder and the location or virtual location of sound transducers connected to the respective output channels, wherein there are three or more output channels, and generating an audio stream in no more than two output channels at any instant in response to said subband signals and reproduction information.
-
42. A low bit-rate spatial coding decoding and reproduction method for decoding and reproducing an encoded signal derived from a plurality of audio streams representing a soundfield by generating a plurality of subband signals in response to the plurality of audio streams, each subband signal representing a respective frequency subband of a respective one of said audio streams, generating a composite signal representing the combination of subband signals in respective frequency subbands, generating a steering control signal for the composite signal indicating the principal direction of said soundfield in respective subbands, generating encoded information by allocating bits to the composite signal and the steering control signal, and assembling the encoded information into an encoded signal, comprising
deriving the composite signal and steering control signal from said encoded signal, deriving subband signals in response to said composite signal and said steering control signal, supplying reproduction information describing the number of output channels of said decoder and the location or virtual location of sound transducers connected to the respective output channels, and generating an audio stream in one or more output channels in response to said subband signals and reproduction information, and coupling a plurality of sound transducers to the output channels of said decoder, the sound transducers arranged so as to generate an auditory likeness of said soundfield to a listener or listeners within a spatial coding sweet-spot listening area.
Specification