Audio Bitstreams with Supplementary Data and Encoding and Decoding of Such Bitstreams

US 20150348558A1
Filed: 08/10/2015
Published: 12/03/2015
Est. Priority Date: 12/03/2010
Status: Abandoned Application

First Claim

Patent Images

1. A method for generating an encoded audio bitstream, said method including steps of:

(a) providing primary audio data and supplementary data, wherein the primary audio data include audio data of at least one audio channel and the supplementary data include synchronization words and additional synchronization words; and

(b) combining the primary audio data with at least some of the supplementary data to generate the encoded audio bitstream, such that said encoded audio bitstream comprises a sequence of frames, the frames are organized in a sequence of bursts, each of the bursts has a guard band segment and includes some of the frames, each of the frames has N audio segments, each of the audio segments comprises M bits, wherein step (b) includes a step of including at least the synchronization words as the P least significant bits of each of at least some of the audio segments or a step of including at least the synchronization words in at least one said guard band segment, and some of the primary audio data is included as the M-P most significant bits of said each of at least some of the audio segments, where each of N, M, and P is a positive integer, and P is less than M, andincludes a step of generating at least one additional encoded audio bitstream, including by;

(c) providing additional primary audio data, wherein the additional primary audio data include audio data of at least one additional audio channel; and

(d) combining the additional primary audio data with at least some of the supplementary data to generate the additional encoded audio bitstream, such that said additional encoded audio bitstream comprises a sequence of frames, the frames are organized in a sequence of bursts, each of the bursts has a guard band segment and includes some of the frames, each of the frames has N audio segments, each of the audio segments comprises M bits, wherein step (d) includes a step of including at least the additional synchronization words in at least one said guard band segment of the additional encoded audio bitstream or a step of including at least the additional synchronization words as the P least significant bits of each of at least some of the audio segments of the additional encoded audio bitstream, andwherein the synchronization words and the additional synchronization words are suitable for time aligning the primary audio data and the additional primary audio data, or the primary audio data, the additional primary audio data, and the supplementary data.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

Methods for generating or decoding an encoded audio bitstream including audio data and supplementary data (e.g., metadata and/or unrelated audio data), where at least some of the supplementary data is included as LSBs of audio segments, and/or at least some of the supplementary data is included in guard bands. Typical embodiments provide a scalable and video synchronous format compatible with real-time and file-based infrastructure components that support the SMPTE 337 format for carrying data in AES3 serial bitstreams, and/or provide a framework for extending distribution codecs to scale beyond an 8-channel limit to support multiples of 8 channels synchronously across multiple AES3 interfaces. Another aspect is an audio processing unit configured to perform any embodiment of the method or including a buffer memory storing at least one segment of an audio bitstream generated in accordance with any embodiment of the method.

26 Citations

View as Search Results

22 Claims

1. A method for generating an encoded audio bitstream, said method including steps of:
- (a) providing primary audio data and supplementary data, wherein the primary audio data include audio data of at least one audio channel and the supplementary data include synchronization words and additional synchronization words; and
  
  (b) combining the primary audio data with at least some of the supplementary data to generate the encoded audio bitstream, such that said encoded audio bitstream comprises a sequence of frames, the frames are organized in a sequence of bursts, each of the bursts has a guard band segment and includes some of the frames, each of the frames has N audio segments, each of the audio segments comprises M bits, wherein step (b) includes a step of including at least the synchronization words as the P least significant bits of each of at least some of the audio segments or a step of including at least the synchronization words in at least one said guard band segment, and some of the primary audio data is included as the M-P most significant bits of said each of at least some of the audio segments, where each of N, M, and P is a positive integer, and P is less than M, andincludes a step of generating at least one additional encoded audio bitstream, including by;
  
  (c) providing additional primary audio data, wherein the additional primary audio data include audio data of at least one additional audio channel; and
  
  (d) combining the additional primary audio data with at least some of the supplementary data to generate the additional encoded audio bitstream, such that said additional encoded audio bitstream comprises a sequence of frames, the frames are organized in a sequence of bursts, each of the bursts has a guard band segment and includes some of the frames, each of the frames has N audio segments, each of the audio segments comprises M bits, wherein step (d) includes a step of including at least the additional synchronization words in at least one said guard band segment of the additional encoded audio bitstream or a step of including at least the additional synchronization words as the P least significant bits of each of at least some of the audio segments of the additional encoded audio bitstream, andwherein the synchronization words and the additional synchronization words are suitable for time aligning the primary audio data and the additional primary audio data, or the primary audio data, the additional primary audio data, and the supplementary data.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
- - 2. The method of claim 1, wherein each of the frames has the structure of an AES3 frame, and N=2, M=24, and P=4.
  - 3. The method of claim 2, wherein the encoded audio bitstream is a Dolby E bitstream.
  - 4. The method of claim 1, wherein the supplementary data includes at least one of additional audio content unrelated to the primary audio data, metadata associated with the primary audio data, synchronization words, protection bits, or metadata associated with the additional audio content.
  - 5. The method of claim 1, wherein the supplementary data includes processing state metadata indicative of the processing state of the primary audio data.
  - 6. The method of claim 1, wherein the primary audio data comprises at least one channel of audio content, and the supplementary data includes additional audio content comprising at least one additional audio channel.
  - 7. The method of claim 6, wherein the at least one additional audio channel comprises at least one object channel.
  - 8. The method of claim 3, wherein each of the bursts carries non-pulse-code modulated audio data in SMPTE 337 format, the non-pulse-code modulated audio data is or includes at least some of the primary audio data and at least some of the supplementary data, and each of the bursts corresponds to a time period equivalent to that of a corresponding video frame.
  - 9. The method of claim 8, wherein step (a) includes a step of encoding audio data, thereby generating the primary audio data.
  - 10. The method of claim 1, wherein the primary audio data and the supplementary data are pulse code modulated data.
  - 11. The method of claim 1, wherein at least some of the supplementary data is included in at least two intervals of at least one burst of the encoded audio bitstream, a first subset of the supplementary data is included in an interval of the burst, a second subset of the supplementary data is included in a later interval of the burst, the first subset of the supplementary data includes supplementary data corresponding to the second subset of the supplementary data in the later interval.
  - 12. The method of claim 11, wherein the supplementary data is or includes metadata useful for performing audio processing, and the first subset of the supplementary data is or includes metadata useful for performing at least one step of said audio processing.
  - 13. The method of claim 1, wherein the supplementary data is or includes metadata, and step (b) includes a step of including, in the encoded audio bitstream, a metadata segment including an escape code and at least some of the metadata.
  - 14. The method of claim 13, wherein the method includes steps of:
    - searching the metadata to be included in the metadata segment to identify an unused value which is not included in said metadata, and identifying the unused value as the escape code for the metadata segment; and
      
      searching the metadata to be included in the metadata segment to identify a predetermined data value, replacing said metadata with modified metadata, wherein the modified metadata is identical to the metadata except in that each identified occurrence of the predetermined data value is replaced by the escape code, and including the modified metadata in the metadata segment.
  - 15. The method of claim 1, wherein each metadata segment of a burst of the encoded audio bitstream includes a payload association code value for each metadata payload of the metadata segment, each payload association code value identifying each audio program indicated by the bitstream to which the payload applies.

16. A method for decoding encoded audio bitstreams, said method including steps of:
- (a) receiving an encoded audio bitstream and at least one additional encoded audio bitstream, wherein the encoded audio bitstream is indicative of primary audio data and supplementary data, the additional encoded audio bitstream is indicative of additional primary audio data and supplementary data, the primary audio data include audio data of at least one audio channel, and the additional primary audio data include audio data of at least one additional audio channel;
  
  (b) extracting the supplementary data, the primary audio data, and the additional primary audio data from the encoded audio bitstream and the additional encoded audio bitstream, wherein the supplementary data extracted from the encoded audio bitstream includes synchronization words, and wherein the supplementary data extracted from the additional encoded audio bitstream includes additional synchronization words;
  
  (c) decoding the primary audio data, thereby generating decoded audio data of the at least one audio channel, and decoding the additional primary audio data, thereby generating additional decoded audio data of the at least one additional audio channel; and
  
  (d) using the synchronization words and the additional synchronization words to time align the primary audio data and the additional primary audio data, or the primary audio data, the additional primary audio data, and the supplementary data,wherein each of the encoded audio bitstream and the additional encoded audio bitstream comprises a sequence of frames organized in a sequence of bursts, each of the bursts has a guard band segment and includes some of the frames, each of the frames has N audio segments, each of the audio segments comprises M bits, and wherein step (b) includes a step of extracting the synchronization words from at least one said guard band segment of the encoded audio bitstream or of extracting the synchronization words from the P least significant bits of each of at least some of the audio segments, extracting the additional synchronization words from at least one said guard band segment of the additional encoded audio bitstream or of extracting the additional synchronization words from the P least significant bits of each of at least some of the audio segments, extracting some of the primary audio data from each of at least a subset of the frames of each of the bursts of the encoded audio bitstream, and extracting some of the additional primary audio data from each of at least a subset of the frames of each of the bursts of the additional encoded audio bitstream,where each of N, M, and P is a positive integer, and P is less than M.
- View Dependent Claims (17, 18, 19, 20)
- - 17. The method of claim 16, wherein said method also includes a step of:
    - using the synchronization words and the additional synchronization words to synchronize the primary audio data and the additional primary audio data with corresponding video frames, or to synchronize the primary audio data, the additional primary audio data, and the supplementary data with corresponding video frames.
  - 18. The method of claim 16, wherein each of the encoded audio bitstream and the additional encoded audio bitstream includes at least one synchronization word which is not included in the supplementary data, wherein said method also includes a step of:
    - using each said synchronization word to time align the primary audio data and the additional primary audio data.
  - 19. The method of claim 16, wherein each of the frames has the structure of an AES3 frame, and N=2, M=24, and P=4.
  - 20. The method of claim 16, wherein the encoded audio bitstream is a Dolby E bitstream.

21. An audio processing unit for generating an encoded audio bitstream and at least one additional encoded audio bitstream, the audio processing unit comprising one or more processors configured to:
- (a) provide primary audio data and supplementary data, wherein the primary audio data include audio data of at least one audio channel and the supplementary data include synchronization words and additional synchronization words;
  
  (b) combine the primary audio data with at least some of the supplementary data to generate the encoded audio bitstream, such that said encoded audio bitstream comprises a sequence of frames, the frames are organized in a sequence of bursts, each of the bursts has a guard band segment and includes some of the frames, each of the frames has N audio segments, each of the audio segments comprises M bits, wherein step (b) includes a step of including at least the synchronization words as the P least significant bits of each of at least some of the audio segments or a step of including at least the synchronization words in at least one said guard band segment, and some of the primary audio data is included as the M-P most significant bits of said each of at least some of the audio segments, where each of N, M, and P is a positive integer, and P is less than M;
  
  (c) provide additional primary audio data, wherein the additional primary audio data include audio data of at least one additional audio channel; and
  
  (d) combine the additional primary audio data with at least some of the supplementary data to generate the additional encoded audio bitstream, such that said additional encoded audio bitstream comprises a sequence of frames, the frames are organized in a sequence of bursts, each of the bursts has a guard band segment and includes some of the frames, each of the frames has N audio segments, each of the audio segments comprises M bits, wherein step (d) includes a step of including at least the additional synchronization words in at least one said guard band segment of the additional encoded audio bitstream or a step of including at least the additional synchronization words as the P least significant bits of each of at least some of the audio segments of the additional encoded audio bitstream, andwherein the synchronization words and the additional synchronization words are suitable for time aligning the primary audio data and the additional primary audio data, or the primary audio data, the additional primary audio data, and the supplementary data.

22. An audio processing unit for decoding encoded audio bitstreams, the audio processing unit comprising one or more processors configured to:
- (a) receive an encoded audio bitstream and at least one additional encoded audio bitstream, wherein the encoded audio bitstream is indicative of primary audio data and supplementary data, the additional encoded audio bitstream is indicative of additional primary audio data and supplementary data, the primary audio data include audio data of at least one audio channel, and the additional primary audio data include audio data of at least one additional audio channel;
  
  (b) extract the supplementary data, the primary audio data, and the additional primary audio data from the encoded audio bitstream and the additional encoded audio bitstream, wherein the supplementary data extracted from the encoded audio bitstream includes synchronization words, and wherein the supplementary data extracted from the additional encoded audio bitstream includes additional synchronization words;
  
  (c) decode the primary audio data, thereby generating decoded audio data of the at least one audio channel, and decoding the additional primary audio data, thereby generating additional decoded audio data of the at least one additional audio channel; and
  
  (d) use the synchronization words and the additional synchronization words to time align the primary audio data and the additional primary audio data, or the primary audio data, the additional primary audio data, and the supplementary data,wherein each of the encoded audio bitstream and the additional encoded audio bitstream comprises a sequence of frames organized in a sequence of bursts, each of the bursts has a guard band segment and includes some of the frames, each of the frames has N audio segments, each of the audio segments comprises M bits, and wherein step (b) includes a step of extracting the synchronization words from at least one said guard band segment of the encoded audio bitstream or of extracting the synchronization words from the P least significant bits of each of at least some of the audio segments, extracting the additional synchronization words from at least one said guard band segment of the additional encoded audio bitstream or of extracting the additional synchronization words from the P least significant bits of each of at least some of the audio segments, extracting some of the primary audio data from each of at least a subset of the frames of each of the bursts of the encoded audio bitstream, and extracting some of the additional primary audio data from each of at least a subset of the frames of each of the bursts of the additional encoded audio bitstream, where each of N, M, and P is a positive integer, and P is less than M.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Dolby Laboratories Licensing Corporation (Dolby Laboratories Incorporated)
Original Assignee
Dolby Laboratories Licensing Corporation (Dolby Laboratories Incorporated)
Inventors
Riedmiller, Jeffrey, Farahani, Farhad, Sanchez, Freddie, Hoffmann, Michael, Grant, Michael

Application Number

US14/822,168
Publication Number

US 20150348558A1
Time in Patent Office

Days
Field of Search
US Class Current

1/1
CPC Class Codes

G10L 19/018 Audio watermarking, i.e. em...

G10L 19/167 Audio streaming, i.e. forma...

Audio Bitstreams with Supplementary Data and Encoding and Decoding of Such Bitstreams

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

26 Citations

22 Claims

Specification

Solutions

Use Cases

Quick Links

Audio Bitstreams with Supplementary Data and Encoding and Decoding of Such Bitstreams

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

26 Citations

22 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links