Data framing for adaptive-block-length coding system

US 6,226,608 B1
Filed: 01/28/1999
Issued: 05/01/2001
Est. Priority Date: 01/28/1999
Status: Expired due to Term

First Claim

Patent Images

1. A method for audio encoding that comprises steps performing the acts of:

receiving a reference signal conveying alignment of video information frames in a sequence of video information frames in which adjacent frames are separated by a frame interval;

receiving an audio signal conveying audio information;

analyzing the audio signal to identify characteristics of the audio information;

generating a control signal in response to the characteristics of the audio information, wherein the control signal conveys segment lengths for segments of the audio information in a sequence of overlapping segments, a respective segment having a respective overlap interval with an adjacent segment and the sequence having a length equal to the frame interval plus a frame overlap interval;

applying an adaptive block encoding process to the overlapping segments in the sequence to generate a plurality of blocks of encoded information, wherein the block encoding process adapts in response to the control signal; and

assembling the plurality of blocks of encoded information and control information conveying the segment lengths to form an encoded information frame that is aligned with the reference signal.

View all claims

2 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

An audio encoder applies an adaptive block-encoding process to segments of audio information to generate frames of encoded information that are aligned with a reference signal conveying the alignment of a sequence of video information frames. The audio information is analyzed to determine various characteristics of the audio signal such as the occurrence and location of a transient, and a control signal is generated that causes the adaptive block-encoding process to encode segments of varying length. A complementary decoder applies an adaptive block-decoding process to recover the segments of audio information from the frames of encoded information. In embodiments that apply time-domain aliasing cancellation (TDAC) transforms, window functions and transforms are applied according to one of a plurality of segment patterns that define window functions and transform parameters for each segment in a sequence of segments. The segments in each frame of a sequence of overlapping frames may be recovered without aliasing artifacts independently from the recovery of segments in other frames. Window functions are adapted to provide preferred frequency-domain responses and time-domain gain profiles.

Citations

32 Claims

1. A method for audio encoding that comprises steps performing the acts of:
- receiving a reference signal conveying alignment of video information frames in a sequence of video information frames in which adjacent frames are separated by a frame interval;
  
  receiving an audio signal conveying audio information;
  
  analyzing the audio signal to identify characteristics of the audio information;
  
  generating a control signal in response to the characteristics of the audio information, wherein the control signal conveys segment lengths for segments of the audio information in a sequence of overlapping segments, a respective segment having a respective overlap interval with an adjacent segment and the sequence having a length equal to the frame interval plus a frame overlap interval;
  
  applying an adaptive block encoding process to the overlapping segments in the sequence to generate a plurality of blocks of encoded information, wherein the block encoding process adapts in response to the control signal; and
  
  assembling the plurality of blocks of encoded information and control information conveying the segment lengths to form an encoded information frame that is aligned with the reference signal.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
- - 2. A method for audio encoding according to claim 1 wherein the block encoding process applies a bank of bandpass filters or a transform to the segments of the audio information to generate blocks of subband signals or transform coefficients, respectively.
  - 3. A method for audio encoding according to claim 1 wherein the block encoding process applies a respective analysis window function to each segment of the audio information to generate windowed segments and applies a time-domain aliasing cancellation analysis transform to the windowed segments to generate blocks of transform coefficients.
  - 4. A method for audio encoding according to claim 3 that adapts the analysis window function and the time-domain aliasing cancellation analysis transform to generate a block representing an end segment in the sequence of segments for a respective encoded information frame that permits an application of a complementary synthesis transform and synthesis window function to recover audio information with substantially no time-domain aliasing in the overlap interval of the end segment in the sequence.
  - 5. A method for audio encoding according to claim 4 wherein the block encoding process constrains the segment lengths to be an integer power of two.
  - 6. A method for audio encoding according to claim 4 wherein the block encoding process adapts the segment lengths between a maximum segment length and a minimum segment length and, for a respective encoded information frame, applies either:
7. A method for audio encoding according to claim 6 wherein all segments in the short-short sequence have identical lengths.
8. A method for audio encoding according to claim 6 wherein all analysis window functions in the short-short sequence have non-zero portions that are identical in shape and length.
9. A method for audio encoding according to claim 3 wherein the block encoding process constrains the segment lengths to be an integer power of two.
10. A method for audio encoding according to claim 3 wherein the block encoding process adapts the segment lengths between a maximum segment length and a minimum segment length and, for a respective encoded information frame, applies either:
- a long-long sequence of analysis window functions to a sequence of segments having lengths equal to the maximum segment length;
  
  a short-short sequence of analysis window functions to a sequence of segments having effective lengths equal to the minimum segment length;
  
  a bridge-long sequence of analysis window functions to a sequence of segments having lengths that shift from the minimum segment length to the maximum segment length, wherein the bridge-long sequence comprises a first bridge sequence of window functions followed by a window function for a segment having a length equal to the maximum segment length;
  
  a long-bridge sequence of analysis window functions to a sequence of segments having lengths that shift from the maximum segment length to the minimum segment length, wherein the long-bridge sequence comprises a window function for a segment having a length equal to the maximum segment length followed by a second bridge sequence of window functions;
  
  or a bridge-bridge sequence of analysis window functions to a sequence of segments having varying lengths, wherein the bridge-bridge sequence comprises the first bridge sequence followed by the second bridge sequence.
11. A method for audio encoding according to claim 10 wherein all segments in the short-short sequence have identical lengths.
12. A method for audio encoding according to claim 10 wherein all analysis window functions in the short-short sequence have non-zero portions that are identical in shape and length.
13. A method for audio encoding according to claim 1 that comprises converting the audio information from an input audio sample rate to an internal audio sample rate prior to applying the block encoding process, wherein the reference signal conveys a video information frame rate and the internal audio sample rate is equal to an integer multiple of the video information frame rate.

14. A method for audio decoding that comprises steps performing the acts of:
- receiving a reference signal conveying alignment of video information frames in a sequence of video information frames in which adjacent frames are separated by a frame interval;
  
  receiving encoded information frames that are aligned with the reference signal and each comprise control information and a plurality of blocks of encoded audio information;
  
  generating a control signal in response to the control information, wherein the control signal conveys segment lengths for segments of audio information in a sequence of overlapping segments, a respective segment having a respective overlap interval with an adjacent segment and the sequence having a length equal to the frame interval plus a frame overlap interval;
  
  applying an adaptive block decoding process to the plurality of blocks of encoded audio information in a respective encoded information frame, wherein the block decoding process adapts in response to the control signal to generate the sequence of overlapping segments of audio information.
- View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26)
- - 15. A method for audio decoding according to claim 14 wherein the block decoding process applies a bank of bandpass synthesis filters or a synthesis transform to the plurality of blocks of encoded information to generate the overlapping segments of audio information.
  - 16. A method for audio decoding according to claim 14 wherein the block decoding process applies a time-domain aliasing cancellation synthesis transform to the plurality of blocks of encoded information and applies respective synthesis windows function to the results of the synthesis transform to generate the overlapping segments of audio information.
  - 17. A method for audio decoding according to claim 16 that adapts the time-domain aliasing cancellation synthesis transform and applies a synthesis window function to the results of the transform to recover an end segment in the sequence for the respective encoded information frame with substantially no time-domain aliasing in the overlap interval of the end segment in the sequence.
  - 18. A method for audio decoding according to claim 17 wherein the block decoding process is constrained to generate segments having lengths that are an integer power of two.
  - 19. A method for audio decoding according to claim 17 wherein the block decoding process decodes blocks representing segments of audio information representing segments of audio information having different lengths between a maximum segment length and a minimum segment length and, for a respective encoded information frame, applies either:
20. A method for audio decoding according to claim 19 wherein all segments generated from the short-short sequence have identical lengths.
21. A method for audio decoding according to claim 19 wherein all synthesis window functions in the short-short sequence have non-zero portions that are identical in shape and length.
22. A method for audio decoding according to claim 16 wherein the block decoding process is constrained to generate segments having lengths that are an integer power of two.
23. A method for audio decoding according to claim 16 wherein the block decoding process decodes blocks representing segments of audio information representing segments of audio information having different lengths between a maximum segment length and a minimum segment length and, for a respective encoded information frame, applies either:
- a long-long sequence of synthesis window functions to a sequence of segments having lengths equal to the maximum segment length;
  
  a short-short sequence of synthesis window functions to a sequence of segments having effective lengths equal to the minimum segment length;
  
  a bridge-long sequence of synthesis window functions to a sequence of segments having lengths that shift from the minimum segment length to the maximum segment length, wherein the bridge-long sequence comprises a first bridge sequence of window functions followed by a window function for a segment having a length equal to the maximum segment length;
  
  a long-bridge sequence of synthesis window functions to a sequence of segments having lengths that shift from the maximum segment length to the minimum segment length, wherein the long-bridge sequence comprises a window function for a segment having a length equal to the maximum segment length followed by a second bridge sequence of window functions;
  
  or a bridge-bridge sequence of synthesis window functions to a sequence of segments having varying lengths, wherein the bridge-bridge sequence comprises the first bridge sequence followed by the second bridge sequence.
24. A method for audio decoding according to claim 23 wherein all segments generated from the short-short sequence have identical lengths.
25. A method for audio decoding according to claim 23 wherein all synthesis window functions in the short-short sequence have non-zero portions that are identical in shape and length.
26. A method for audio decoding according to claim 14 that analyzes control information obtained from two encoded information frames to detect a discontinuity and, in response, adapts frequency response characteristics of the block decoding process in recovering first or last segments of audio information in a respective sequence of segments for either of the two encoded information frames.

27. An information storage medium carrying:
- video information arranged in video frames; and
  
  encoded audio information arranged in encoded information frames, wherein a respective encoded information frame corresponds to a respective video frame and includes control information conveying segment lengths for segments of audio information in a sequence of overlapping segments, a respective segment having a respective overlap interval with an adjacent segment and the sequence having a length equal to the frame interval plus a frame overlap interval, and blocks of encoded audio information, a respective block having a respective length and respective content that, when processed by an adaptive block-decoding process, results in a respective segment of audio information in the sequence of overlapping segments.
- View Dependent Claims (28, 29, 30, 31, 32)
- - 28. An information storage medium according to claim 27 wherein the respective block of encoded information has respective content that results in the respective segment of audio information when processed by an adaptive decoding process that includes applying a time-domain aliasing cancellation synthesis transform and applying a synthesis window function.
  - 29. An information storage medium according to claim 28 where the adaptive block decoding process adapts the time-domain aliasing cancellation synthesis transform and adapts the synthesis window function to generate the sequence of overlapping segments of audio information that independently has substantially no time-domain aliasing.
  - 30. An information storage medium according to claim 29 wherein all blocks of encoded audio information represent segments of audio information that have respective lengths that are integer powers of two.
  - 31. An information storage medium according to claim 28 wherein all blocks of encoded audio information represent segments of audio information that have respective lengths that are integer powers of two.
  - 32. An information storage medium according to claim 27 wherein the control information includes an indication of order of the respective encoded information frame within a sequence of encoded information frames.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Dolby Laboratories Licensing Corporation (Dolby Laboratories Incorporated)
Original Assignee
Dolby Laboratories Licensing Corporation (Dolby Laboratories Incorporated)
Inventors
Truman, Michael Mead, Fielder, Louis Dunn
Primary Examiner(s)
Korzuch, William R.
Assistant Examiner(s)
Chawan, Vijay B

Application Number

US09/239,345
Time in Patent Office

824 Days
Field of Search

704/229, 704/500, 704/211, 704/200.1, 704/201, 704/503, 704/502, 704/504, 386/68, 386/81, 348/515, 375/240.28
US Class Current

704/229
CPC Class Codes

G10L 19/022 Blocking, i.e. grouping of ...

Data framing for adaptive-block-length coding system

First Claim

2 Assignments

0 Petitions

Accused Products

Abstract

Citations

32 Claims

Specification

Solutions

Use Cases

Quick Links

Data framing for adaptive-block-length coding system

First Claim

2 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

32 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links