Adaptive window-size selection in transform coding
Abstract
A transform coder adaptively configures window sizes for transform coding in a two-pass process to maximize coding efficiency, while achieving necessary time resolution to avoid pre-echo. In a first pass, the coder places small size windows over detected transient regions of an input signal in an open-loop window configuration process. In a second pass, the coder adjusts the window size configuration according to measurements of the achieved quality in a closed-loop window configuration process. Where quality measurement shows unacceptable quantization noise, the coder increases window size. Where pre-echo is detected, the coder reduces window size within coding bit rate constraints.
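The first pass described above depends on locating transients. The abstract does not say how they are detected, so the following is only a minimal sketch under an assumed rule: a block counts as a transient when its energy exceeds the previous block's energy by a fixed ratio. The function name `detect_transients` and the parameters `block`, `ratio`, and `floor` are hypothetical.

```python
def detect_transients(samples, block=64, ratio=8.0, floor=1e-9):
    """Return start offsets of blocks whose energy jumps sharply.

    Assumed rule (not taken from the patent text): a block is flagged when
    its energy is more than `ratio` times the previous block's energy.
    `floor` keeps a perfectly silent previous block from hiding an onset.
    """
    energies = [
        sum(x * x for x in samples[i:i + block])
        for i in range(0, len(samples) - block + 1, block)
    ]
    hits = []
    for k in range(1, len(energies)):
        if energies[k] > ratio * (energies[k - 1] + floor):
            hits.append(k * block)  # sample offset where the jump begins
    return hits


if __name__ == "__main__":
    # Silence followed by a sudden full-scale step at sample 256.
    signal = [0.0] * 256 + [1.0] * 256
    print(detect_transients(signal))
```

A production detector would more likely work on band-filtered or perceptually weighted energies; the plain energy ratio is used here only to keep the example short.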
49 Claims
1. A transform coder comprising:
a transient detection component operating to process samples of an input signal to identify locations of transients in the input signal;
an open-loop window configuration component operating in response to the identified transient location to configure a first configuration of sizes of a plurality of transform input windows over the input signal selected from at least a small window size, a large window size, and an intermediate window size, so as to place one or more windows of the small window size to encompass a region of the input signal having at least one identified transient location and place windows of the large window size in areas of the input signal having no identified transient locations;
an encoding component for transform coding the input signal according to the first configuration of transform input window sizes, and for decoding to produce a reconstructed signal;
a quality measurement component operating to measure achieved quality of the reconstructed signal; and
a closed-loop window configuration component operating in response to the achieved quality measurement to adjust sizes of the transform input windows in the first configuration according to the achieved quality measurement to produce a second configuration of transform input windows for use in transform coding the input signal.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
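The chain of components recited in claim 1 can be pictured as a small driver that wires five stages together. This is a structural sketch only: every name in it (`two_pass_configure`, `detect`, `open_loop`, `codec_roundtrip`, `measure`, `closed_loop`) is hypothetical, and each stage is passed in as a callable because the claim does not fix how any of them is implemented.

```python
from typing import Callable, List, Sequence

Windows = List[int]  # window sizes in samples, in signal order


def two_pass_configure(
    signal: Sequence[float],
    detect: Callable[[Sequence[float]], List[int]],
    open_loop: Callable[[int, List[int]], Windows],
    codec_roundtrip: Callable[[Sequence[float], Windows], Sequence[float]],
    measure: Callable[[Sequence[float], Sequence[float], Windows], List[float]],
    closed_loop: Callable[[Windows, List[float]], Windows],
) -> Windows:
    """Run the open-loop pass, then refine it with measured quality."""
    transients = detect(signal)                     # transient detection
    first = open_loop(len(signal), transients)      # first configuration
    reconstructed = codec_roundtrip(signal, first)  # encode, then decode
    quality = measure(signal, reconstructed, first) # achieved quality
    return closed_loop(first, quality)              # second configuration
```

Keeping the stages as parameters makes the data flow of the claim visible: the second configuration depends on the first only through the reconstructed signal and its quality measurement.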
9. In a transform coder, a method of adaptively selecting transform window size, the method comprising:
detecting locations of transients in an input signal;
for a frame of the input signal in which no transient location is detected, configuring size of a transform window to be a first window size;
for a frame of the input signal in which at least one transient location is detected, configuring sizes of a plurality of transform windows in the frame to comprise a consecutive set of at least one second-size window substantially encompassing the transient locations in the frame and at least one third-size window before the transient, where the second window size is smaller than the first window size and where the third window size is intermediate to the first and second window sizes; and
transform encoding the input signal according to a first transform window configuration including the configured sizes of transform windows.
Dependent claims: 10, 11, 12, 13, 14, 15, 16, 17, 18
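The per-frame layout in claim 9 can be illustrated with a simple tiling routine. The sketch below assumes concrete sizes that the claim does not specify (a 2048-sample frame as the first size, 256 as the second, 512 as the third) and ignores the overlap and window-shape constraints a real lapped transform would impose. The name `configure_frame` is hypothetical.

```python
def configure_frame(frame_len, transients, small=256, mid=512):
    """Tile one frame with window sizes (illustrative values only).

    No transient: a single window of the full frame length.
    Otherwise: a consecutive run of `small` windows covering every
    transient, with `mid` windows filling the aligned space before and
    after the run, and `small` windows wherever a `mid` one would not fit.
    """
    assert frame_len % mid == 0 and mid % small == 0
    if not transients:
        return [frame_len]
    first = (min(transients) // small) * small     # start of the small run
    last = (max(transients) // small + 1) * small  # end of the run, exclusive
    windows, pos = [], 0
    while pos < frame_len:
        outside_run = pos + mid <= first or pos >= last
        if pos % mid == 0 and pos + mid <= frame_len and outside_run:
            size = mid
        else:
            size = small
        windows.append(size)
        pos += size
    return windows


if __name__ == "__main__":
    print(configure_frame(2048, []))      # [2048]
    print(configure_frame(2048, [1300]))  # [512, 512, 256, 256, 512]
```

When the transient falls at the very start of the frame, this sketch has no room to place an intermediate window before it within the same frame. A fuller implementation would need to look at the preceding frame.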
19. In a transform coder, a method of adaptively selecting transform window size, the method comprising:
detecting locations of transients in an input signal;
for a frame of the input signal in which no transient location is detected, configuring size of a transform window to be a first window size;
for a frame of the input signal in which at least one transient location is detected, configuring sizes of a plurality of transform windows in the frame to comprise a consecutive set of at least one second-size window substantially encompassing the transient locations in the frame, where the second window size is smaller than the first window size;
transform encoding the input signal according to a first transform window configuration including the configured sizes of transform windows;
measuring achieved perceptual quality of the transform-encoded signal;
re-configuring the size of at least some of the transform windows configured in the first transform window configuration according to the measured perceptual quality to produce a second transform window configuration; and
transform encoding the input signal according to the second transform window configuration.
Dependent claims: 20, 21, 22, 23, 24, 25, 26, 29, 30, 31, 32, 33
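The re-configuring step of claim 19 can be sketched as a pass over the first configuration that consumes a per-window verdict. Following the abstract, windows showing pre-echo are made smaller and windows showing unacceptable quantization noise are made larger. Everything else is an assumption made for illustration: the verdict labels `"pre_echo"`, `"noise"`, and `"ok"`, the size limits, and the `split_budget` counter that stands in for a bit-rate constraint.

```python
def reconfigure(windows, verdicts, min_size=256, max_size=2048, split_budget=2):
    """Derive a second window configuration from per-window verdicts.

    "pre_echo": split the window into two halves, if size and budget allow.
    "noise":    merge with the next window when it has the same size, is
                not itself flagged for pre-echo, and the merged size stays
                within max_size.
    Anything else leaves the window unchanged.
    """
    assert len(windows) == len(verdicts)
    out, i = [], 0
    while i < len(windows):
        size, verdict = windows[i], verdicts[i]
        if verdict == "pre_echo" and size // 2 >= min_size and split_budget > 0:
            out += [size // 2, size // 2]
            split_budget -= 1
            i += 1
        elif (verdict == "noise" and i + 1 < len(windows)
              and windows[i + 1] == size
              and verdicts[i + 1] != "pre_echo"
              and 2 * size <= max_size):
            out.append(2 * size)
            i += 2  # the neighbour was absorbed into the merged window
        else:
            out.append(size)
            i += 1
    return out


if __name__ == "__main__":
    first = [512, 512, 256, 256, 512]
    verdicts = ["pre_echo", "ok", "noise", "ok", "ok"]
    print(reconfigure(first, verdicts))  # [256, 256, 512, 512, 512]
```

Splitting and merging both preserve the total number of samples covered, so the second configuration still tiles the same span of signal as the first.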
27. In a transform coder, a method of adaptively selecting transform window size, the method comprising:
detecting locations of transients in a current frame of an input signal;
measuring achieved perceptual quality of at least one prior transform-encoded frame of the input signal;
determining a minimal window size for the current frame based on the measured achieved perceptual quality of the at least one prior transform-encoded frame;
for a first case in which no transient location is detected in the current frame, configuring size of a transform window to be a first window size;
for a second case in which at least one transient location is detected in the current frame of the input signal, configuring sizes of a plurality of transform windows in the frame to comprise a consecutive set of at least one second-size window substantially encompassing the transient locations in the frame, where the second window size is the minimal window size for the current frame; and
transform encoding the current frame of the input signal according to the configured sizes of transform windows.
Dependent claims: 28
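Claim 27 moves the quality feedback into the open-loop decision: the smallest window allowed in the current frame is chosen from the quality achieved on earlier frames. The claim does not give the mapping, so the sketch below assumes a quality score in [0, 1] (higher is better) and two made-up thresholds. The names `minimal_window_size` and `small_window_run` are hypothetical.

```python
def minimal_window_size(prior_quality, good=0.8, fair=0.5):
    """Pick the smallest window size permitted for the current frame.

    Assumed policy: the worse the prior frame sounded, the larger the
    minimum, on the reasoning that very small windows cost coding
    efficiency the coder may not be able to afford.
    """
    if prior_quality >= good:
        return 256
    if prior_quality >= fair:
        return 512
    return 1024


def small_window_run(transients, prior_quality):
    """Return (start, size) pairs for the consecutive run of minimal-size
    windows that covers every transient location in the frame."""
    size = minimal_window_size(prior_quality)
    first = (min(transients) // size) * size
    last = (max(transients) // size + 1) * size
    return [(start, size) for start in range(first, last, size)]


if __name__ == "__main__":
    print(small_window_run([1300, 1600], prior_quality=0.9))
    print(small_window_run([1300, 1600], prior_quality=0.2))
```

With good prior quality the two transients are covered by two 256-sample windows. With poor prior quality a single 1024-sample window covers both, trading time resolution for coding efficiency.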
34. A program storage medium having a transform coding program executable on an audio processing device to perform a method of adaptively selecting transform window size, the method comprising:
detecting locations of transients in an input signal;
for a frame of the input signal in which no transient location is detected, configuring size of a transform window to be a first window size;
for a frame of the input signal in which at least one transient location is detected, configuring sizes of a plurality of transform windows in the frame to comprise a consecutive set of at least one second-size window substantially encompassing the transient locations in the frame, where the second window size is smaller than the first window size;
transform encoding the input signal according to a first transform window configuration including the configured sizes of transform windows;
measuring achieved perceptual quality of the transform-encoded signal;
re-configuring the size of at least some of the transform windows configured in the first transform window configuration according to the measured perceptual quality to produce a second transform window configuration; and
transform encoding the input signal according to the second transform window configuration.
Dependent claims: 35, 36, 37, 38, 39, 40, 41
42. A data carrying medium having a transform coded signal carried thereon produced according to a method comprising:
detecting locations of transients in an input signal;
for a frame of the input signal in which no transient location is detected, configuring size of a transform window to be a first window size;
for a frame of the input signal in which at least one transient location is detected, configuring sizes of a plurality of transform windows in the frame to comprise a consecutive set of at least one second-size window substantially encompassing the transient locations in the frame, where the second window size is smaller than the first window size;
transform encoding the input signal according to a first transform window configuration including the configured sizes of transform windows;
measuring achieved perceptual quality of the transform-encoded signal;
re-configuring the size of at least some of the transform windows configured in the first transform window configuration according to the measured perceptual quality to produce a second transform window configuration; and
transform encoding the input signal according to the second transform window configuration.
Dependent claims: 43, 44, 45, 46, 47, 48, 49
Specification