AUTOMATIC AUDIO DUCKING WITH REAL TIME FEEDBACK BASED ON FAST INTEGRATION OF SIGNAL LEVELS
Abstract
Various embodiments describe audio signal processing. In an example, a computer system generates metrics, such as RMS levels, for audio slices from a foreground audio signal. A summed-area table is generated from the metrics. An observation window, which includes a set of audio slices, is used to determine whether or not to add a key frame. A total metric, such as an average RMS level, is computed for the audio slices in the observation window. Based on the total metric, the computer system adds a key frame. The key frame references audio ducking parameters applicable to a background audio signal.
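The metric, summed-area table, and observation window steps in the abstract can be illustrated with a minimal sketch. It is not taken from the specification: the function names, the non-overlapping slices, and the window centered on the slice of interest are all assumptions made for illustration. The point it shows is that once the one-dimensional summed-area table (a running sum of the per-slice metrics) exists, the average over any window costs one subtraction and one division, regardless of window size.

```python
import math

def slice_rms(samples, slice_len):
    """Return one RMS level per non-overlapping slice of `slice_len` samples.

    Non-overlapping slices are an assumption of this sketch.
    """
    levels = []
    for start in range(0, len(samples), slice_len):
        chunk = samples[start:start + slice_len]
        levels.append(math.sqrt(sum(x * x for x in chunk) / len(chunk)))
    return levels

def summed_area_table(metrics):
    """1-D summed-area table: table[i] holds the sum of metrics[0:i]."""
    table = [0.0]
    for m in metrics:
        table.append(table[-1] + m)
    return table

def window_average(table, center, half_width):
    """Average metric over a window around slice `center`.

    The window is clipped at the signal boundaries; centering it on the
    slice is an assumption (the claims only require that the set of
    slices include the slice in question).
    """
    n = len(table) - 1
    lo = max(0, center - half_width)
    hi = min(n, center + half_width + 1)
    return (table[hi] - table[lo]) / (hi - lo)

# Hypothetical foreground signal: silence, two slices of speech, silence.
samples = [0.0] * 4 + [0.5] * 4 + [0.5] * 4 + [0.0] * 4
levels = slice_rms(samples, slice_len=4)
table = summed_area_table(levels)
avg_at_slice_1 = window_average(table, center=1, half_width=1)
```

Here `levels` holds one RMS value per slice, and `avg_at_slice_1` is the total metric for slice 1 computed over slices 0 through 2.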
20 Claims
1. A computer-implemented method for audio signal processing, the method comprising:
accessing, by a multimedia editing application hosted on a computing device, a first audio signal associated with a foreground label, wherein the foreground label indicates that the first audio signal is a foreground audio signal;
generating, by the multimedia editing application, metrics corresponding to audio slices of the first audio signal and indicating values for an audio property of the first audio signal, wherein each metric corresponds to an audio slice, indicates a value for the audio property in the audio slice, and is generated based on an audio signal of the audio slice;
computing, by the multimedia editing application, a total metric for an audio slice based on a set of the metrics corresponding to a set of the audio slices, wherein the set of the audio slices includes the audio slice; and
adding, by the multimedia editing application, a key frame to a track based on the total metric, wherein the track organizes a presentation of the first audio signal and of a second audio signal having a background label, wherein a location of the key frame corresponds to a location of the audio slice on the track, and wherein the key frame indicates a change to the audio property of the second audio signal at the location on the track.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13
14. A non-transitory computer-readable storage medium comprising instructions that, upon execution on a computer system, cause the computer system to perform operations comprising:
accessing a foreground audio signal;
generating metrics corresponding to audio slices of the foreground audio signal and indicating values for an audio property of the foreground audio signal, wherein each metric corresponds to an audio slice, indicates a value for the audio property in the audio slice, and is generated based on an audio signal of the audio slice;
computing a total metric for an audio slice based on a set of the metrics corresponding to a set of the audio slices, wherein the set of the audio slices includes the audio slice; and
adding a key frame to a track based on the total metric, wherein the track organizes a presentation of the foreground audio signal and of a background audio signal, wherein a location of the key frame corresponds to a location of the audio slice on the track, and wherein the key frame indicates a change to the audio property of the background audio signal at the location on the track.
Dependent claims: 15, 16, 17
18. A computer system comprising:
means for accessing a foreground audio signal;
means for generating metrics corresponding to audio slices of the foreground audio signal and indicating values for an audio property of the foreground audio signal, wherein each metric corresponds to an audio slice, indicates a value for the audio property in the audio slice, and is generated based on an audio signal of the audio slice;
means for computing a total metric for an audio slice based on a set of the metrics corresponding to a set of the audio slices, wherein the set of the audio slices includes the audio slice; and
means for adding a key frame to a track based on the total metric, wherein the track organizes a presentation of the foreground audio signal and of a background audio signal, wherein a location of the key frame corresponds to a location of the audio slice on the track, and wherein the key frame indicates a change to the audio property of the background audio signal at the location on the track.
Dependent claims: 19, 20
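The final step recited in each independent claim, adding a key frame to the track based on the total metric, might be sketched as follows. This is one possible reading and not the claimed method itself: the claims do not state a decision rule, so the fixed threshold, the `KeyFrame` fields, and the default duck amount of -12 dB are all hypothetical. The sketch adds a key frame only where the total metric crosses the threshold, so the background gain changes at the start and end of foreground activity.

```python
from dataclasses import dataclass

@dataclass
class KeyFrame:
    """Hypothetical key frame: a gain change for the background signal."""
    time_s: float   # location on the track, in seconds
    gain_db: float  # background gain that applies from this location on

def add_key_frames(total_metrics, slice_s, threshold, duck_db=-12.0):
    """Add a key frame wherever the total metric crosses `threshold`.

    `threshold` and `duck_db` are assumed parameters, not values from
    the source. Slice i is taken to start at i * slice_s seconds.
    """
    key_frames = []
    ducked = False
    for i, total in enumerate(total_metrics):
        active = total >= threshold
        if active != ducked:
            # Foreground activity started or stopped: change background gain.
            gain = duck_db if active else 0.0
            key_frames.append(KeyFrame(time_s=i * slice_s, gain_db=gain))
            ducked = active
    return key_frames

# Hypothetical total metrics (for example, windowed average RMS levels).
totals = [0.0, 0.3, 0.4, 0.1, 0.0]
frames = add_key_frames(totals, slice_s=0.5, threshold=0.2)
```

With these inputs, `frames` holds two key frames: one that ducks the background at 0.5 s and one that restores it at 1.5 s.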
Specification