Automatic audio ducking with real time feedback based on fast integration of signal levels
First Claim
1. A computer-implemented method for audio signal processing, the method comprising:
accessing, by a multimedia editing application hosted on a computing device, a first audio signal associated with a foreground label, wherein the foreground label indicates that the first audio signal is a foreground audio signal;
analyzing, by the multimedia editing application, the first audio signal to determine metrics corresponding to audio slices of the first audio signal, wherein each metric corresponds to a respective audio slice, indicates a value for an audio property in the respective audio slice, and is generated by the multimedia editing application based on an audio signal of the respective audio slice;
computing, by the multimedia editing application, a total metric for an audio slice as a function of a set of the metrics corresponding to a set of the audio slices, wherein the set of the audio slices includes the audio slice; and
adding, by the multimedia editing application, a key frame to a track based on the total metric, wherein the track organizes a presentation of the first audio signal and of a second audio signal having a background label, wherein a location of the key frame corresponds to a location of the audio slice on the track, and wherein the key frame indicates a change to the audio property of the second audio signal at the location on the track, wherein the key frame added to the track based on the total metric is utilizable for audio ducking, the audio ducking comprising adapting a property of the second audio signal based on a state of the first audio signal.
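As a rough illustration of the final step of the claim, the sketch below adds a key frame whenever the total metric crosses a threshold. The claim does not specify the decision rule, so the threshold comparison, the `KeyFrame` type, the `add_key_frames` function, and the -12 dB ducking gain are all hypothetical choices made for this example.

```python
from dataclasses import dataclass


@dataclass
class KeyFrame:
    # Hypothetical key frame record; the claim does not define its fields.
    slice_index: int  # location on the track, counted in audio slices
    gain_db: float    # target gain for the background signal from this point


def add_key_frames(total_metrics, threshold, duck_db=-12.0):
    """Emit a key frame each time the foreground switches between quiet and active.

    total_metrics: one total metric per audio slice (for example an average RMS).
    threshold and duck_db are assumed tuning values, not taken from the patent.
    """
    key_frames = []
    ducked = False
    for i, total in enumerate(total_metrics):
        active = total >= threshold
        if active != ducked:
            # Foreground state changed: duck the background, or restore it to 0 dB.
            key_frames.append(KeyFrame(i, duck_db if active else 0.0))
            ducked = active
    return key_frames
```

With total metrics `[0.0, 0.1, 0.6, 0.7, 0.2]` and a threshold of `0.5`, this sketch places a ducking key frame at slice 2 and a restoring key frame at slice 4.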
Abstract
Various embodiments describe audio signal processing. In an example, a computer system generates metrics, such as RMS levels, for audio slices from a foreground audio signal. A summed-area table is generated from the metrics. An observation window is used to determine whether to add a key frame or not. The observation window includes a set of audio slices. A total metric, such as an average RMS level, is computed for the audio slices in the observation window. Based on the total metric, the computer system adds a key frame. The key frame references audio ducking parameters applicable to a background audio signal.
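The role of the summed-area table can be sketched in a few lines: once the per-slice metrics are accumulated into a running sum, the average over any observation window is a single subtraction and division, regardless of window length. This is a minimal sketch, not the patented implementation. The function names, the fixed slice length, the centered window, and the choice to average RMS levels directly are assumptions made for illustration.

```python
import math


def slice_rms(samples, slice_len):
    """Return one RMS level per fixed-length slice of `samples` (assumed slicing)."""
    levels = []
    for start in range(0, len(samples), slice_len):
        chunk = samples[start:start + slice_len]
        levels.append(math.sqrt(sum(x * x for x in chunk) / len(chunk)))
    return levels


def prefix_sums(metrics):
    """One-dimensional summed-area table: table[i] is the sum of metrics[0:i]."""
    table = [0.0]
    for m in metrics:
        table.append(table[-1] + m)
    return table


def window_average(table, center, half_width):
    """Average metric over an observation window centered on slice `center`.

    The window is clipped at the ends of the signal; the centered shape is
    an assumption of this sketch.
    """
    n = len(table) - 1
    lo = max(0, center - half_width)
    hi = min(n, center + half_width + 1)
    return (table[hi] - table[lo]) / (hi - lo)
```

Building the table costs one pass over the metrics, after which each window lookup takes constant time. This is presumably what makes real-time feedback practical when the window size changes.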
20 Claims
1. A computer-implemented method for audio signal processing, the method comprising:
accessing, by a multimedia editing application hosted on a computing device, a first audio signal associated with a foreground label, wherein the foreground label indicates that the first audio signal is a foreground audio signal;
analyzing, by the multimedia editing application, the first audio signal to determine metrics corresponding to audio slices of the first audio signal, wherein each metric corresponds to a respective audio slice, indicates a value for an audio property in the respective audio slice, and is generated by the multimedia editing application based on an audio signal of the respective audio slice;
computing, by the multimedia editing application, a total metric for an audio slice as a function of a set of the metrics corresponding to a set of the audio slices, wherein the set of the audio slices includes the audio slice; and
adding, by the multimedia editing application, a key frame to a track based on the total metric, wherein the track organizes a presentation of the first audio signal and of a second audio signal having a background label, wherein a location of the key frame corresponds to a location of the audio slice on the track, and wherein the key frame indicates a change to the audio property of the second audio signal at the location on the track, wherein the key frame added to the track based on the total metric is utilizable for audio ducking, the audio ducking comprising adapting a property of the second audio signal based on a state of the first audio signal.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13
14. A non-transitory computer-readable storage medium comprising instructions that, upon execution on a computer system, cause the computer system to perform operations comprising:
accessing a foreground audio signal;
analyzing the foreground audio signal to determine metrics corresponding to audio slices of the foreground audio signal, wherein each metric corresponds to a respective audio slice, indicates a value for an audio property in the respective audio slice, and is generated automatically based on an audio signal of the respective audio slice;
computing a total metric for an audio slice as a function of a set of the metrics corresponding to a set of the audio slices, wherein the set of the audio slices includes the audio slice; and
adding a key frame to a track based on the total metric, wherein the track organizes a presentation of the foreground audio signal and of a background audio signal, wherein a location of the key frame corresponds to a location of the audio slice on the track, and wherein the key frame indicates a change to the audio property of the background audio signal at the location on the track, wherein the key frame added to the track based on the total metric is utilizable for audio ducking, the audio ducking comprising adapting a property of the background audio signal based on a state of the foreground audio signal.
Dependent claims: 15, 16, 17
18. A computer system comprising:
means for accessing a foreground audio signal;
means for analyzing the foreground audio signal to determine metrics corresponding to audio slices of the foreground audio signal, wherein each metric corresponds to a respective audio slice, indicates a value for an audio property in the respective audio slice, and is generated automatically based on an audio signal of the respective audio slice;
means for computing a total metric for an audio slice as a function of a set of the metrics corresponding to a set of the audio slices, wherein the set of the audio slices includes the audio slice; and
means for adding a key frame to a track based on the total metric, wherein the track organizes a presentation of the foreground audio signal and of a background audio signal, wherein a location of the key frame corresponds to a location of the audio slice on the track, and wherein the key frame indicates a change to the audio property of the background audio signal at the location on the track, wherein the key frame added to the track based on the total metric is utilizable for audio ducking, the audio ducking comprising adapting a property of the background audio signal based on a state of the foreground audio signal.
Dependent claims: 19, 20
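The claims end with key frames that are "utilizable for audio ducking" and leave open how the background signal is actually adapted. One plausible reading, sketched below, treats each key frame as a gain change in decibels that holds until the next key frame. The `(slice_index, gain_db)` tuple layout, the hold-until-next behavior, and both function names are assumptions of this example, not details from the claims.

```python
def gain_per_slice(key_frames, num_slices, initial_db=0.0):
    """Expand (slice_index, gain_db) key frames into one gain value per slice.

    Each key frame's gain is held until the next key frame (assumed behavior;
    an editor might instead interpolate between key frames).
    """
    pending = sorted(key_frames)
    gains = []
    current = initial_db
    k = 0
    for i in range(num_slices):
        while k < len(pending) and pending[k][0] <= i:
            current = pending[k][1]
            k += 1
        gains.append(current)
    return gains


def apply_gain(samples, gains_db, slice_len):
    """Scale background samples by the gain of the slice each sample falls in."""
    out = []
    for n, x in enumerate(samples):
        db = gains_db[min(n // slice_len, len(gains_db) - 1)]
        out.append(x * 10 ** (db / 20))  # convert decibels to a linear factor
    return out
```

A step change in gain is audible as a click in practice, so a real editor would likely ramp between key frames. The hold behavior is used here only to keep the sketch short.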
Specification