Multiple range dynamic level control
First Claim
Patent Images
1. A computing device, comprising:
- a processor;
one or more microphones configured to generate an input audio signal;
one or more speakers; and
memory, accessible by the processor and storing instructions that are executable by the processor to perform acts in multiple repetitions, the acts of each repetition comprising;
detecting voice presence in the input audio signal;
determining a voice level associated with the voice presence in the input audio signal;
comparing the voice level to at least one of a plurality of threshold amplitudes, each threshold amplitude of the plurality of threshold amplitudes corresponding to one of multiple level ranges;
identifying one of the multiple level ranges to which the voice level corresponds based at least in part on the comparing;
selecting an audio gain based at least in part on the identified one of the multiple level ranges;
smoothing the selected audio gain over time;
scaling the input audio signal by the selected and smoothed audio gain to produce an intermediate audio signal; and
attenuating the intermediate audio signal to reduce clipping, wherein the attenuating produces an output audio signal for output by the one or more speakers.
2 Assignments
0 Petitions
Accused Products
Abstract
An audio-based system may perform dynamic level adjustment by detecting voice activity in an input signal and evaluating voice levels during periods of voice activity. The current voice level is compared to a plurality of thresholds to determine a corresponding gain strategy, and the input signal is scaled in accordance with this gain strategy. Further adjustment to the signal is performed to reduce output clipping that might otherwise be produced.
34 Citations
20 Claims
-
1. A computing device, comprising:
-
a processor; one or more microphones configured to generate an input audio signal; one or more speakers; and memory, accessible by the processor and storing instructions that are executable by the processor to perform acts in multiple repetitions, the acts of each repetition comprising; detecting voice presence in the input audio signal; determining a voice level associated with the voice presence in the input audio signal; comparing the voice level to at least one of a plurality of threshold amplitudes, each threshold amplitude of the plurality of threshold amplitudes corresponding to one of multiple level ranges; identifying one of the multiple level ranges to which the voice level corresponds based at least in part on the comparing; selecting an audio gain based at least in part on the identified one of the multiple level ranges; smoothing the selected audio gain over time; scaling the input audio signal by the selected and smoothed audio gain to produce an intermediate audio signal; and attenuating the intermediate audio signal to reduce clipping, wherein the attenuating produces an output audio signal for output by the one or more speakers. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method of dynamically controlling an audio level, comprising:
-
specifying a plurality of thresholds to define multiple level ranges and corresponding gain strategies; detecting voice presence in one or more audio signals, the one or more audio signals including the voice presence and other noise; determining a voice level associated with the voice presence in the one or more audio signals; comparing the voice level to the plurality of thresholds to identify one of the multiple level ranges to which the determined voice level corresponds; and selecting an audio gain based at least in part on the identified one of the multiple level ranges. - View Dependent Claims (7, 8, 9, 10, 11, 12)
-
-
13. One or more non-transitory computer-readable media storing computer-executable instructions that, when executed by one or more processors, cause the one or more processors to perform acts comprising:
-
detecting voice presence in one or more audio signals, the one or more audio signals including the voice presence and other noise; determining a voice level associated with the voice presence in the one or more audio signals; specifying a plurality of thresholds to define multiple level ranges and corresponding gain strategies; comparing the voice level to the plurality of thresholds to identify one of multiple level ranges to which the voice level corresponds; selecting an audio gain based at least in part on the identified one of the multiple level ranges; and applying the selected audio gain to the one or more audio signals. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
Specification