Compensation for speaker nonlinearities
First Claim
Patent Images
1. An audio system comprising:
- a playback device comprising a speaker, the playback device disposed at a first location; and
a network microphone device disposed at a second location, the network microphone device being displaceable relative to the playback device, the network microphone device comprising;
a microphone;
a processor; and
memory storing instructions executable by the processor to cause the processor to;
receive a first signal indicative of audio to be played back via the speaker of the playback device and a second signal that comprises (i) a voice input received via the microphone and (ii) at least a portion of the audio played by the speaker of the playback device at a same time that the microphone receives the voice input; and
perform self-sound suppression on at least one of the first signal and the second signal, wherein performing self-sound suppression comprises;
based on the first signal, determining nonlinearities output via the speaker of the playback device by inputting a representation of the first signal into a model configured to output an indication of a frequency response that changes over time, wherein at least a portion of the frequency response is indicative of nonlinear audio effects, and wherein the nonlinear audio effects comprise an intermodulation distortion; and
removing at least a portion of the determined nonlinearities from the second signal to output a third signal comprising substantially the voice input received at the microphone.
4 Assignments
0 Petitions
Accused Products
Abstract
A first signal may be received indicative of audio to be played by a speaker. A second signal may be received which comprises (i) a voice input received by a microphone and (ii) at least a portion of the audio played by the speaker at a same time that the microphone receives the voice input. Based on the first signal, nonlinearities output by the speaker which played the audio may be determined. At least the nonlinearities from the second signal may be removed to output a third signal comprising substantially the voice input received at the microphone.
243 Citations
16 Claims
-
1. An audio system comprising:
-
a playback device comprising a speaker, the playback device disposed at a first location; and a network microphone device disposed at a second location, the network microphone device being displaceable relative to the playback device, the network microphone device comprising; a microphone; a processor; and memory storing instructions executable by the processor to cause the processor to; receive a first signal indicative of audio to be played back via the speaker of the playback device and a second signal that comprises (i) a voice input received via the microphone and (ii) at least a portion of the audio played by the speaker of the playback device at a same time that the microphone receives the voice input; and perform self-sound suppression on at least one of the first signal and the second signal, wherein performing self-sound suppression comprises; based on the first signal, determining nonlinearities output via the speaker of the playback device by inputting a representation of the first signal into a model configured to output an indication of a frequency response that changes over time, wherein at least a portion of the frequency response is indicative of nonlinear audio effects, and wherein the nonlinear audio effects comprise an intermodulation distortion; and removing at least a portion of the determined nonlinearities from the second signal to output a third signal comprising substantially the voice input received at the microphone. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method comprising:
-
receiving a first signal indicative of audio to be played back via a speaker of a playback device disposed at a first location and a second signal that comprises (i) a voice input received via a microphone of a network microphone device disposed at a second location, the network microphone device being displaceable relative to the playback device, and (ii) at least a portion of the audio played by the speaker at a same time that the microphone receives the voice input; and performing self-sound suppression on at least one of the first signal and the second signal, wherein performing self-sound suppression comprises; based on the first signal, determining nonlinearities output via the speaker of the playback device by inputting a representation of the first signal into a model configured to output an indication of a frequency response that changes over time, wherein at least a portion of the frequency response is indicative of nonlinear audio effects, and wherein the nonlinear audio effects comprise an intermodulation distortion; and removing at least a portion of the determined nonlinearities from the second signal to output a third signal comprising substantially the voice input received at the microphone of the network microphone device. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A tangible non-transitory computer readable storage medium including instructions for execution by a processor, the instructions, when executed, cause the processor to implement a method comprising:
-
receiving a first signal indicative of audio to be played back via a speaker of a playback device disposed at a first location and a second signal that comprises (i) a voice input received via a microphone of a network microphone device disposed at a second location, the network microphone device being displaceable relative to the playback device, and (ii) at least a portion of the audio played by the speaker at a same time that the microphone receives the voice input; and performing self-sound suppression on at least one of the first signal and the second signal, wherein performing self-sound suppression comprises; based on the first signal, determining nonlinearities output via the speaker of the playback device by inputting a representation of the first signal into a model configured to output an indication of a frequency response that changes over time, wherein at least a portion of the frequency response is indicative of nonlinear audio effects, and wherein the nonlinear audio effects comprise an intermodulation distortion; and removing at least a portion of the determined nonlinearities from the second signal to output a third signal comprising substantially the voice input received at the microphone of the network microphone device. - View Dependent Claims (16)
-
Specification