Echo cancellation based on shared reference signals
First Claim
Patent Images
1. A first device comprising:
- a processor;
a microphone to capture a speech utterance and an audio output from a second device and to generate an audio signal based at least in part on the speech utterance and the audio output from the second device, the audio signal including at least;
(1) a speech component corresponding to the speech utterance; and
(2) an audio component associated with the audio output from the second device, the second device being physically independent from the first device and from a source of the speech utterance;
a reference signal module configured to be operated by the processor to receive a reference signal from the second device, the reference signal corresponding to the audio output from the second device;
a signal processing module configured to be operated by the processor to process the audio signal to generate a processed audio signal by removing at least a part of the audio signal that corresponds to the reference signal; and
a speech recognition module configured to be operated by the processor to perform speech recognition on the processed audio signal or to provide the processed audio signal to another entity for performing the speech recognition, the processed audio signal substantially including the speech component.
2 Assignments
0 Petitions
Accused Products
Abstract
An audio processing system configured to generate, based at least in part on captured sound, an audio signal that includes a speech component corresponding to a user'"'"'s speech utterance and an audio component corresponding to audio output of another device is described herein. The audio processing system is also configured to receive a reference signal that corresponds to the audio output of the other device. The reference signal may be received as ultrasonic audio output of the other device or from a remote server. The audio processing device then processes the generated audio signal to remove at least a part of the generated audio signal that corresponds to the reference signal.
42 Citations
26 Claims
-
1. A first device comprising:
-
a processor; a microphone to capture a speech utterance and an audio output from a second device and to generate an audio signal based at least in part on the speech utterance and the audio output from the second device, the audio signal including at least;
(1) a speech component corresponding to the speech utterance; and
(2) an audio component associated with the audio output from the second device, the second device being physically independent from the first device and from a source of the speech utterance;a reference signal module configured to be operated by the processor to receive a reference signal from the second device, the reference signal corresponding to the audio output from the second device; a signal processing module configured to be operated by the processor to process the audio signal to generate a processed audio signal by removing at least a part of the audio signal that corresponds to the reference signal; and a speech recognition module configured to be operated by the processor to perform speech recognition on the processed audio signal or to provide the processed audio signal to another entity for performing the speech recognition, the processed audio signal substantially including the speech component. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A computer-implemented method comprising:
-
receiving, at a first device via a microphone of the first device, a speech utterance and an audio output from a second device, the second device being physically separated from the first device and the second device being physically separated from a source of the speech utterance; generating, by the first device, an audio signal including a speech component corresponding to the speech utterance and an audio component associated with the audio output from the second device; receiving, by the first device, a reference signal from the second device, the reference signal corresponding to the audio output from the second device; and processing the audio signal to generate a processed audio signal by removing at least a part of the audio signal that corresponds to the reference signal. - View Dependent Claims (7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. One or more non-transitory computer-readable media storing computer-executable instructions configured to program a first device to perform operations comprising:
-
generating an audio signal based at least in part on a speech utterance and an audio output from a second device, wherein; the second device is physically separate from the first device and the second device is physically separate from a source of the speech utterance, the speech utterance and the audio output are captured via a microphone, the audio output includes a human-audible audio output and an ultrasonic audio output including a reference signal, and the audio signal includes a human-audible sound component and an ultrasonic audio component, the human-audible sound component including a speech component corresponding to the speech utterance and an audio component corresponding to the human-audible audio output, and the ultrasonic audio component including the reference signal; processing the audio signal to generate a processed audio signal by removing at least a part of the audio component corresponding to the audio output by the second device from the audio signal based at least in part on the reference signal; and sending the processed audio signal to a remote system. - View Dependent Claims (17, 18, 19, 20, 21)
-
-
22. One or more non-transitory computer-readable media storing computer-executable instructions configured to program a first device to perform operations comprising:
-
generating an audio signal based at least in part on a speech utterance and an audio output from a second device, the speech utterance and the audio output from the second device being captured via a microphone of the first device, the audio signal including a speech component corresponding to the speech utterance and an audio component associated with the audio output from the second device, the second device being physically separate from the first device and the second device being physically separate from a source of the speech utterance; receiving, from a remote server via a wireless unit, a reference signal corresponding to the audio output from the second device; and processing the generated audio signal by removing at least a part of the audio signal that corresponds to the reference signal. - View Dependent Claims (23, 24, 25, 26)
-
Specification