Automatic volume attenuation for speech enabled devices
First Claim
Patent Images
1. A method of modifying operation of a device, the method comprising:
- generating an audio output via a speaker at a first volume level;
receiving a first audio input via a microphone, the first audio input including a first sound and a first portion of the audio output;
performing echo cancellation on the first audio input to remove the first portion of the audio output received by the microphone to generate an isolated audio input signal;
determining that the isolated audio input signal comprises the first sound by comparing the isolated audio input signal with at least one stored acoustic model, wherein the first sound does not comprise speech directed to the device;
in response to determining that the isolated audio input signal comprises the first sound;
reducing the audio output to a second volume level that is less than the first volume level; and
suppressing at least one notification that would otherwise have been produced;
storing the at least one notification;
receiving a voice command; and
in response to receiving the voice command;
restoring the audio output to the first volume level;
stopping the suppressing of further notifications; and
producing the at least one notification that was previously suppressed and stored.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech recognition system that also automatically recognizes and acts in response to significant audio interruptions. Received audio is compared with stored acoustic signatures of noises which may trigger a change in device operation, such as pausing, loudening or attenuating of content playback after hearing a certain audio interruption, such as a doorbell, etc. If the received audio matches a stored acoustic model, the system alters an operational state of one or more devices, which may or may not include itself.
196 Citations
22 Claims
-
1. A method of modifying operation of a device, the method comprising:
-
generating an audio output via a speaker at a first volume level; receiving a first audio input via a microphone, the first audio input including a first sound and a first portion of the audio output; performing echo cancellation on the first audio input to remove the first portion of the audio output received by the microphone to generate an isolated audio input signal; determining that the isolated audio input signal comprises the first sound by comparing the isolated audio input signal with at least one stored acoustic model, wherein the first sound does not comprise speech directed to the device; in response to determining that the isolated audio input signal comprises the first sound; reducing the audio output to a second volume level that is less than the first volume level; and suppressing at least one notification that would otherwise have been produced; storing the at least one notification; receiving a voice command; and in response to receiving the voice command; restoring the audio output to the first volume level; stopping the suppressing of further notifications; and producing the at least one notification that was previously suppressed and stored. - View Dependent Claims (2, 3)
-
-
4. A computing device, comprising:
-
at least one processor; a memory including instructions operable to be executed by the at least one processor to perform a set of actions, configuring the at least one processor; to generate an audio output at a first volume level; to receive a first audio input as an audio signal; to identify a presence of a first audio interruption by comparing the audio signal with one or more stored models; and to alter the audio output of the computing device, to store at least one notification, and to suppress the at least one notification, in response to identifying the presence of the first audio interruption; to recognize a voice command; and in response to recognizing the voice command; to restore the audio output to the first volume level; to end the suppressing of further notifications; and to deliver the at least one notification that was previously suppressed and stored. - View Dependent Claims (5, 6, 7, 8, 9, 10, 18, 19, 20)
-
-
11. A non-transitory computer-readable storage medium storing processor-executable instructions for controlling a computing device, comprising:
-
program code to generate an audio output at a first volume level; program code to receive a first audio input as an audio signal; program code to identify a presence of a first audio interruption by comparing the audio signal with one or more stored models; and program code to alter the audio output of the computing device, to store at least one notification, and to suppress the at least one notification, in response to identifying the presence of the first audio interruption; program code to recognize a voice command; and program code to, in response to recognizing the voice command; restore the audio output to the first volume level; end the suppressing of further notifications; and deliver the at least one notification that was previously suppressed and stored. - View Dependent Claims (12, 13, 14, 15, 16, 17, 21, 22)
-
Specification