Attribute-based audio channel arbitration
First Claim
1. A device, comprising:
one or more processors;
a speaker;
a microphone;
a network communications interface configured to communicate with a remote, network-based speech command service; and
non-transitory computer-readable media storing computer-executable instructions that, when executed by the one or more processors, cause the one or more processors to perform acts comprising:
receiving a first command from the network-based speech command service to play audio content;
receiving, from the network-based speech command service, a mixing attribute specified by the network-based speech command service and indicating whether to mix the audio content with additional audio content, the mixing attribute comprising:
first data specifying that the audio content is not to be played by the speaker at a same time as the additional audio content; or
second data specifying that the audio content is to be played by the speaker at the same time as the additional audio content;
playing the audio content on the speaker at a first volume;
receiving a second command from the network-based speech command service to play a speech message, wherein the speech message corresponds to a type of the additional audio content;
receiving an interrupt attribute from the network-based speech command service, wherein the interrupt attribute corresponds to the speech message, wherein the interrupt attribute is specified by the network-based speech command service and specifies whether the audio content is to be attenuated or paused while the speech message is played, the interrupt attribute comprising:
third data specifying that the audio content is to be attenuated while the speech message is played; or
fourth data specifying that the audio content is to be paused while the speech message is played;
playing the speech message on the speaker;
based at least in part on the mixing attribute comprising the second data and the interrupt attribute comprising the third data, lowering playback volume of the audio content to a second volume during playing of the speech message; and
based at least in part on one or more of the mixing attribute comprising the first data or the interrupt attribute comprising the fourth data, pausing the playing of the audio content during playing of the speech message.
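The last two elements of the claim amount to a two-input decision rule. The following is a minimal sketch of that rule only, not the patented implementation; the names `Mixing`, `Interrupt`, `Action`, and `arbitrate` are hypothetical and chosen for illustration.

```python
from enum import Enum


class Mixing(Enum):
    """Mixing attribute attached to the audio content (names are hypothetical)."""
    NOT_MIXABLE = "first data"   # must not play at the same time as other audio
    MIXABLE = "second data"      # may play at the same time as other audio


class Interrupt(Enum):
    """Interrupt attribute attached to the speech message (names are hypothetical)."""
    ATTENUATE = "third data"     # audio content is to be attenuated during the message
    PAUSE = "fourth data"        # audio content is to be paused during the message


class Action(Enum):
    ATTENUATE = "lower to second volume"
    PAUSE = "pause"


def arbitrate(mixing: Mixing, interrupt: Interrupt) -> Action:
    """Decide what happens to the playing audio content during the speech message."""
    # Attenuate only when the content is mixable AND the message asks for attenuation.
    if mixing is Mixing.MIXABLE and interrupt is Interrupt.ATTENUATE:
        return Action.ATTENUATE
    # Otherwise the content is not mixable, or the message asks for a pause.
    return Action.PAUSE
```

Under this reading, a non-mixable mixing attribute overrides an attenuate request: `arbitrate(Mixing.NOT_MIXABLE, Interrupt.ATTENUATE)` yields `Action.PAUSE`.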
Abstract
A speech-based system includes a local device in a user premises and a remote service that uses the local device to conduct speech dialogs with a user. The local device may also be directed to play audio such as music, audio books, etc. When designating audio for playing by the local device, the remote service may specify that the audio is either background audio or foreground audio. For background audio, the service indicates whether the background audio is mixable. For foreground audio, the service indicates an interrupt behavior. When the local device is playing background audio and receives foreground audio, the background audio is paused, attenuated, or not changed based on the indicated interrupt behavior of the foreground audio and whether the background audio has been designated as being mixable.
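The behavior described in the abstract can be pictured as a small state model on the local device. The sketch below is illustrative only and rests on several assumptions that the abstract does not state: the class and method names are hypothetical, the "not changed" outcome is modeled as a `NONE` interrupt behavior, non-mixable background audio is always paused, the attenuated volume of 0.2 is arbitrary, and the background audio is restored when the foreground audio ends.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class InterruptBehavior(Enum):
    PAUSE = "pause"
    ATTENUATE = "attenuate"
    NONE = "none"  # assumed name for the "not changed" outcome


@dataclass
class BackgroundAudio:
    name: str
    mixable: bool
    volume: float = 1.0
    paused: bool = False


class LocalDevice:
    ATTENUATED_VOLUME = 0.2  # assumed value; the abstract gives no number

    def __init__(self) -> None:
        self.background: Optional[BackgroundAudio] = None
        self._saved_volume: Optional[float] = None

    def play_background(self, name: str, mixable: bool, volume: float = 1.0) -> None:
        self.background = BackgroundAudio(name, mixable, volume)

    def start_foreground(self, behavior: InterruptBehavior) -> None:
        """Apply the foreground audio's interrupt behavior to the background audio."""
        bg = self.background
        if bg is None:
            return
        self._saved_volume = bg.volume
        if not bg.mixable or behavior is InterruptBehavior.PAUSE:
            bg.paused = True  # assumption: non-mixable background is always paused
        elif behavior is InterruptBehavior.ATTENUATE:
            bg.volume = min(bg.volume, self.ATTENUATED_VOLUME)
        # Mixable background with NONE: left unchanged.

    def end_foreground(self) -> None:
        """Restore the background audio (an assumption, not stated in the abstract)."""
        bg = self.background
        if bg is None or self._saved_volume is None:
            return
        bg.paused = False
        bg.volume = self._saved_volume
        self._saved_volume = None
```

For example, after `device.play_background("music", mixable=True, volume=0.8)`, calling `device.start_foreground(InterruptBehavior.ATTENUATE)` lowers the background volume to 0.2, and `device.end_foreground()` returns it to 0.8.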
18 Claims
1. A device, comprising:
one or more processors;
a speaker;
a microphone;
a network communications interface configured to communicate with a remote, network-based speech command service; and
non-transitory computer-readable media storing computer-executable instructions that, when executed by the one or more processors, cause the one or more processors to perform acts comprising:
receiving a first command from the network-based speech command service to play audio content;
receiving, from the network-based speech command service, a mixing attribute specified by the network-based speech command service and indicating whether to mix the audio content with additional audio content, the mixing attribute comprising:
first data specifying that the audio content is not to be played by the speaker at a same time as the additional audio content; or
second data specifying that the audio content is to be played by the speaker at the same time as the additional audio content;
playing the audio content on the speaker at a first volume;
receiving a second command from the network-based speech command service to play a speech message, wherein the speech message corresponds to a type of the additional audio content;
receiving an interrupt attribute from the network-based speech command service, wherein the interrupt attribute corresponds to the speech message, wherein the interrupt attribute is specified by the network-based speech command service and specifies whether the audio content is to be attenuated or paused while the speech message is played, the interrupt attribute comprising:
third data specifying that the audio content is to be attenuated while the speech message is played; or
fourth data specifying that the audio content is to be paused while the speech message is played;
playing the speech message on the speaker;
based at least in part on the mixing attribute comprising the second data and the interrupt attribute comprising the third data, lowering playback volume of the audio content to a second volume during playing of the speech message; and
based at least in part on one or more of the mixing attribute comprising the first data or the interrupt attribute comprising the fourth data, pausing the playing of the audio content during playing of the speech message.
Dependent claims: 2, 3, 4, 5
6. One or more non-transitory computer-readable media storing computer-executable instructions that, when executed by one or more processors, cause the one or more processors to perform acts comprising:
receiving a command from a network-based speech command service that specifies audio content;
in response to receiving the command, playing the audio content at a first volume;
receiving a speech message from the network-based speech command service;
receiving, from the network-based speech command service, a first attribute associated with the audio content, specified by the network-based speech command service, and indicating whether to mix the audio content with the speech message, the first attribute comprising:
first data specifying that the audio content is to be played while the speech message is played; or
second data specifying that the audio content is to be paused while the speech message is played;
receiving a second attribute associated with the speech message and specified by the network-based speech command service, the second attribute comprising:
third data specifying that the audio content is to be attenuated while the speech message is played; or
fourth data specifying that the audio content is to be paused while the speech message is played;
playing the speech message;
based at least in part on the first attribute comprising the first data and the second attribute comprising the third data, lowering playback volume of the audio content to a second volume while playing the speech message;
based at least in part on the first attribute comprising the first data and the second attribute comprising the fourth data, pausing the playback of the audio content while playing the speech message; and
based at least in part on the first attribute comprising the second data, pausing the playing of the audio content while playing the speech message.
Dependent claims: 7, 8, 9, 10, 11, 12, 13
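Claim 6 ties the first attribute to the audio content and the second attribute to the speech message, both supplied by the service. One way to picture this is as fields on the service's commands. The sketch below assumes a JSON wire format with hypothetical field names `mixable` and `interruptBehavior`; the claim does not specify any encoding, and `resolve` is an illustrative helper, not an API of any real service.

```python
import json


def resolve(audio_command_json: str, speech_command_json: str) -> str:
    """Return "attenuate" or "pause" for the audio content during the speech message.

    Both payload shapes and their field names are assumptions for illustration.
    """
    audio = json.loads(audio_command_json)
    speech = json.loads(speech_command_json)

    mixable = bool(audio["mixable"])       # first attribute (first or second data)
    behavior = speech["interruptBehavior"]  # second attribute (third or fourth data)
    if behavior not in ("attenuate", "pause"):
        raise ValueError(f"unknown interrupt behavior: {behavior!r}")

    # Third branch of the claim: content marked as not mixable is paused.
    if not mixable:
        return "pause"
    # First branch: mixable content plus an attenuate request lowers the volume.
    if behavior == "attenuate":
        return "attenuate"
    # Second branch: mixable content plus a pause request pauses playback.
    return "pause"
```

A call such as `resolve('{"mixable": true}', '{"interruptBehavior": "attenuate"}')` returns `"attenuate"`, while the same speech command against `'{"mixable": false}'` returns `"pause"`.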
14. A computing device comprising:
one or more processors;
a speaker; and
one or more non-transitory computer-readable media storing computer-executable instructions that, when executed by the one or more processors, cause the one or more processors to perform operations comprising:
receiving, over a network and from a network-based speech command system, a first command to output first audio content;
receiving, from the network-based speech command system, a mixing attribute specified by the network-based speech command system and indicating whether to mix the first audio content with additional audio content, the mixing attribute comprising:
first data specifying that the first audio content is not to be played by the speaker at a same time as the additional audio content; or
second data specifying that the first audio content is to be played by the speaker at the same time as the additional audio content;
playing the first audio content on the speaker at a first volume;
receiving a second command from the network-based speech command system to play second audio content, wherein the second audio content corresponds to a type of the additional audio content;
receiving an interrupt attribute from the network-based speech command system, wherein the interrupt attribute corresponds to the second audio content, wherein the interrupt attribute is specified by the network-based speech command system and specifies whether the first audio content is to be attenuated or paused while the second audio content is played, the interrupt attribute comprising:
third data specifying that the first audio content is to be attenuated while the second audio content is played; or
fourth data specifying that the first audio content is to be paused while the second audio content is played;
playing the second audio content on the speaker;
based at least in part on the mixing attribute comprising the second data and the interrupt attribute comprising the third data, lowering playback volume of the first audio content to a second volume during playing of the second audio content; and
based at least in part on one or more of the mixing attribute comprising the first data or the interrupt attribute comprising the fourth data, pausing the playing of the first audio content during playing of the second audio content.
Dependent claims: 15, 16, 17, 18
Specification