System for recognizing and responding to environmental noises
First Claim
Patent Images
1. A method comprising:
- under control of one or more computing systems configured with executable instructions,receiving, from a device, over a network, audio data representing non-conversational noise and speech;
analyzing the first audio data with respect to sound data, wherein the sound data is associated with a category of noise;
determining that at least a portion of the first audio data corresponds to the sound data;
associating the first audio data with one or more instructions, the instructions to cause a specific response from the device and based, at least in part, on the category;
receiving, from the device, over the network, second audio data that represents sound detected by the device;
determining that the second audio data represents the non-conversational noise based, at least in part, on the analyzing of the second audio data with respect to the sound data, wherein the at least the portion of the first audio data that corresponds to the sound data is different than the second audio data that represents the non-conversational noise; and
sending, over the network, the instructions to the device at least partly in response to the determining that the second audio data represents the non-conversational noise.
2 Assignments
0 Petitions
Accused Products
Abstract
An audio controlled assistant captures environmental noise and converts the environmental noise into audio signals. The audio signals are provided to a system which analyzes the audio signals for a plurality of audio prompts, which have been customized for the acoustic environment surrounding the audio controlled assistant by an acoustic modeling system. The system configured to detect the presence of an audio prompt in the audio signals and transmit instructions associated with the detected audio prompt to at least one of the audio controlled assistant or one or more cloud based services, in response.
-
Citations
24 Claims
-
1. A method comprising:
-
under control of one or more computing systems configured with executable instructions, receiving, from a device, over a network, audio data representing non-conversational noise and speech; analyzing the first audio data with respect to sound data, wherein the sound data is associated with a category of noise; determining that at least a portion of the first audio data corresponds to the sound data; associating the first audio data with one or more instructions, the instructions to cause a specific response from the device and based, at least in part, on the category; receiving, from the device, over the network, second audio data that represents sound detected by the device; determining that the second audio data represents the non-conversational noise based, at least in part, on the analyzing of the second audio data with respect to the sound data, wherein the at least the portion of the first audio data that corresponds to the sound data is different than the second audio data that represents the non-conversational noise; and sending, over the network, the instructions to the device at least partly in response to the determining that the second audio data represents the non-conversational noise. - View Dependent Claims (2, 3, 4)
-
-
5. A device comprising:
-
one or more microphones to; generate first audio data based, at least in part, on first sound detected from an environment in which the device is located, the first audio data representing a non-conversational sound; and generate second audio data based, at least in part, on second sound detected from the environment, the second audio data representing the non-conversational sound; and one or more communication interfaces to; send the first audio data to one or more remote systems; send the second audio data to the one more remote systems; receive instructions from the one or more remote systems, wherein the instructions are based at least in part on the non-conversational sound as represented by the second audio data and the first audio data; and send, based at least in part on receiving the instructions, a control signal to at least one second device within the environment. - View Dependent Claims (6, 7)
-
-
8. A system comprising:
-
one or more processors; and one or more computer-readable media storing instructions that, when executed by the one or more processors, cause the one or more processors to perform acts comprising; receiving, from an electronic device, audio data that represents sound detected from an environment, the audio data including a first data portion representative of non-conversational noise and a second data portion representative of speech, the non-conversational noise being sound other than speech; identifying the first data portion representative of non-conversational noise within the audio data; analyzing the first data portion representative of the non-conversational noise using sound data; and storing the first data portion that represents the non-conversational noise based, at least in part, on a similarity threshold between acoustic characteristics of the first data portion representative of the non-conversational noise and the sound data. - View Dependent Claims (9, 10, 11, 12)
-
-
13. A system comprising:
-
one or more processors; and one or more non-transitory computer readable storage media storing instructions that, when executed on the one or more processors, cause the one or more processors to performs acts comprising; receiving, from a first device, first audio data representing a non-conversational noise and speech; associating a portion of the first audio data representative of the non-conversational noise with instructions, the instructions to cause specific responses from the device; receiving, from the device, second audio data representing sound detected by the device; separating the second audio data into a first portion of the second audio data and a second portion of the second audio data, the first portion of the second audio data representing first sound associated with speech and the second portion of the second audio data representing second sound associated with non-conversational noises; detecting, based at least in part on the first audio data, that the second portion of the second audio data represents at least the non-conversational noise; and sending the instructions to a second device, the instructions to cause the second device to perform one or more actions associated with the non-conversational noise. - View Dependent Claims (14, 15, 16, 17)
-
-
18. A method comprising:
-
receiving, from a remote device, audio data representing sound detected from an environment associated with an audio-controlled assistant, the audio data including a first data portion representative of non-speech-related noise and a second data portion representative of speech, the first data portion different than the second data portion; identifying, from the audio data, the first data portion representative of the non-speech-related noise; determining that the first data portion occurs more than a threshold number of times within the audio data; selecting instructions to associate with the first data portion based at least in part on the first data portion occurring more than the threshold number of times and at least in part on a characteristic of the non-speech related noise; associating the first data portion with the instructions to cause the audio-controlled assistant to perform a specific action in response to a future occurrence of the first data portion; and storing the first data portion. - View Dependent Claims (19, 20, 21, 22, 23, 24)
-
Specification