Method for siren detection based on audio samples
First Claim
1. An apparatus comprising:
- an audio unit configured to receive an audio signal;
a control unit configured to operate the apparatus; and
a processing unit configured to;
process the audio signal from the audio unit to create a plurality of windowed audio samples including at least a first windowed audio sample and a second windowed audio sample, wherein the first windowed audio sample and the second windowed audio sample each have a different length of time;
determine a likelihood that the first windowed audio sample comprises a siren signal based on a detection of a group of features in the first windowed audio sample associated with a siren-classification profile;
based on the first windowed audio sample indicating a likelihood of a siren signal below a threshold, determine a likelihood that the second windowed audio sample includes a siren signal based on a detection of a group of features of the second windowed audio sample with the siren-classification profile; and
alter control of the apparatus by the control unit based on the likelihood of at least one of the first windowed audio sample and the second windowed audio sample including a siren signal being above the threshold.
3 Assignments
0 Petitions
Accused Products
Abstract
The present disclosure provides methods and apparatuses that enable an apparatus to identify sounds from short samples of audio. The apparatus may capture an audio sample and create several audio signals of different lengths, each containing audio from the captured audio sample. The apparatus my process the several audio signals in an attempt to identify features of the audio signal that indicate an identification of the captured sound. Because shorter audio samples can be analyzed more quickly, the system may first process the shortest audio samples in order to quickly identify features of the audio signal. Because longer audio samples contain more information, the system may be able to more accurately identify features in the audio signal in longer audio samples. However, analyzing longer audio signals takes more buffered audio than identifying features in shorter signals. Therefore, the present system attempts to identify features in the shortest audio signals first.
33 Citations
22 Claims
-
1. An apparatus comprising:
-
an audio unit configured to receive an audio signal; a control unit configured to operate the apparatus; and a processing unit configured to; process the audio signal from the audio unit to create a plurality of windowed audio samples including at least a first windowed audio sample and a second windowed audio sample, wherein the first windowed audio sample and the second windowed audio sample each have a different length of time; determine a likelihood that the first windowed audio sample comprises a siren signal based on a detection of a group of features in the first windowed audio sample associated with a siren-classification profile; based on the first windowed audio sample indicating a likelihood of a siren signal below a threshold, determine a likelihood that the second windowed audio sample includes a siren signal based on a detection of a group of features of the second windowed audio sample with the siren-classification profile; and alter control of the apparatus by the control unit based on the likelihood of at least one of the first windowed audio sample and the second windowed audio sample including a siren signal being above the threshold. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method comprising:
-
receiving an audio signal with an audio unit; processing, with a processor, the audio signal from the audio unit to create a plurality of windowed audio samples including at least a first windowed audio sample and a second windowed audio sample, wherein the first windowed audio sample and the second windowed audio sample each have a different length of time; determining a likelihood that the first windowed audio sample comprises a siren signal based on the detection of a group of features of the first windowed audio sample; based on the first windowed audio sample indicating a likelihood of the first windowed audio sample including a siren signal below a threshold, determining a likelihood that the second windowed audio sample comprises a siren signal based on the detection of a group of features of the second windowed audio sample; and providing instructions to control an apparatus based on the likelihood of at least one of the first windowed audio sample and the second windowed audio sample including a siren signal being above the threshold. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A non-transitory computer-readable medium having stored thereon program instructions that when executed by a computing system that includes at least one processor cause the computing system to perform operations comprising:
-
receiving an audio signal; processing the audio signal to create a plurality of windowed audio samples including at least a first windowed audio sample and a second windowed audio sample, wherein the first windowed audio sample has a first length of time and the second windowed audio sample has a second length of time longer than the first length of time; determining a likelihood that the first windowed audio sample comprises a siren signal based on the detection of a group of features of the first windowed audio sample; based on the first windowed audio sample indicating a low likelihood of the first windowed audio sample including a siren signal, determining a likelihood that the second windowed audio sample comprises a siren signal based on the detection of a group of features of the second windowed audio sample; and providing instructions to control an apparatus based on the likelihood of at least one of the first windowed audio sample and the second windowed audio sample including a siren signal being above a threshold. - View Dependent Claims (18, 19, 20, 21, 22)
-
Specification