Sound event detection
First Claim
1. An environmental data monitoring and reporting system, comprising:
- a device sensor that detects sound in an area and generates an audio signal based on the detected sound;
a device processor communicatively coupled to the device sensor, wherein the device processor is configured to convert the audio signal received from the device sensor into low-resolution audio signal data comprising a plurality of low-resolution feature vectors representative of the detected sound, and to analyze the low-resolution audio signal data, at the device processor level, to identify the detected sound as one of either a sound related to area human or pet occupancy, or a sound generated by a source other than the area human or pet occupancy, and provide a communication regarding the detected area human or pet occupancy-related sound; and
a device communication interface communicatively coupled to the device processor, wherein the device communication interface is configured to send the communication regarding the detected area human or pet occupancy-related sound,wherein the device sensor, device processor and device communication interface are integrated into a single premises management device, andwherein the device processor is configured to;
implement a Fast Fourier Transform element to perform a frequency domain conversion of the audio signal;
implement a plurality of bandwidth filters, a plurality of median filters, a plurality of range filters, and a plurality of summers, to extract the low-resolution feature vectors that distinguish the detected sound;
implement a state classifier element to determine state transition conditions by comparing the low-resolution feature vectors to threshold values that distinguish sound categories and generate outputs indicating occurrences of the distinguished sound categories; and
implement a detector element to detect an occurrence of a sound category indicating the area human or pet occupancy and generate a user message in response.
2 Assignments
0 Petitions
Accused Products
Abstract
A system and method for the use of sensors and processors of existing, distributed systems, operating individually or in cooperation with other systems, networks or cloud-based services to enhance the detection and classification of sound events in an environment (e.g., a home), while having low computational complexity. The system and method provides functions where the most relevant features that help in discriminating sounds are extracted from an audio signal and then classified depending on whether the extracted features correspond to a sound event that should result in a communication to a user. Threshold values and other variables can be determined by training on audio signals of known sounds in defined environments, and implemented to distinguish human and pet sounds from other sounds, and compensate for variations in the magnitude of the audio signal, different sizes and reverberation characteristics of the environment, and variations in microphone responses.
37 Citations
17 Claims
-
1. An environmental data monitoring and reporting system, comprising:
-
a device sensor that detects sound in an area and generates an audio signal based on the detected sound; a device processor communicatively coupled to the device sensor, wherein the device processor is configured to convert the audio signal received from the device sensor into low-resolution audio signal data comprising a plurality of low-resolution feature vectors representative of the detected sound, and to analyze the low-resolution audio signal data, at the device processor level, to identify the detected sound as one of either a sound related to area human or pet occupancy, or a sound generated by a source other than the area human or pet occupancy, and provide a communication regarding the detected area human or pet occupancy-related sound; and a device communication interface communicatively coupled to the device processor, wherein the device communication interface is configured to send the communication regarding the detected area human or pet occupancy-related sound, wherein the device sensor, device processor and device communication interface are integrated into a single premises management device, and wherein the device processor is configured to; implement a Fast Fourier Transform element to perform a frequency domain conversion of the audio signal; implement a plurality of bandwidth filters, a plurality of median filters, a plurality of range filters, and a plurality of summers, to extract the low-resolution feature vectors that distinguish the detected sound; implement a state classifier element to determine state transition conditions by comparing the low-resolution feature vectors to threshold values that distinguish sound categories and generate outputs indicating occurrences of the distinguished sound categories; and implement a detector element to detect an occurrence of a sound category indicating the area human or pet occupancy and generate a user message in response. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. An environmental data monitoring and reporting system, comprising:
-
a device sensor, comprising a microphone, that detects a condition comprising one or more sounds in an area and generates an audio signal based on the detected condition; a device processor communicatively coupled to the device sensor, wherein the device processor is configured to receive the audio signal and convert the audio signal received from the device sensor into low-resolution signal data comprising a plurality of low-resolution feature vectors representative of the one or more sounds in the area and to analyze the low-resolution signal data, at the device processor level, by; implementing a Fast Fourier Transform element, a plurality of bandwidth filters, a plurality of median filters, a plurality of range filters, and a plurality of summers, to perform a frequency domain conversion of the audio signal and extract the low-resolution feature vectors that distinguish detected conditions, implementing a state classifier element to compare the low-resolution feature vectors to threshold values that distinguish condition categories, generating outputs indicating occurrences of the distinguished condition categories, and implementing a detector element to detect one of the distinguished condition categories, which represents one of either a sound related to an area human or pet occupancy, or a sound generated by a source other than the area human or pet occupancy, and generate a user message in response; and a device communication interface communicatively coupled to the device processor, wherein the device communication interface is configured to send the user message regarding the detected area human or pet occupancy-related condition, wherein the device sensor, device processor and device communication interface are integrated into a single premises management device. - View Dependent Claims (11, 12, 13, 14)
-
-
15. A method for controlling an environmental data monitoring and reporting system, comprising:
-
detecting sound in an area and generating an audio signal based on the detected sound; converting the audio signal into low-resolution audio signal data comprising a plurality of low-resolution feature vectors representative of the sound in the area, and analyzing the low-resolution audio signal data, at a device processor level, to identify the detected sound as one of either a sound related to area human or pet occupancy, or a sound generated by a source other than the area human or pet occupancy, and provide a communication regarding the detected area human or pet occupancy-related sound; and sending the communication regarding the detected area human or pet occupancy-related sound, wherein the detecting step, converting-step, analyzing step and sending are performed by a single premises management device, wherein the converting comprises performing a frequency domain conversion of the audio signal using a Fast Fourier Transform and extracting the low-resolution feature vectors that distinguish detected sounds, where the extracting is performed using a plurality of bandwidth filters, a plurality of median filters, a plurality of range filters, and a plurality of summers, to extract the low-resolution feature vectors, and the analyzing step comprises determining state transition conditions by comparing the low-resolution feature vectors to threshold values that distinguish sound categories and generating outputs indicating occurrences of the distinguished sound categories. - View Dependent Claims (16, 17)
-
Specification