Triggering video surveillance using embedded voice, speech, or sound recognition
First Claim
1. A method comprising:
receiving, by a computer system, an audio signal captured from an area to be monitored via video surveillance;
recognizing, by the computer system via an embedded recognition component, a voice, speech phrase, or environmental sound in the audio signal;
determining, by the computer system, that the recognized voice, speech phrase, or environmental sound corresponds to a predefined trigger condition;
in response to the determining, detecting, by the computer system, whether the audio signal includes one or more aspects that characterize the audio signal as being from a pre-recorded television program or pre-recorded piece of music; and
if the one or more aspects are not detected in the audio signal:
transmitting, by the computer system, a signal to one or more video capture devices to begin video recording of the area; and
transmitting, by the computer system, an alert to a mobile device of an individual indicating that video surveillance of the area has been initiated.
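The order of steps in this claim can be illustrated with a minimal Python sketch. Nothing in it comes from the patent text itself: the names `Recognition`, `SurveillanceTrigger`, `recognize`, `looks_prerecorded`, `start_recording`, and `send_alert` are hypothetical, and the recognizer and the pre-recorded-media detector are passed in as plain functions rather than implemented. The sketch only shows the gating: the media check runs after a trigger condition has matched, and both transmissions happen only if that check finds nothing.

```python
from dataclasses import dataclass, field
from typing import Callable, Optional, Set


@dataclass
class Recognition:
    """Output of the embedded recognizer (hypothetical shape)."""
    kind: str   # "voice", "speech_phrase", or "environmental_sound"
    label: str  # e.g. a speaker ID, a phrase, or a sound class


@dataclass
class SurveillanceTrigger:
    # All four callables are assumptions standing in for real components.
    recognize: Callable[[bytes], Optional[Recognition]]
    looks_prerecorded: Callable[[bytes], bool]
    start_recording: Callable[[], None]
    send_alert: Callable[[str], None]
    # Labels that count as a predefined trigger condition.
    trigger_labels: Set[str] = field(default_factory=set)

    def handle_audio(self, audio: bytes) -> bool:
        """Process one audio signal; return True if recording was started."""
        recognition = self.recognize(audio)
        if recognition is None:
            return False  # nothing recognized in the signal

        if recognition.label not in self.trigger_labels:
            return False  # recognized, but not a trigger condition

        # Only after a trigger matches: check whether the audio looks like
        # a pre-recorded television program or piece of music.
        if self.looks_prerecorded(audio):
            return False

        # Signal the video capture device(s), then alert the mobile device.
        self.start_recording()
        self.send_alert("Video surveillance of the area has been initiated.")
        return True
```

In this sketch a caller would supply its own recognizer and detector and call `handle_audio` once per captured signal. Injecting the components as callables keeps the control flow separate from any particular recognition technique.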
1 Assignment
0 Petitions
Abstract
Techniques for automatically triggering video surveillance using embedded voice, speech, or sound recognition are provided. In one embodiment, a computer system can receive an audio signal captured from an area to be monitored via video surveillance. The computer system can further recognize, via an embedded recognition component, a voice, speech phrase, or environmental sound in the audio signal, and can determine that the recognized voice, speech phrase, or environmental sound corresponds to a predefined trigger condition. The computer system can then automatically transmit a signal to one or more video capture devices to begin video recording of the area.
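As an illustration of the step in which a recognized voice, speech phrase, or environmental sound is checked against a predefined trigger condition, the following Python sketch matches a recognition result against a list of configured conditions. It is an assumption-laden example, not the patent's method: the types `TriggerCondition` and `RecognizedEvent`, the function `match_trigger`, and the `min_confidence` threshold are all hypothetical.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class TriggerCondition:
    """A predefined condition that should start video recording."""
    kind: str                    # "voice", "speech_phrase", "environmental_sound"
    label: str                   # which voice, phrase, or sound class
    min_confidence: float = 0.0  # hypothetical threshold, not from the source


@dataclass(frozen=True)
class RecognizedEvent:
    """What the embedded recognition component reported (assumed shape)."""
    kind: str
    label: str
    confidence: float


def match_trigger(
    event: RecognizedEvent, conditions: Iterable[TriggerCondition]
) -> Optional[TriggerCondition]:
    """Return the first condition the event satisfies, or None."""
    for condition in conditions:
        same_kind = condition.kind == event.kind
        same_label = condition.label.lower() == event.label.lower()
        confident = event.confidence >= condition.min_confidence
        if same_kind and same_label and confident:
            return condition
    return None
```

A non-`None` result would correspond to the point at which the computer system transmits the signal to begin video recording.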
9 Citations
24 Claims
1. A method comprising:
receiving, by a computer system, an audio signal captured from an area to be monitored via video surveillance;
recognizing, by the computer system via an embedded recognition component, a voice, speech phrase, or environmental sound in the audio signal;
determining, by the computer system, that the recognized voice, speech phrase, or environmental sound corresponds to a predefined trigger condition;
in response to the determining, detecting, by the computer system, whether the audio signal includes one or more aspects that characterize the audio signal as being from a pre-recorded television program or pre-recorded piece of music; and
if the one or more aspects are not detected in the audio signal:
transmitting, by the computer system, a signal to one or more video capture devices to begin video recording of the area; and
transmitting, by the computer system, an alert to a mobile device of an individual indicating that video surveillance of the area has been initiated.
Dependent claims: 2, 3, 4, 5, 6, 19, 20, 21, 22, 23, 24
7. A non-transitory computer readable medium having stored thereon program code executable by a processor, the program code comprising:
code that causes the processor to receive an audio signal captured from an area to be monitored via video surveillance;
code that causes the processor to recognize, via an embedded recognition component, a voice, speech phrase, or environmental sound in the audio signal;
code that causes the processor to determine that the recognized voice, speech phrase, or environmental sound corresponds to a predefined trigger condition;
in response to the determining, code that causes the processor to detect whether the audio signal includes one or more aspects that characterize the audio signal as being from a pre-recorded television program or pre-recorded piece of music; and
if the one or more aspects are not detected in the audio signal:
code that causes the processor to transmit a signal to one or more video capture devices to begin video recording of the area; and
code that causes the processor to transmit an alert to a mobile device of an individual indicating that video surveillance of the area has been initiated.
Dependent claims: 8, 9, 10, 11, 12
13. A computer system comprising:
a processor; and
a non-transitory computer readable medium having stored thereon executable program code which, when executed by the processor, causes the processor to:
receive an audio signal captured from an area to be monitored via video surveillance;
recognize, via an embedded recognition component, a voice, speech phrase, or environmental sound in the audio signal;
determine that the recognized voice, speech phrase, or environmental sound corresponds to a predefined trigger condition;
in response to the determining, detect whether the audio signal includes one or more aspects that characterize the audio signal as being from a pre-recorded television program or pre-recorded piece of music; and
if the one or more aspects are not detected in the audio signal:
transmit a signal to one or more video capture devices to begin video recording of the area; and
transmit an alert to a mobile device of an individual indicating that video surveillance of the area has been initiated.
Dependent claims: 14, 15, 16, 17, 18
Specification