Systems and methods for video content analysis
First Claim
1. A method comprising a processor, a memory, a video sensor, a video encoder and a transceiver, the memory including instructions stored thereon which, when executed by the processor, perform a method for generating video analytics, the method comprising:
- providing, by the processor, information representative of a sequence of images captured by the video sensor to the video encoder that is adapted to encode the information using macroblock-based video encoding to obtain a plurality of video frames, wherein the video sensor and video encoder are co-located in a first apparatus;
generating, by the processor, pixel domain video analytics metadata (VAMD) that includes video content analysis information for each of a plurality of macroblocks while encoding the information in the video encoder;
generating, by the processor, a global video analytics message applicable to a plurality of images in the sequence of images using the VAMD;
generating, by the processor, a local video analytics message applicable to a first video frame in the plurality of video frames using the VAMD; and
transmitting, by the transceiver, the plurality of video frames through a network to a second apparatus with a package comprising the VAMD and the local video analytics message or the global video analytics message,wherein the second apparatus includes a video analytics processor configured to process the package transmitted by the first apparatus.
2 Assignments
0 Petitions
Accused Products
Abstract
Video analytics systems and methods are described that typically comprise a video encoder operable to generate macroblock video analytics metadata (VAMD) from a video frame. Functional modules receive the VAMD and an encoded version of the video frame is configured to generate video analytics information related to the frame using the VAMD and the encoded video frame. The downstream decoder can use the VAMD to obtain a global motion vector related to the frame, detect and track motion of an object within the frame and monitor a line provided or found within the frame. Traversals of the line by a moving object can be detected and counted using information in the VAMD and the line may be part of a polygon that delineates an area to be monitored within the encoded frame. The VAMD can comprise macroblock level and video frame level information.
-
Citations
20 Claims
-
1. A method comprising a processor, a memory, a video sensor, a video encoder and a transceiver, the memory including instructions stored thereon which, when executed by the processor, perform a method for generating video analytics, the method comprising:
-
providing, by the processor, information representative of a sequence of images captured by the video sensor to the video encoder that is adapted to encode the information using macroblock-based video encoding to obtain a plurality of video frames, wherein the video sensor and video encoder are co-located in a first apparatus; generating, by the processor, pixel domain video analytics metadata (VAMD) that includes video content analysis information for each of a plurality of macroblocks while encoding the information in the video encoder; generating, by the processor, a global video analytics message applicable to a plurality of images in the sequence of images using the VAMD; generating, by the processor, a local video analytics message applicable to a first video frame in the plurality of video frames using the VAMD; and transmitting, by the transceiver, the plurality of video frames through a network to a second apparatus with a package comprising the VAMD and the local video analytics message or the global video analytics message, wherein the second apparatus includes a video analytics processor configured to process the package transmitted by the first apparatus. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A device comprising:
-
a camera; a video encoder; a video analytics engine; a communication interface; and a video sensor in the camera configured to capture a sequence of images; the video encoder configured to; encode the sequence of images in video frames using macroblock-based video encoding to provide encoded video frames, and generate video analytics metadata (VAMD) that includes video content analysis information for each of a plurality of macroblocks processed the sequence of images in the video frames; the video analytics engine configured to process the VAMD, and to generate one or more video analytics messages from results obtained by processing the VAMD; and the communication interface configured to transmit the encoded video frames to a video decoder of a client device, and to transmit the VAMD and the one or more video analytics messages in a layered package to a video analytics processor in the client device that is configured to generate video analytics information related to the sequence of images based on the VAMD, the one or more video analytics messages, and the encoded video frames. - View Dependent Claims (11, 12, 13, 14)
-
-
15. An apparatus comprising:
-
a camera; a video encoder configured to provide encoded video frames representative of images received from the camera using macroblock-based video encoding, wherein the video encoder is further configured to generate video analytics metadata (VAMD) that includes video content analysis information for each of a plurality of macroblocks processed while encoding the images; a video analytics engine configured to process the VAMD, and to generate a global video analytics message applicable to a plurality of images in the images received from the camera or a local video analytics message applicable to one of the encoded video frames; and a communication interface adapted to transmit the encoded video frames to a video decoder of a client device, and to transmit the VAMD, global video analytics message or the local video analytics message in a layered package to a video analytics processor in the client device that is configured to generate video analytics information related to the images received from the camera based on the VAMD, the global video analytics message or the local video analytics message, and the encoded video frames. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification