Real-time video object generation for smart cameras
First Claim
1. An apparatus for video object generation and selective encoding, the apparatus comprising:
- a detection module for detecting a first object in at least one image frame of a series of image frames, wherein the detection module detects the first object by initializing a plurality of regions in the at least one image frame, for each initialization computes a degree of similarity between a model and a candidate object in the at least one image frame, and applies an iterative comparative procedure to the degrees of similarity computed, the iterations being based on a gradient vector to shift the location of candidate object in the at least one frame, to derive as the location of the candidate object in the at least one frame that location which has characteristics most similar to the characteristics of the model;
a tracking module for tracking the first object in successive image frames of the series of image frames and segmenting the first object from a background, the background being a second object; and
an encoder for encoding the first and second objects to be transmitted to a receiver, wherein the first object is compressed at a high compression rate and the second object is compressed at a low compression rate.
5 Assignments
0 Petitions
Accused Products
Abstract
An apparatus and method for video object generation and selective encoding is provided. The apparatus includes a detection module for detecting a first object in at least one image frame of a series of image frames; a tracking module for tracking the first object in successive image frames and segmenting the first object from a background, the background being a second object; and an encoder for encoding the first and second objects to be transmitted to a receiver, wherein the first object is compressed at a high compression rate and the second object is compressed at a low compression rate. The receiver merges the first and second object to form a composite image frame. The method provides for detecting, tracking and segmenting one or more objects, such as a face, from a background to be encoded at the same or different compression rates to conserve bandwidth.
75 Citations
35 Claims
-
1. An apparatus for video object generation and selective encoding, the apparatus comprising:
-
a detection module for detecting a first object in at least one image frame of a series of image frames, wherein the detection module detects the first object by initializing a plurality of regions in the at least one image frame, for each initialization computes a degree of similarity between a model and a candidate object in the at least one image frame, and applies an iterative comparative procedure to the degrees of similarity computed, the iterations being based on a gradient vector to shift the location of candidate object in the at least one frame, to derive as the location of the candidate object in the at least one frame that location which has characteristics most similar to the characteristics of the model; a tracking module for tracking the first object in successive image frames of the series of image frames and segmenting the first object from a background, the background being a second object; and an encoder for encoding the first and second objects to be transmitted to a receiver, wherein the first object is compressed at a high compression rate and the second object is compressed at a low compression rate. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A method for video object generation and selective encoding, the method comprising the steps of:
-
detecting a first object from at least one of a plurality of successive image frames, wherein detecting further includes initializing a plurality of regions in the at least one image frame, for each initialization, computing a degree of similarity between a model and a candidate object in the at least one image frame, and applying an iterative comparative procedure to the degrees of similarity computed, the iterations being based on a gradient vector to shift the location of candidate object in the at least one frame, to derive as the location of the candidate object in the at least one frame that location which has characteristics most similar to the characteristics of the model; tracking the first object through the plurality of image frames; segmenting the first object from a background of the image frame, the background being a second object; and encoding the first and second objects to be transmitted to a receiver, wherein the first object is compressed at a high compression rate and the second object is compressed at a low compression rate. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25)
-
-
26. A program storage device readable by a machine, tangibly embodying a program of instructions executable by the machine to perform method steps for video object generation and selective encoding, the method steps comprising:
-
detecting a first object from at least one of a plurality of successive image frames, wherein detecting further includes initializing a plurality of regions in the at least one image frame, for each initialization, computing a degree of similarity between a model and a candidate object in the at least one image frame, and applying an iterative comparative procedure to the degrees of similarity computed, the iterations being based on a gradient vector to shift the location of candidate object in the at least one frame, to derive as the location of the candidate object in the at least one frame that location which has characteristics most similar to the characteristics of the model; tracking the first object through the plurality of image frames; segmenting the first object from a background of the image frame, the background being a second object; and encoding the first and second objects to be transmitted to a receiver, wherein the first object is compressed at a high compression rate and the second object is compressed at a low compression rate.
-
-
27. A method for video object generation and selective encoding, the method comprising the steps of:
-
detecting a plurality of objects from at least one of a plurality of successive image frames, wherein detecting includes initializing multiple regions in the at least one image frame, for each initialization, computing a degree of similarity between a plurality of models and candidate objects in the at least one frame, and applying an iterative comparative procedure to the degrees of similarity computed, the iterations being based on a gradient vector to shift the location of candidate objects in the at least one frame, to derive as the location of the candidate objects in the at least one frame those locations which have characteristics most similar to the characteristics of the plurality of models; tracking the plurality of objects through the plurality of image frames; segmenting the plurality of objects from the at least one image frame; and encoding the plurality of objects to be transmitted to a receiver, wherein each of the plurality of objects is compressed at a different compression rate. - View Dependent Claims (28, 29, 30, 31, 32, 33, 34, 35)
-
Specification