Apparatus and method for generating object-labeled image in video sequence
First Claim
1. A method for receiving a video sequence including query objects to be extracted and generating object-labeled images based on the query objects, the method comprising the steps of:
- (a) dividing the video sequence into one or more shots, each of which is a set of frames having a similar scene, and selecting one or more key frames from each of the shots;
(b) determining whether there exists an object similar to each of the query objects in each of the key frames and extracting the similar objects as corresponding query object based initial object regions from each of the key frames;
(c) for each query object, tracking object regions in all frames of only shots determined to have a respective similar object in a key frame based on the corresponding query object based initial object regions; and
(d) labeling the object regions tracked in each of the frames based on information on the corresponding query objects.
1 Assignment
0 Petitions
Accused Products
Abstract
An apparatus and method for generating object-labeled images based on query images in a video sequence are provided. A video sequence is divided into a plurality of shots, each of which consists of a set of frames having a similar scene, and an initial object region is extracted from each of the shots by determining whether an object image exists in key frames of the shots. Based on the initial object region extracted from each of the key frames, object regions are tracked in all frames of the shots. Then, the object regions are labeled to generate object-labeled images. Therefore, the object-labeled image generating apparatus and method can be applied regardless of the degree of motion of an object and time required to extract query objects is reduced.
24 Citations
9 Claims
-
1. A method for receiving a video sequence including query objects to be extracted and generating object-labeled images based on the query objects, the method comprising the steps of:
-
(a) dividing the video sequence into one or more shots, each of which is a set of frames having a similar scene, and selecting one or more key frames from each of the shots;
(b) determining whether there exists an object similar to each of the query objects in each of the key frames and extracting the similar objects as corresponding query object based initial object regions from each of the key frames;
(c) for each query object, tracking object regions in all frames of only shots determined to have a respective similar object in a key frame based on the corresponding query object based initial object regions; and
(d) labeling the object regions tracked in each of the frames based on information on the corresponding query objects. - View Dependent Claims (2, 3, 4)
-
-
5. An apparatus for receiving a video sequence including query objects to be extracted and generating object-labeled images based on the query objects, the apparatus comprising:
-
a shot and key frame setting unit for dividing the video sequence into one or more shots, each of which is a set of frames having a similar scene, and selecting one or more key frames from each of the shots;
an initial object region extractor for determining whether there exists an object similar to each of the query objects in each of the key frames and extracting the similar objects as corresponding query object based initial object regions from each of the key frames;
an object region tracker for tracking, for each query object, object regions in all frames of only shots determined to have a respective similar object in a key frame based on the corresponding query image object based initial object regions; and
an object-labeled image generator for labeling the object regions tracked in each of the frames based on information on the corresponding query objects. - View Dependent Claims (6, 7, 8)
-
-
9. A computer readable medium having embodied thereon a computer program for receiving a video sequence including query objects to be extracted and generating object-labeled images based on the query objects, wherein generating object-labeled images comprises the steps of:
-
(a) dividing the video sequence into one or more shots, each of which is a set of frames having a similar scene, and selecting one or more key frames from each of the shots;
(b) determining whether there exists an object similar to each of the query objects in each of the key frames and extracting the similar objects as corresponding query object based initial object regions from each of the key frames;
(c) for each query object, tracking object regions in all frames of only shots determined to have a respective similar object in a key frame based on the corresponding query object based initial object regions; and
(d) labeling the object regions tracked in each of the frames based on information on the corresponding query objects.
-
Specification