Methods and Apparatus for Video Completion
First Claim
1. A method, comprising:
- obtaining an input video sequence of a scene comprising a plurality of frames and an indication of an object to be removed from the frames of the input video sequence;
tracking feature points in the input video sequence to generate feature tracks;
factoring the feature tracks into a low-dimensional subspace;
generating a prediction of background scene motion according to the low-dimensional subspace; and
processing a frame of the input video sequence as a target frame to fill a region in the target frame occluded by the indicated object with background content from unoccluded regions of one or more other frames of the input video sequence as source frames, wherein said processing the target frame is performed according to the prediction of background scene motion.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, apparatus, and computer-readable storage media for video completion that may be applied to restore missing content, for example holes or border regions, in video sequences. A video completion technique applies a subspace constraint technique that finds and tracks feature points in the video, which are used to form a model of the camera motion and to predict locations of background scene points in frames where the background is occluded. Another frame where those points were visible is found, and that frame is warped using the predicted points. A content-preserving warp technique may be used. Image consistency constraints may be applied to modify the warp so that it fills the hole seamlessly. A compositing technique is applied to composite the warped image into the hole. This process may be repeated until the missing content is filled on all frames.
199 Citations
20 Claims
-
1. A method, comprising:
-
obtaining an input video sequence of a scene comprising a plurality of frames and an indication of an object to be removed from the frames of the input video sequence; tracking feature points in the input video sequence to generate feature tracks; factoring the feature tracks into a low-dimensional subspace; generating a prediction of background scene motion according to the low-dimensional subspace; and processing a frame of the input video sequence as a target frame to fill a region in the target frame occluded by the indicated object with background content from unoccluded regions of one or more other frames of the input video sequence as source frames, wherein said processing the target frame is performed according to the prediction of background scene motion. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A system, comprising:
-
at least one processor; and a memory comprising program instructions, wherein the program instructions are executable by the at least one processor to; obtain an input video sequence of a scene comprising a plurality of frames and an indication of an object to be removed from the frames of the input video sequence; generate a prediction of background scene motion in the input video sequence; and process a frame of the input video sequence as a target frame to fill a region in the target frame occluded by the indicated object with background content from unoccluded regions of one or more other frames of the input video sequence as source frames, wherein, to process a target frame, the program instructions are executable by the at least one processor to; identify the unoccluded regions of the one or more source frames according to the prediction of background scene motion; warp each source frame to the target frame according to the prediction of background scene motion; apply one or more image consistency energy terms to improve alignment between the target frame and each warped source frame; and composite at least a portion of the unoccluded region from each warped source frame into the occluded region of the target frame. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A non-transitory computer-readable storage medium storing program instructions, wherein the program instructions are computer-executable to implement:
-
obtaining an input video sequence of a scene comprising a plurality of frames and an indication of an object to be removed from the frames of the input video sequence; generating a prediction of background scene motion in the input video sequence; and processing a frame of the input video sequence as a target frame to fill a region in the target frame occluded by the indicated object with background content from unoccluded regions of one or more other frames of the input video sequence as source frames, wherein said processing a target frame comprises; identifying the unoccluded regions of the one or more source frames according to the prediction of background scene motion; warping each source frame to the target frame according to the prediction of background scene motion; applying one or more image consistency energy terms to improve alignment between the target frame and each warped source frame; and compositing at least a portion of the unoccluded region from each warped source frame into the occluded region of the target frame. - View Dependent Claims (12, 13, 14, 15)
-
-
16. A non-transitory computer-readable storage medium storing program instructions, wherein the program instructions are computer-executable to implement:
-
obtaining an input video sequence of a scene comprising a plurality of frames, wherein the frames in the input video sequence have been cropped; tracking feature points in the input video sequence to generate feature tracks; factoring the feature tracks into a low-dimensional subspace; and generating the prediction of background scene motion according to the low-dimensional subspace; and processing each frame of the input video sequence as a target frame to fill a cropped region in the target frame with content from background regions of one or more other frames of the input video sequence as source frames, wherein said processing the target frame is performed according to the prediction of background scene motion. - View Dependent Claims (17, 18, 19, 20)
-
Specification