Depth map generation using motion cues for conversion of monoscopic visual content to stereoscopic 3D
Abstract
An image converter identifies a subset of frames in a two-dimensional video and determines a global camera motion value for the subset of frames. The image converter also determines a dense motion value for a plurality of pixels in the subset of frames and compares the global camera motion value and the dense motion value to calculate a rough depth map for the subset of frames. The image converter further interpolates, based on the rough depth map, a depth value for each of the plurality of pixels in the subset of frames and renders a three-dimensional video from the subset of frames using the depth value for each of the plurality of pixels.
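The comparison of global camera motion against dense per-pixel motion can be pictured as a vector subtraction. The following is a minimal sketch under stated assumptions, not the patented implementation: it assumes the dense motion is an optical-flow field of shape (H, W, 2) and that the global camera motion is a single translation vector. The function name `local_motion_magnitude` is hypothetical.

```python
import numpy as np

def local_motion_magnitude(dense_flow, global_motion):
    """Per-pixel motion left over after removing camera motion.

    dense_flow:    (H, W, 2) array of per-pixel (dx, dy) displacements.
    global_motion: length-2 (dx, dy) camera translation. This is an
                   assumption; a real system might fit an affine or
                   homography model instead.
    """
    dense_flow = np.asarray(dense_flow, dtype=float)
    residual = dense_flow - np.asarray(global_motion, dtype=float)
    return np.linalg.norm(residual, axis=-1)

# Toy example: the camera pans 2 px right; one pixel moves an extra (3, 4).
flow = np.tile(np.array([2.0, 0.0]), (4, 4, 1))
flow[1, 2] += (3.0, 4.0)
local = local_motion_magnitude(flow, (2.0, 0.0))
```

In this toy frame only the independently moving pixel has a nonzero local motion value; every pixel that merely follows the camera pan comes out as zero.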
20 Claims
1. A method comprising:
identifying a subset of frames in a two-dimensional video, the subset of frames comprising moving objects and static objects;
determining a global camera motion value for the subset of frames;
determining a dense motion value for a plurality of pixels in the subset of frames;
comparing, by a processing device, the global camera motion value and the dense motion value to calculate a local motion value for each of the plurality of pixels;
determining, for each of the plurality of pixels, whether a local motion value of a respective pixel satisfies a first threshold condition or a second threshold condition;
responsive to the local motion value of the respective pixel satisfying the first threshold condition, assigning a first value to represent the local motion value of the respective pixel, and responsive to the local motion value of the respective pixel satisfying the second threshold condition, assigning a second value to represent the local motion value of the respective pixel, wherein the first value indicates that the respective pixel is associated with one of the moving objects, and the second value indicates that a corresponding pixel is associated with one of the static objects, wherein the assigning of the first and second values results in the plurality of pixels each being assigned either the first value or the second value;
generating a rough depth map for the subset of frames using assigned first and second values of the plurality of pixels and locations of the plurality of pixels in the subset of the frames;
interpolating, based on the rough depth map, a depth value for each of the plurality of pixels in the subset of frames; and
rendering a three-dimensional video from the subset of frames using the depth value for each of the plurality of pixels.
Dependent claims: 2, 3, 4, 5, 6, 7
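The thresholding and rough depth map steps of claim 1 might be sketched as follows. This is an illustration only: the claim does not give the threshold conditions, the first and second values, or how pixel locations enter the depth map. The constants `MOVING` and `STATIC`, the single cutoff, and the "lower in the frame is nearer" position prior are all assumptions, and both function names are hypothetical.

```python
import numpy as np

MOVING, STATIC = 1, 0  # hypothetical "first value" and "second value"

def label_pixels(local_motion, threshold=1.0):
    """Assign MOVING where local motion exceeds the threshold, else STATIC.

    Using one cutoff for both threshold conditions is an assumption.
    """
    return np.where(np.asarray(local_motion) > threshold, MOVING, STATIC)

def rough_depth_map(labels):
    """Combine labels with pixel row position into a rough depth in [0, 1].

    Assumed heuristic (not from the claims): 1.0 is nearest; moving pixels
    are nearest, and static pixels get nearer toward the bottom of the frame.
    """
    h, w = labels.shape
    rows = np.linspace(0.0, 0.5, h)[:, None]  # top row 0.0, bottom row 0.5
    position_prior = np.broadcast_to(rows, (h, w))
    return np.where(labels == MOVING, 1.0, position_prior)

# Toy local-motion field with a single moving pixel.
local = np.zeros((3, 4))
local[1, 2] = 5.0
labels = label_pixels(local)
depth = rough_depth_map(labels)
```

Every pixel ends up with exactly one of the two labels, which matches the claim's requirement that each pixel be assigned either the first value or the second value.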
8. A non-transitory machine-readable storage medium storing instructions which, when executed, cause a data processing system to perform a method comprising:
identifying a subset of frames in a two-dimensional video, the subset of frames comprising moving objects and static objects;
determining a global camera motion value for the subset of frames;
determining a dense motion value for a plurality of pixels in the subset of frames;
comparing, by a processing device, the global camera motion value and the dense motion value to calculate a local motion value for each of the plurality of pixels;
determining, for each of the plurality of pixels, whether a local motion value of a respective pixel satisfies a first threshold condition or a second threshold condition;
responsive to the local motion value of the respective pixel satisfying the first threshold condition, assigning a first value to represent the local motion value of the respective pixel, and responsive to the local motion value of the respective pixel satisfying the second threshold condition, assigning a second value to represent the local motion value of the respective pixel, wherein the first value indicates that the respective pixel is associated with one of the moving objects, and the second value indicates that a corresponding pixel is associated with one of the static objects, wherein the assigning of the first and second values results in the plurality of pixels each being assigned either the first value or the second value;
generating a rough depth map for the subset of frames using assigned first and second values of the plurality of pixels and locations of the plurality of pixels in the subset of the frames;
interpolating, based on the rough depth map, a depth value for each of the plurality of pixels in the subset of frames; and
rendering a three-dimensional video from the subset of frames using the depth value for each of the plurality of pixels.
Dependent claims: 9, 10, 11, 12, 13, 14
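The claims do not say which interpolation method produces the per-pixel depth values. As one possible reading, the sketch below assumes the rough depth map is available on a coarse grid and bilinearly interpolates it to full frame resolution. The function name `interpolate_depth` and the coarse-grid assumption are hypothetical.

```python
import numpy as np

def interpolate_depth(rough, out_h, out_w):
    """Bilinearly interpolate a coarse depth grid to (out_h, out_w)."""
    rough = np.asarray(rough, dtype=float)
    h, w = rough.shape
    # Fractional source coordinates for every output row and column.
    ys = np.linspace(0, h - 1, out_h)
    xs = np.linspace(0, w - 1, out_w)
    y0 = np.floor(ys).astype(int)
    x0 = np.floor(xs).astype(int)
    y1 = np.minimum(y0 + 1, h - 1)
    x1 = np.minimum(x0 + 1, w - 1)
    wy = (ys - y0)[:, None]  # vertical blend weights
    wx = (xs - x0)[None, :]  # horizontal blend weights
    # Blend horizontally along the upper and lower source rows, then vertically.
    top = rough[y0][:, x0] * (1 - wx) + rough[y0][:, x1] * wx
    bottom = rough[y1][:, x0] * (1 - wx) + rough[y1][:, x1] * wx
    return top * (1 - wy) + bottom * wy

coarse = np.array([[0.0, 1.0],
                   [1.0, 2.0]])
dense = interpolate_depth(coarse, 3, 3)
```

Other choices, such as edge-aware filtering guided by the color image, would fit the claim language equally well; bilinear interpolation is used here only because it is short and easy to verify.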
15. A system comprising:
a processing device; and
a memory coupled to the processing device; and
an image converter, executable by the processing device from the memory, to:
identify a subset of frames in a two-dimensional video, the subset of frames comprising moving objects and static objects;
determine a global camera motion value for the subset of frames;
determine a dense motion value for a plurality of pixels in the subset of frames;
compare the global camera motion value and the dense motion value to calculate a local motion value for each of the plurality of pixels;
determine, for each of the plurality of pixels, whether a local motion value of a respective pixel satisfies a first threshold condition or a second threshold condition;
responsive to the local motion value of the respective pixel satisfying the first threshold condition, assign a first value to represent the local motion value of the respective pixel, and responsive to the local motion value of the respective pixel satisfying the second threshold condition, assign a second value to represent the local motion value of the respective pixel, wherein the first value indicates that the respective pixel is associated with one of the moving objects, and the second value indicates that a corresponding pixel is associated with one of the static objects, wherein the assigning of the first and second values results in the plurality of pixels each being assigned either the first value or the second value;
generate a rough depth map for the subset of frames using assigned first and second values of the plurality of pixels and locations of the plurality of pixels in the subset of the frames;
interpolate, based on the rough depth map, a depth value for each of the plurality of pixels in the subset of frames; and
render a three-dimensional video from the subset of frames using the depth value for each of the plurality of pixels.
Dependent claims: 16, 17, 18, 19, 20
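The final rendering step recited in the claims is commonly approached with depth-image-based rendering, in which a second-eye view is synthesized by shifting pixels horizontally according to depth. The sketch below is an assumption about how that could look, not the method the patent specifies: the depth convention (1.0 is nearest), the `max_disparity` parameter, the crude hole handling, and the function name `render_right_view` are all hypothetical.

```python
import numpy as np

def render_right_view(frame, depth, max_disparity=2):
    """Synthesize a second-eye view by shifting pixels horizontally.

    frame: (H, W) or (H, W, C) image used as the left view.
    depth: (H, W) values in [0, 1], where 1.0 means nearest (assumed).
    Holes left by the shift keep the original pixel, a crude assumption.
    """
    frame = np.asarray(frame)
    depth = np.clip(np.asarray(depth, dtype=float), 0.0, 1.0)
    disparity = np.rint(depth * max_disparity).astype(int)
    right = frame.copy()
    # Paint far-to-near so nearer pixels overwrite farther ones.
    for d in range(max_disparity + 1):
        ys, xs = np.nonzero(disparity == d)
        new_xs = xs - d              # nearer pixels shift further left
        keep = new_xs >= 0           # drop pixels pushed off the frame
        right[ys[keep], new_xs[keep]] = frame[ys[keep], xs[keep]]
    return right

frame = np.arange(12).reshape(3, 4)
depth = np.zeros((3, 4))
depth[1, 2] = 1.0                    # one near pixel
right = render_right_view(frame, depth)
```

Pairing the original frame (left eye) with the synthesized frame (right eye) gives one stereoscopic frame; a production renderer would also need proper hole filling where shifted pixels expose background.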
Specification