Generation of a stereo video from a mono video
First Claim
1. A method for generating stereo video from mono video comprising:
- identifying a shot in the mono video, the shot comprising a plurality of mono frames;
identifying a scene type for the shot, the scene type identified from one of a plurality of different scene types, each scene type associated with a different set of depth parameters, wherein each set of depth parameters comprises Knear, a numerical value representing depth scaling of a closest object in one of the mono frames to a viewer as a function of a horizontal resolution of the mono video, and Kfar, a numerical value representing depth scaling of a farthest object in one of the mono frames to the viewer as a function of the horizontal resolution of the mono video;
determining depth parameters for the shot based on the identified scene type;
selecting a first frame from the shot to use as either a left frame or a right frame in a stereo frame pair;
determining a pixel depth map for the first frame based on the scene type, the pixel depth map comprising a pixel depth for each pixel in the first frame; and
generating a second frame based on the first frame, the second frame to be used as either the right frame or the left frame so as to be another frame in the stereo frame pair, comprising;
determining a pixel shift amount for each pixel in the first frame, the pixel shift amount based on its pixel depth and the depth parameters for the shot; and
generating a pixel in the second frame by shifting a corresponding pixel in the first frame by its pixel shift amount.
2 Assignments
0 Petitions
Accused Products
Abstract
A system and methodology provide for generation of a stereo video from a mono video. A mono video is partitioned into shots, where each shot including one or more frames of the mono video. The mono video frames are used as the left frames in the stereo video. Depth parameters are determined for each shot. A pixel depth map is created for each frame of each shot. The right frames for the stereo video are created by shifting pixels from the left frames laterally to occupy new locations in the right frames. A pixel'"'"'s shift is based on the depth parameters and the pixel depth. In aggregate, pixel shifts will cause objects to appear in different locations in the right frame relative to where they appeared in the left frame. The effect upon the viewer is that the stereo video will provide an enhanced illusion of depth for the viewer.
-
Citations
21 Claims
-
1. A method for generating stereo video from mono video comprising:
-
identifying a shot in the mono video, the shot comprising a plurality of mono frames; identifying a scene type for the shot, the scene type identified from one of a plurality of different scene types, each scene type associated with a different set of depth parameters, wherein each set of depth parameters comprises Knear, a numerical value representing depth scaling of a closest object in one of the mono frames to a viewer as a function of a horizontal resolution of the mono video, and Kfar, a numerical value representing depth scaling of a farthest object in one of the mono frames to the viewer as a function of the horizontal resolution of the mono video; determining depth parameters for the shot based on the identified scene type; selecting a first frame from the shot to use as either a left frame or a right frame in a stereo frame pair; determining a pixel depth map for the first frame based on the scene type, the pixel depth map comprising a pixel depth for each pixel in the first frame; and generating a second frame based on the first frame, the second frame to be used as either the right frame or the left frame so as to be another frame in the stereo frame pair, comprising; determining a pixel shift amount for each pixel in the first frame, the pixel shift amount based on its pixel depth and the depth parameters for the shot; and generating a pixel in the second frame by shifting a corresponding pixel in the first frame by its pixel shift amount. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A system for generating stereo video from mono video comprising:
-
a shot detection module configured to determine a first plurality of frames from a shot of a mono video, each frame of the first plurality of frames to be used as either a left frame or a right frame in a stereo frame pair; a scene classification module configured to; determine depth parameters for the shot, where the depth parameters determined for the shot are one of a plurality of different sets of depth parameters, wherein each set of depth parameters comprises Knear, a numerical value representing depth scaling of a closest object in one of the first plurality of frames to a viewer as a function of a horizontal resolution of the mono video, and Kfar, a numerical value representing depth scaling of a farthest object in one of the first plurality of frames to the viewer as a function of the horizontal resolution of the mono video, and determine a pixel depth map for each frame of the first plurality of frames of the shot, the pixel depth maps comprising a pixel depth for each pixel in each frame of the first plurality of frames; and a depth image based rendering (DIBR) module configured to determine a pixel shift amount for each pixel in each frame of the first plurality of frames based on the depth parameters and the pixel depth maps; and generate a second plurality of frames based on the first plurality of frames laterally shifting the pixels of the first plurality of frames by the pixel shift amounts, each frame of the second plurality of frames to be used as either a right frame or a left frame so as to be another frame in a stereo frame pair. - View Dependent Claims (16, 17, 18, 19, 20)
-
-
21. A method for generating stereo video from mono video comprising:
-
identifying a shot in the mono video, the shot comprising a plurality of mono frames; identifying a scene type for the shot, the scene type is identified from one of a plurality of different scene types including at least an indoor scene type, an outdoor scene type, and a generic scene type, each of these scene types associated with a different set of depth parameters, wherein each set of depth parameters comprises Knear, a numerical value representing depth scaling of a closest object in one of the mono frames to a viewer as a function of a horizontal resolution of a mono video, and Kfar, a numerical value representing depth scaling of a farthest object in one of the mono frames to the viewer as a function of the horizontal resolution of the mono video; determining depth parameters for the shot based on the identified scene type; selecting a first frame from the shot, to use as either a left frame or a right frame in a stereo frame pair; determining a pixel depth map for the first frame based on the scene type, the pixel depth map comprising a pixel depth for each pixel in the first frame; and generating a second frame based on the first frame, the second frame to be used as either the right frame or the left frame so as to be another frame in the stereo frame pair comprising; determining a pixel shift amount for each pixel in the first frame, the pixel shift amount based on its pixel depth and the depth parameters for the shot; and generating a pixel in the second frame by shifting a corresponding pixel in the first frame by its pixel shift amount.
-
Specification