N-view synthesis from monocular video of certain broadcast and stored mass media content
First Claim
Patent Images
1. An image processing method for use on a data processing device, the method comprising the acts of:
- receiving at least one monocular video input image Ik;
segmenting at least one foreground object from the input image Ik;
wherein the act of segmenting at least one foreground object from the input image further comprises;
applying a homography transformation Hk to the at least one monocular video input image Ik to create at least one transformed image Jk;
combining the at least one transformed images Jk to create a mosaic M;
applying a median filter to the multiple values at each pixel of said mosaic M to derive a median value at each of said pixels in said mosaic M;
applying an inverse homography transformation Hk−
1 to said mosaic M to derive at least one background image Bk;
comparing the at least one background image Bk with the at least one input image Ik to create at least one mask image Mk;
extracting those pixels from the monocular input image Ik that are set to one in the mask image M; and
setting the remaining pixels in the monocular input image Ik not set to one at said extracting act in the mask image M to black resulting in the identification of said at least one foreground object from the input image Ik;
applying a respective left (TLm) and right (TRm) transformation to each segmented foreground object and a respective left (HL) and right (HR) background transformation to the background, for each of a plurality of output images;
combining the respective left transformation (TLm) corresponding to each segmented foreground object with the respective left background transformation (HL) corresponding to the background to generate a left view Lk for each of said plurality of output images;
combining the respective right transformation (TRm) corresponding to each segmented foreground object with the respective right background transformation (HR) corresponding to the background to generate a right view Rk for each of said plurality of output images; and
deriving the plurality of output images from the results of the respective transformations.
3 Assignments
0 Petitions
Accused Products
Abstract
A monocular input image is transformed to give it an enhanced three dimensional appearance by creating at least two output images. Foreground and background objects are segmented in the input image and transformed differently from each other, so that the foreground objects appear to stand out from the background. Given a sequence of input images, the foreground objects will appear to move differently from the background objects in the output images.
44 Citations
30 Claims
-
1. An image processing method for use on a data processing device, the method comprising the acts of:
-
receiving at least one monocular video input image Ik; segmenting at least one foreground object from the input image Ik; wherein the act of segmenting at least one foreground object from the input image further comprises; applying a homography transformation Hk to the at least one monocular video input image Ik to create at least one transformed image Jk; combining the at least one transformed images Jk to create a mosaic M; applying a median filter to the multiple values at each pixel of said mosaic M to derive a median value at each of said pixels in said mosaic M; applying an inverse homography transformation Hk−
1 to said mosaic M to derive at least one background image Bk;comparing the at least one background image Bk with the at least one input image Ik to create at least one mask image Mk; extracting those pixels from the monocular input image Ik that are set to one in the mask image M; and setting the remaining pixels in the monocular input image Ik not set to one at said extracting act in the mask image M to black resulting in the identification of said at least one foreground object from the input image Ik; applying a respective left (TLm) and right (TRm) transformation to each segmented foreground object and a respective left (HL) and right (HR) background transformation to the background, for each of a plurality of output images; combining the respective left transformation (TLm) corresponding to each segmented foreground object with the respective left background transformation (HL) corresponding to the background to generate a left view Lk for each of said plurality of output images; combining the respective right transformation (TRm) corresponding to each segmented foreground object with the respective right background transformation (HR) corresponding to the background to generate a right view Rk for each of said plurality of output images; and deriving the plurality of output images from the results of the respective transformations. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. An image processing device comprising
an input for receiving at least one monocular video input image; at least one processor adapted to perform the following operations segmenting at least one foreground object from the input image; wherein the operation of segmenting at least one foreground object from the input image, further comprises; applying a homography transformation Hk to the at least one monocular video input image Ik to create at least one transformed image Jk; combining the at least one transformed images Jk to create a mosaic M; applying a median filter to the multiple values at each pixel of said mosaic M to derive a median value at each of said pixels in said mosaic M; applying an inverse homography transformation Hk−
1 to said mosaic M to derive at least one background image Bk;comparing the at least one background image Bk with the at least one input image Ik to create at least one mask image Mk; extracting those pixels from the monocular input image Ik that are set to one in the mask image M; and setting the remaining pixels in the monocular input image Ik not set to one at said extracting act in the mask image M to black resulting in the identification of said at least one foreground object from the input image Ik; applying a respective left (TLm) and right (TRm) transformation to each segmented foreground object and a respective left (HL) and right (HR) background transformation to the background, for each of the plurality of output images; combining the respective left transformation (TLm) corresponding to each segmented foreground object with the respective left background transformation (HL) corresponding to the background to generate a left view Lk for each of said plurality of output images; combining the respective right transformation (TRm) corresponding to each segmented foreground object with the respective right background transformation (HR) corresponding to the background to generate a right view Rk for each of said plurality of output images; and deriving the plurality of output images from the results of the respective transformations. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20, 27)
-
21. At least one medium readable by a data processing device and embodying code for causing execution of the following operations:
-
receiving at least one monocular video input image; wherein the operation of segmenting at least one foreground object from the input image, further comprises; applying a homography transformation Hk to the at least one monocular video input image Ik to create at least one transformed image Jk; combining the at least one transformed images Jk to create a mosaic M; applying a median filter to the multiple values at each pixel of said mosaic M to derive a median value at each of said pixels in said mosaic M; applying an inverse homography transformation Hk−
1 to said mosaic M to derive at least one background image Bk;comparing the at least one background image Bk with the at least one input image Ik to create at least one mask image Mk; extracting those pixels from the monocular input image Ik that are set to one in the mask image M; and setting the remaining pixels in the monocular input image Ik not set to one at said extracting act in the mask image M to black resulting in the identification of said at least one foreground object from the input image Ik; segmenting at least one foreground object from the input image applying a respective left (TLm) and right (TRm) transformation to each segmented foreground object and a respective left (HL) and right (HR) background transformation to the background, for each of the plurality of output images; combining the respective left transformation (TLm) corresponding to each segmented foreground object with the respective left background transformation (HL) corresponding to the background to generate a left view Lk for each of said plurality of output images; combining the respective right transformation (TRm) corresponding to each segmented foreground object with the respective right background transformation (HR) corresponding to the background to generate a right view Rk for each of said plurality of output images. - View Dependent Claims (22, 23, 24, 25, 26, 28, 29, 30)
-
Specification