System and method for removing camera rotation from a panoramic video
First Claim
1. A method to remove camera rotation from a panoramic video comprising:
- obtaining a panoramic video of a scene in a three-dimensional space, wherein each panoramic frame of said panoramic video is mapped onto a sphere to form a spherically mapped panoramic frame;
- accepting reference axes that define said three-dimensional space;
- projecting each of said spherically mapped panoramic frames onto a front plane to create a forward-looking planar video;
- calculating a virtual camera rotation and a virtual camera translation, relative to said reference axes, for each planar frame of said forward-looking planar video;
- wherein said calculating said virtual camera rotation and said virtual camera translation, relative to said reference axes, for each planar frame of said forward-looking planar video comprises:
  - identifying and locating a plurality of features in one or more key frames of said forward-looking planar video, wherein each of said plurality of features has a fixed position in said three-dimensional space;
  - tracking the pixel location of each of said plurality of features across the planar frames of said forward-looking planar video to create a feature track for each of said plurality of features;
  - calculating a 3D geometry model from said feature tracks that comprises:
    - the position of each of said plurality of features in said three-dimensional space; and
    - the virtual camera rotation and the virtual camera translation, relative to said reference axes, for each planar frame of said forward-looking planar video;
  - wherein said 3D geometry model minimizes the error between:
    - the projection of said position of each of said plurality of features onto a view plane defined by said virtual camera rotation and said virtual camera translation for each planar frame of said forward-looking planar video, and
    - said pixel location of each of said plurality of features in each of said feature tracks;
  - obtaining said virtual camera rotation and said virtual camera translation from said 3D geometry model;
- applying said virtual camera rotation to said sphere in each of said spherically mapped panoramic frames to generate a rotation-corrected panoramic frame for each of said spherically mapped panoramic frames;
- combining said rotation-corrected panoramic frames to form a rotation-corrected panoramic video.
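The final steps of claim 1 (applying the virtual camera rotation to the sphere) amount to rotating every direction vector of the spherically mapped frame by the inverse of the tracked camera rotation. A minimal numpy sketch under common conventions; the equirectangular pixel layout (row = latitude, column = longitude) and the function names are illustrative assumptions, not taken from the patent:

```python
import numpy as np

def equirect_to_directions(h, w):
    """Unit direction vector on the sphere for each pixel of an
    equirectangular panoramic frame (assumed layout)."""
    lat = (0.5 - (np.arange(h) + 0.5) / h) * np.pi        # +pi/2 .. -pi/2
    lon = ((np.arange(w) + 0.5) / w - 0.5) * 2 * np.pi    # -pi .. +pi
    lon_g, lat_g = np.meshgrid(lon, lat)
    return np.stack([np.cos(lat_g) * np.sin(lon_g),
                     np.sin(lat_g),
                     np.cos(lat_g) * np.cos(lon_g)], axis=-1)

def correct_rotation(directions, R_cam):
    """Cancel the tracked camera rotation R_cam on the sphere.
    For row-vector directions, d @ R applies R.T to the corresponding
    column vector, and R.T == R^-1 for a rotation matrix, so this
    rotates every sphere direction by the inverse camera rotation."""
    return directions @ R_cam
```

Re-sampling the panoramic image along the corrected directions then yields the rotation-corrected frame.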
Abstract
Removes undesirable camera rotation from a panoramic video. Video frames are mapped to a sphere and then projected to a forward-looking planar video. Camera tracking or other techniques are used to derive camera position and orientation in each frame. From the camera orientation, a correcting rotation is derived and is applied to the sphere onto which the panoramic video was mapped. Re-projecting video from the rotation-corrected sphere reduces or eliminates camera rotation artifacts from the panoramic video. Rotation may be re-introduced and other visual effects may be added to the stabilized video. May be utilized in 2D virtual reality and augmented reality displays. Depth may be accepted by the system for objects in the video and utilized to create 3D stereoscopic virtual reality and augmented reality displays.
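The abstract notes that rotation may be re-introduced after stabilization, for example so an intentional pan survives while jitter is removed. One common way to do this, not specified by the patent, is to re-apply a low-pass-filtered version of the tracked rotation; a sketch for a single yaw angle, with the moving-average window as an assumed parameter:

```python
import numpy as np

def smoothed_reintroduced_yaw(raw_yaw, window=5):
    """Moving-average smoothing of the tracked per-frame yaw (radians).
    Applying this smoothed rotation to the fully stabilized video
    re-introduces the deliberate camera motion without the jitter."""
    pad = window // 2
    padded = np.pad(raw_yaw, pad, mode='edge')   # hold endpoints steady
    kernel = np.ones(window) / window
    return np.convolve(padded, kernel, mode='valid')
```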
90 Citations
19 Claims
1. (Claim 1 is set out in full above under "First Claim".) - View Dependent Claims (2, 3, 4, 5, 6, 7, 16, 17)
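The "projecting each of said spherically mapped panoramic frames onto a front plane" step recited throughout the claims is, in effect, a gnomonic projection of the sphere onto a plane in front of the virtual camera. A numpy sketch assuming the plane z = 1 and a hypothetical `fov_limit` cutoff for directions outside the forward field of view:

```python
import numpy as np

def sphere_to_front_plane(directions, fov_limit=0.1):
    """Gnomonic projection: map unit direction vectors on the sphere onto
    the plane z = 1 in front of the virtual camera, producing the
    forward-looking planar view used for feature tracking.
    Directions with z <= fov_limit (sideways or behind) are masked."""
    d = np.asarray(directions, dtype=float)
    valid = d[..., 2] > fov_limit
    xy = np.full(d.shape[:-1] + (2,), np.nan)
    xy[valid] = d[valid][:, :2] / d[valid][:, 2:3]
    return xy, valid
```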
8. A method for removing camera rotation from a panoramic video comprising:
- attaching a plurality of cameras to a camera mount;
- placing said camera mount in a scene in a three-dimensional space;
- capturing a plurality of videos from said plurality of cameras over a common time interval;
- translating and rotating said camera mount in said scene over said common time interval;
- defining reference axes for said three-dimensional space;
- defining a reference sphere surrounding said camera mount;
- calculating a spherical region for each of said plurality of cameras as the intersection of the field of view of each of said plurality of cameras with the surface of said reference sphere;
- selecting a sequence of frame sample times within said common time interval;
- for each of said frame sample times and for each of said plurality of cameras:
  - projecting the camera frame that was captured from said camera approximately at said frame sample times onto said spherical region for the corresponding camera, to form a spherically mapped camera frame;
  - blending said spherically mapped camera frames;
  - combining said spherically mapped camera frames to form a spherically mapped panoramic frame for said frame sample times;
- projecting each of said spherically mapped panoramic frames onto a front plane to create a forward-looking planar video;
- obtaining depths by either or both of:
  - identifying and locating a plurality of features in one or more key frames of said forward-looking planar video, wherein each of said plurality of features has a fixed position in said three-dimensional space;
    tracking the pixel location of each of said plurality of features across the planar frames of said forward-looking planar video to create a feature track for each of said plurality of features;
    calculating a 3D geometry model from said feature tracks that comprises:
    - the position of each of said plurality of features in said three-dimensional space; and
    - the virtual camera rotation and the virtual camera translation, relative to said reference axes, for each planar frame of said forward-looking planar video;
    wherein said 3D geometry model minimizes the error between the projection of said position of each of said plurality of features onto a view plane defined by said virtual camera rotation and said virtual camera translation for each planar frame of said forward-looking planar video, and said pixel location of each of said plurality of features in each of said feature tracks;
    obtaining said virtual camera rotation and said virtual camera translation from said 3D geometry model;
  - or accepting input depths to assign a plurality of features in said forward-looking planar video that comprises the position of each of said plurality of features in said three-dimensional space;
- generating a 2D presentation or 3D stereoscopic presentation to display in a virtual reality or augmented reality viewing device;
- applying said virtual camera rotation to said sphere in each of said spherically mapped panoramic frames to generate a rotation-corrected panoramic frame for each of said spherically mapped panoramic frames;
- combining said rotation-corrected panoramic frames to form a rotation-corrected panoramic video;
- generating a virtual reality environment from said 3D geometry model;
- obtaining a position and orientation of a user in said virtual reality environment;
- generating a view image for said user in said virtual reality environment by calculating an image plane for said user in said virtual environment based on said position and orientation of said user;
- projecting each of said plurality of features from said position of each of said plurality of features onto said image plane.
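The last steps of claim 8 generate a view by projecting each feature's fixed 3D position onto the image plane computed from the user's position and orientation. A minimal pinhole-projection sketch; the row-vector convention and the `focal` parameter are illustrative assumptions:

```python
import numpy as np

def project_features(points, cam_pos, R_view, focal=1.0):
    """Project fixed 3D feature positions onto the user's image plane.
    R_view rows are the viewer's right/up/forward axes in world
    coordinates, derived from the user's orientation."""
    rel = (points - cam_pos) @ R_view.T        # world -> viewer coordinates
    z = rel[:, 2]
    return focal * rel[:, :2] / z[:, None]     # pinhole projection; assumes z > 0
```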
9. A system to remove camera rotation from a panoramic video, the system configured to:
- obtain a panoramic video of a scene in a three-dimensional space, wherein each panoramic frame of said panoramic video is mapped onto a sphere to form a spherically mapped panoramic frame;
- accept reference axes that define said three-dimensional space;
- project each of said spherically mapped panoramic frames onto a front plane to create a forward-looking planar video;
- calculate a virtual camera rotation and a virtual camera translation, relative to said reference axes, for each planar frame of said forward-looking planar video;
- wherein said calculating of said virtual camera rotation and said virtual camera translation, relative to said reference axes, for each planar frame of said forward-looking planar video comprises being configured to:
  - identify and locate a plurality of features in one or more key frames of said forward-looking planar video, wherein each of said plurality of features has a fixed position in said three-dimensional space;
  - track the pixel location of each of said plurality of features across the planar frames of said forward-looking planar video to create a feature track for each of said plurality of features;
  - calculate a 3D geometry model from said feature tracks that comprises:
    - the position of each of said plurality of features in said three-dimensional space; and
    - the virtual camera rotation and the virtual camera translation, relative to said reference axes, for each planar frame of said forward-looking planar video;
  - wherein said 3D geometry model minimizes the error between:
    - the projection of said position of each of said plurality of features onto a view plane defined by said virtual camera rotation and said virtual camera translation for each planar frame of said forward-looking planar video, and
    - said pixel location of each of said plurality of features in each of said feature tracks;
  - obtain said virtual camera rotation and said virtual camera translation from said 3D geometry model;
- apply said virtual camera rotation to said sphere in each of said spherically mapped panoramic frames to generate a rotation-corrected panoramic frame for each of said spherically mapped panoramic frames;
- combine said rotation-corrected panoramic frames to form a rotation-corrected panoramic video. - View Dependent Claims (10, 11, 12, 13, 14, 15, 18, 19)
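The error that the 3D geometry model minimizes in claims 1 and 9 is the classic reprojection error of bundle adjustment: each feature's reconstructed 3D position, projected through each frame's virtual camera pose, is compared with its tracked pixel location. A numpy sketch of that objective (the optimizer itself is omitted; the pinhole model and `focal` parameter are assumptions):

```python
import numpy as np

def reprojection_error(points3d, observed_px, R_frames, t_frames, focal=1.0):
    """Total squared reprojection error over all frames and features.
    observed_px has shape (n_frames, n_points, 2): the tracked pixel
    location of every feature in every planar frame."""
    err = 0.0
    for R, t, obs in zip(R_frames, t_frames, observed_px):
        cam = (points3d - t) @ R.T               # world -> camera coordinates
        proj = focal * cam[:, :2] / cam[:, 2:3]  # project onto the view plane
        err += np.sum((proj - obs) ** 2)
    return err
```

Minimizing this quantity over the feature positions and the per-frame rotations and translations yields the virtual camera pose for every frame.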
Specification