Methods of image fusion for image stabilization
First Claim
1. A non-transitory program storage device, readable by a programmable control device and comprising instructions stored thereon to cause the programmable control device to:
- obtain a first image of a scene captured at a first time, the first image having a plurality of pixels;
perform a multi-resolution decomposition of the first image to generate a first multi-level pyramid representation of the first image;
obtain a second image of the scene, wherein the second image is captured at a different time from the first image, and wherein each of the plurality of pixels in the first image has a corresponding pixel in the second image;
perform a multi-resolution decomposition of the second image to generate a second multi-level pyramid representation of the second image;
generate a level in an output multi-level pyramid representation of the scene for each level in the first and second multi-level pyramid representations of the scene in accordance with instructions to cause the programmable control device to;
identify, for a group of pixels in a level of the first multi-level pyramid representation of the scene, a corresponding group of pixels in the level from the second multi-level pyramid representation of the scene;
determining a weight value associated with each group of pixels in the level from the first multi-level pyramid representation of the scene and the identified corresponding group of pixels in the level from the second multi-level pyramid representation of the scene;
perform a spatial search of the level of the second multi-level pyramid representation of the scene to find a better corresponding group of pixels when the weight value is less than or equal to a specified threshold;
determine a weight value for the better corresponding group of pixels; and
fuse the identified group of pixels from the first multi-level pyramid representations of the scene with the better corresponding group of pixels from the second multi-level pyramid representation when the determined weight value for the better corresponding group of pixels is greater than the specified threshold;
repeat the instructions to cause the programmable control device to generate a level in the output multi-level pyramid representation of the scene for each level in the multi-resolution decomposition of the first image;
combine the output multi-level pyramid representations of the scene to generate a single output image representative of the scene; and
store the output image in a memory.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems, methods, and computer readable media to improve image stabilization operations are described. Novel approaches for fusing non-reference images with a pre-selected reference frame in a set of commonly captured images are disclosed. The fusing approach may use a soft transition by using a weighted average for ghost/non-ghost pixels to avoid sudden transition between neighborhood and almost similar pixels. Additionally, the ghost/non-ghost decision can be made based on a set of neighboring pixels rather than independently for each pixel. An alternative approach may involve performing a multi-resolution decomposition of all the captured images, using temporal fusion, spatio-temporal fusion, or combinations thereof, at each level and combining the different levels to generate an output image.
56 Citations
20 Claims
-
1. A non-transitory program storage device, readable by a programmable control device and comprising instructions stored thereon to cause the programmable control device to:
-
obtain a first image of a scene captured at a first time, the first image having a plurality of pixels; perform a multi-resolution decomposition of the first image to generate a first multi-level pyramid representation of the first image; obtain a second image of the scene, wherein the second image is captured at a different time from the first image, and wherein each of the plurality of pixels in the first image has a corresponding pixel in the second image; perform a multi-resolution decomposition of the second image to generate a second multi-level pyramid representation of the second image; generate a level in an output multi-level pyramid representation of the scene for each level in the first and second multi-level pyramid representations of the scene in accordance with instructions to cause the programmable control device to; identify, for a group of pixels in a level of the first multi-level pyramid representation of the scene, a corresponding group of pixels in the level from the second multi-level pyramid representation of the scene; determining a weight value associated with each group of pixels in the level from the first multi-level pyramid representation of the scene and the identified corresponding group of pixels in the level from the second multi-level pyramid representation of the scene; perform a spatial search of the level of the second multi-level pyramid representation of the scene to find a better corresponding group of pixels when the weight value is less than or equal to a specified threshold; determine a weight value for the better corresponding group of pixels; and fuse the identified group of pixels from the first multi-level pyramid representations of the scene with the better corresponding group of pixels from the second multi-level pyramid representation when the determined weight value for the better corresponding group of pixels is greater than the specified threshold; repeat the instructions to cause the programmable control device to generate a level in the output multi-level pyramid representation of the scene for each level in the multi-resolution decomposition of the first image; combine the output multi-level pyramid representations of the scene to generate a single output image representative of the scene; and store the output image in a memory. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system comprising:
-
an image capture device; memory; and one or more programmable control devices operable to interact with the image capture device and the memory, and to perform operations comprising; obtaining a first image of a scene captured at a first time, the first image having a plurality of pixels; performing a multi-resolution decomposition of the first image to generate a first multi-level pyramid representation of the first image; obtaining a second image of the scene, wherein the second image is captured at a different time from the first image, and wherein each of the plurality of pixels in the first image has a corresponding pixel in the second image; performing a multi-resolution decomposition of the second image to generate a second multi-level pyramid representation of the second image; generating a level in an output multi-level pyramid representation of the scene for each level in the first and second multi-level pyramid representations of the scene in accordance with instructions to cause the programmable control device to perform operations comprising; identifying, for a group of pixels in a level of the first multi-level pyramid representation of the scene, a corresponding group of pixels in the level from the second multi-level pyramid representation of the scene; determining a weight value associated with each group of pixels in the level from the first multi-level pyramid representation of the scene and the identified corresponding group of pixels in the level from the second multi-level pyramid representation of the scene; performing a spatial search of the level of the second multi-level pyramid representation of the scene to find a better corresponding group of pixels when the weight value is less than or equal to a specified threshold; determining a weight value for the better corresponding group pixels; and fusing the identified group of pixels from the first multi-level pyramid representations of the scene with the better corresponding group of pixels from the second multi-level pyramid representation of the scene when the determined weight value for the better corresponding group of pixels is greater than the specified threshold; combining the output multi-level pyramid representations of the scene to generate a single output image representative of the scene; and storing the output image in a memory. - View Dependent Claims (10, 11, 12, 13, 14)
-
-
15. A method comprising:
-
obtaining a first image of a scene captured at a first time, the first image having a plurality of pixels; performing a multi-resolution decomposition of the first image to generate a first multi-level pyramid representation of the first image; obtaining a second image of the scene, wherein the second image is captured at a different time from the first image, and wherein each of the plurality of pixels in the first image has a corresponding pixel in the second image; performing a multi-resolution decomposition of the second image to generate a second multi-level pyramid representation of the second image; generating a level in an output multi-level pyramid representation of the scene for each level in the first and second multi-level pyramid representations of the scene, wherein generating the level comprises; identifying, for a group of pixels in a level of the first multi-level pyramid representation of the scene, a corresponding group of pixels in the level from the second multi-level pyramid representation of the scene; determining a weight value associated with each group of pixels in the level from the first multi-level pyramid representation of the scene and the identified corresponding group of pixels in the level from the second multi-level pyramid representation of the scene; performing a spatial search of the level of the second multi-level pyramid representation of the scene to find a better corresponding group of pixels when the weight value is less than or equal to a specified threshold; determining a weight value for the better corresponding group of pixels; and fusing the identified group of pixels from the first multi-level pyramid representations of the scene with the better corresponding group of pixels from the second multi-level pyramid representation of the scene when the determined weight value for the better corresponding group of pixels is greater than the specified threshold; combining the output multi-level pyramid representations of the scene to generate a single output image representative of the scene; and storing the output image in a memory. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification