Video frame interpolation using a convolutional neural network
First Claim
1. A video processing system comprising:
- a computing platform including a display, a hardware processor, and a system memory;
a frame interpolation software code stored in the system memory, the frame interpolation software code including a convolutional neural network (CNN) trained using a loss function having an image loss term summed with a phase loss term, the CNN having a plurality of convolutional processing blocks including a first subset of the plurality of convolutional processing blocks trained independently of a second subset of the plurality of convolutional processing blocks;
the hardware processor configured to execute the frame interpolation software code to;
receive a first video frame including a first image and a second video frame including a second image, the first and second video frames being consecutive;
decompose the first and second images to produce respective first and second image decompositions;
use the CNN to determine an intermediate image decomposition based on the first and second image decompositions, the intermediate image decomposition corresponding to an interpolated video frame for insertion between the first and second video frames;
synthesize the interpolated video frame based on the intermediate image decomposition; and
render a video sequence including the interpolated video frame inserted between the first and second video frames on the display.
2 Assignments
0 Petitions
Accused Products
Abstract
According to one implementation, a video processing system includes a computing platform having a hardware processor and a system memory storing a frame interpolation software code, the frame interpolation software code including a convolutional neural network (CNN) trained using a loss function having an image loss term summed with a phase loss term. The hardware processor executes the frame interpolation software code to receive first and second consecutive video frames including respective first and second images, and to decompose the first and second images to produce respective first and second image decompositions. The hardware processor further executes the frame interpolation software code to use the CNN to determine an intermediate image decomposition corresponding to an interpolated video frame for insertion between the first and second video frames based on the first and second image decompositions, and to synthesize the interpolated video frame based on the intermediate image decomposition.
-
Citations
18 Claims
-
1. A video processing system comprising:
-
a computing platform including a display, a hardware processor, and a system memory; a frame interpolation software code stored in the system memory, the frame interpolation software code including a convolutional neural network (CNN) trained using a loss function having an image loss term summed with a phase loss term, the CNN having a plurality of convolutional processing blocks including a first subset of the plurality of convolutional processing blocks trained independently of a second subset of the plurality of convolutional processing blocks; the hardware processor configured to execute the frame interpolation software code to; receive a first video frame including a first image and a second video frame including a second image, the first and second video frames being consecutive; decompose the first and second images to produce respective first and second image decompositions; use the CNN to determine an intermediate image decomposition based on the first and second image decompositions, the intermediate image decomposition corresponding to an interpolated video frame for insertion between the first and second video frames; synthesize the interpolated video frame based on the intermediate image decomposition; and render a video sequence including the interpolated video frame inserted between the first and second video frames on the display. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method for use by a video processing system including a display, a computing platform having a hardware processor, and a system memory storing a frame interpolation software code including a convolutional neural network (CNN) trained using a loss function having an image loss term summed with a phase loss term, the CNN having a plurality of convolutional processing blocks including a first subset of the plurality of convolutional processing blocks trained independently of a second subset of the plurality of convolutional processing blocks, the method comprising:
-
receiving, using the hardware processor, a first video frame including a first image and a second video frame including a second image, the first and second video frames being consecutive; decomposing, using the hardware processor, the first and second images to produce respective first and second image decompositions; using the hardware processor and the CNN to determine an intermediate image decomposition based on the first and second image decompositions, the intermediate image decomposition corresponding to an interpolated video frame for insertion between the first and second video frames; synthesizing, using the hardware processor, the interpolated video frame based on the intermediate image decomposition; and rendering, using the hardware processor, a video sequence including the interpolated video frame inserted between the first and second video frames on the display. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
Specification