Visual processing using temporal and spatial interpolation
First Claim
Patent Images
1. A method for upscaling at least a section of low-resolution video data using a convolutional neural network (CNN), the method comprising the steps of:
- receiving at least three consecutive frames of low-resolution video data;
inputting the at least three consecutive frames of low-resolution video data into an initial layer of the CNN;
extracting, using a plurality of hidden convolutional layers of the CNN, low-resolution features from the at least three consecutive frames of low-resolution video data; and
enhancing, using a hidden convolutional layer of the CNN, the extracted low-resolution features from the three or more consecutive frames of low-resolution video data to generate a higher-resolution target section of video data corresponding to a middle frame of the at least three consecutive frames of low-resolution video data,wherein the CNN is trained on training data including ground truth sections of video data with corresponding sequences of three or more consecutive frames of sub-sampled video data to reproduce ground truth sections of video data from the corresponding frames of sub-sampled video data.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for enhancing at least a section of lower-quality visual data using a hierarchical algorithm, the method comprising receiving at least a plurality of neighbouring sections of lower-quality visual data. A plurality of input sections from the received plurality of neighbouring sections of lower quality visual data are selected and features are extracted from those plurality of input sections of lower-quality visual data. A target section based on the extracted features from the plurality of input sections of lower-quality visual data is then enhanced.
-
Citations
40 Claims
-
1. A method for upscaling at least a section of low-resolution video data using a convolutional neural network (CNN), the method comprising the steps of:
-
receiving at least three consecutive frames of low-resolution video data; inputting the at least three consecutive frames of low-resolution video data into an initial layer of the CNN; extracting, using a plurality of hidden convolutional layers of the CNN, low-resolution features from the at least three consecutive frames of low-resolution video data; and enhancing, using a hidden convolutional layer of the CNN, the extracted low-resolution features from the three or more consecutive frames of low-resolution video data to generate a higher-resolution target section of video data corresponding to a middle frame of the at least three consecutive frames of low-resolution video data, wherein the CNN is trained on training data including ground truth sections of video data with corresponding sequences of three or more consecutive frames of sub-sampled video data to reproduce ground truth sections of video data from the corresponding frames of sub-sampled video data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 15)
-
-
14. A computer program product embodied on a non-transitory storage medium and comprising instructions that, when executed, cause a system to upscale at least a section of low-resolution video data using a CNN, by performing the steps of:
-
receiving at least three consecutive frames of low-resolution video data; inputting the at least three consecutive frames of low-resolution video data into an initial layer of the CNN; extracting, using a plurality of hidden convolutional layers of the CNN, low-resolution features from the at least three consecutive frames of low-resolution video data; and enhancing, using a hidden convolutional layer of the CNN, the extracted low-resolution features from the three or more consecutive frames of low-resolution video data to generate a higher-resolution target section of video data corresponding to a middle frame of the at least three consecutive frames of low-resolution video data, wherein the CNN is trained on training data including ground truth sections of video data with corresponding sequences of three or more consecutive frames of sub-sampled video data to reproduce ground truth sections of video data from the corresponding frames of sub-sampled video data. - View Dependent Claims (16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27)
-
-
28. A system for upscaling at least a section of low-resolution video data using a CNN, the system comprising:
-
a least one processor; memory storing instructions executable that, when executed the at least one processor, cause the system to; receive at least three consecutive frames of low-resolution video data;
inputting the at least three consecutive frames of low-resolution video data into an initial layer of the CNN;extract, using a plurality of hidden convolutional layers of the CNN, low-resolution features from the at least three consecutive frames of low-resolution video data; and enhance, using a hidden convolutional layer of the CNN, the extracted low-resolution features from the three or more consecutive frames of low-resolution video data to generate a higher-resolution target section of video data corresponding to a middle frame of the at least three consecutive frames of low-resolution video data, wherein the CNN is trained on training data including ground truth sections of video data with corresponding sequences of three or more consecutive frames of sub-sampled video data to reproduce ground truth sections of video data from the corresponding frames of sub-sampled video data. - View Dependent Claims (29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40)
-
Specification