Training end-to-end video processes
First Claim
Patent Images
1. A method for jointly training a pre-processing neural network and a post-processing neural network for a visual data encoding and decoding process, the method comprising:
- determining a differential approximation of the encoding and decoding process;
receiving one or more sections of input visual data;
using a pre-processing neural network to process the input visual data and to output pre-processed visual data;
applying the encoding and decoding process to the pre-processed visual data to generate decoded visual data;
using a post-processing neural network to further process the decoded visual data and to output reconstructed visual data;
comparing the reconstructed visual data with the input visual data using a metric; and
updating parameters of the pre-processing neural network and the post-processing neural network using the differential approximation of the encoding and decoding process and based on the comparing of the reconstructed visual data with the input visual data.
1 Assignment
0 Petitions
Accused Products
Abstract
Disclosed is method for training a plurality of visual processing algorithms for processing visual data. The method includes using a pre-processing hierarchical algorithm to process the visual data prior to encoding the visual data in visual data processing, and using a post-processing hierarchical algorithm to further process the visual data following decoding visual data in visual data processing. The encoding and decoding are performed with respect to a predetermined visual data codec and may be content specific.
-
Citations
17 Claims
-
1. A method for jointly training a pre-processing neural network and a post-processing neural network for a visual data encoding and decoding process, the method comprising:
-
determining a differential approximation of the encoding and decoding process; receiving one or more sections of input visual data; using a pre-processing neural network to process the input visual data and to output pre-processed visual data; applying the encoding and decoding process to the pre-processed visual data to generate decoded visual data; using a post-processing neural network to further process the decoded visual data and to output reconstructed visual data; comparing the reconstructed visual data with the input visual data using a metric; and updating parameters of the pre-processing neural network and the post-processing neural network using the differential approximation of the encoding and decoding process and based on the comparing of the reconstructed visual data with the input visual data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
Specification