Method and apparatus of neural network based image signal processor
First Claim
Patent Images
1. An image signal processing (ISP) system, comprising:
- a neural network that receives an input image and outputs a processed image, the input image including input image data, the neural network comprising a convolutional neural network having multiple parallel paths in which each path scales the input image data to separate the input image data into separate frequency bands of spatial resolution, a first path comprising a first convolutional layer having an output at a first frequency band of spatial resolution, the output of the first convolutional layer being pooled at a second frequency band of spatial resolution that is less than the first frequency band of spatial resolution, a second path being coupled to the pooled output of the first convolutional layer and comprising a second convolutional layer having an output at the second frequency band of spatial resolution, the output of the second convolutional layer being pooled at a third frequency band of spatial resolution that is less than the second frequency band of spatial resolution, a third path being coupled to the pooled output of the second convolutional layer and comprising a third convolutional layer having an output at the third frequency band of spatial resolution, an output of the third convolutional layer being upsampled to the second frequency band of spatial resolution and concatenated with the output of the second convolutional layer to form a first intermediate output, the first intermediate output being upsampled to the first frequency band of spatial resolution and concatenated with the output of the first convolutional layer to form a second intermediate output, the second intermediate output being input to a fourth convolutional layer to output the processed image, and the neural network adjusting the input image data for at least one image-quality attribute to generate the processed image; and
an image-processing chain that receives the processed image output from the neural network, the image-processing chain providing at least one of color correction, gamma correction, edge enhancement and contrast enhancement of the processed image to output a final image,wherein the at least one adjusted image-quality attribute comprises an image size, an aspect ratio, a brightness, an intensity, a bit depth, a white value, a dynamic range, a gray level, a contouring, a smoothing, a speckle, a color space values, an interleaving, a gamma correction, a contrast enhancement, a sharpness and a demosaicing.
1 Assignment
0 Petitions
Accused Products
Abstract
An image signal processing (ISP) system is provided. The system includes a neural network trained by inputting a set of raw data images and a correlating set of desired quality output images; the neural network including an input for receiving input image data and providing processed output; wherein the processed output includes input image data that has been adjusted for at least one image quality attribute. A method and an imaging device are disclosed.
32 Citations
13 Claims
-
1. An image signal processing (ISP) system, comprising:
-
a neural network that receives an input image and outputs a processed image, the input image including input image data, the neural network comprising a convolutional neural network having multiple parallel paths in which each path scales the input image data to separate the input image data into separate frequency bands of spatial resolution, a first path comprising a first convolutional layer having an output at a first frequency band of spatial resolution, the output of the first convolutional layer being pooled at a second frequency band of spatial resolution that is less than the first frequency band of spatial resolution, a second path being coupled to the pooled output of the first convolutional layer and comprising a second convolutional layer having an output at the second frequency band of spatial resolution, the output of the second convolutional layer being pooled at a third frequency band of spatial resolution that is less than the second frequency band of spatial resolution, a third path being coupled to the pooled output of the second convolutional layer and comprising a third convolutional layer having an output at the third frequency band of spatial resolution, an output of the third convolutional layer being upsampled to the second frequency band of spatial resolution and concatenated with the output of the second convolutional layer to form a first intermediate output, the first intermediate output being upsampled to the first frequency band of spatial resolution and concatenated with the output of the first convolutional layer to form a second intermediate output, the second intermediate output being input to a fourth convolutional layer to output the processed image, and the neural network adjusting the input image data for at least one image-quality attribute to generate the processed image; and an image-processing chain that receives the processed image output from the neural network, the image-processing chain providing at least one of color correction, gamma correction, edge enhancement and contrast enhancement of the processed image to output a final image, wherein the at least one adjusted image-quality attribute comprises an image size, an aspect ratio, a brightness, an intensity, a bit depth, a white value, a dynamic range, a gray level, a contouring, a smoothing, a speckle, a color space values, an interleaving, a gamma correction, a contrast enhancement, a sharpness and a demosaicing. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method for providing an image signal processing (ISP) system, the method comprising:
-
inputting an input image into a neural network, the input image comprising input image data, and the neural network comprising a convolutional neural network having multiple parallel paths in which each path scales the input image data to separate the input image data into multiple separate frequency bands of spatial resolution, a first path comprising a first convolutional layer having an output at a first frequency band of spatial resolution, the output of the first convolutional layer being pooled at a second frequency band of spatial resolution that is less than the first frequency band of spatial resolution, a second path being coupled to the pooled output of the first convolutional layer and comprising a second convolutional layer having an output at the second frequency band of spatial resolution, the output of the second convolutional layer being pooled at a third frequency band of spatial resolution that is less than the second frequency band of spatial resolution, a third path being coupled to the pooled output of the second convolutional layer and comprising a third convolutional layer having an output at the third frequency band of spatial resolution, an output of the third convolutional layer being upsampled to the second frequency band of spatial resolution and concatenated with the output of the second convolutional layer to form a first intermediate output, the first intermediate output being upsampled to the first frequency band of spatial resolution and concatenated with the output of the first convolutional layer to form a second intermediate output, the second intermediate output being input to a fourth convolutional layer to output the processed image; adjusting the input image by the neural network to output a processed image having input image data adjusted for at least one image-quality attribute; and
,generating by an image-processing chain a final image from the processed image, the image-processing chain processing at least one of a color correction, a gamma correction, an edge enhancement and a contrast enhancement of the processed image, wherein the at least one adjusted image-quality attribute comprises an image size, an aspect ratio, a brightness, an intensity, a bit depth, a white value, a dynamic range, a gray level, a contouring, a smoothing, a speckle, a color space values, an interleaving, a gamma correction, a contrast enhancement, a sharpness and a demosaicing. - View Dependent Claims (7, 8, 9, 10)
-
-
11. An imaging device, comprising:
-
an imaging sensor that outputs raw image data; and an image signal processing (ISP) system comprising a convolutional neural network and an image-processing chain, the convolutional neural network comprising multiple parallel paths in which each path scales the raw image data to separate the raw image data into separate frequency bands of spatial resolution, a first path comprising a first convolutional layer having an output at a first frequency band of spatial resolution, the output of the first convolutional layer being pooled at a second frequency band of spatial resolution that is less than the first frequency band of spatial resolution, a second path being coupled to the pooled output of the first convolutional layer and comprising a second convolutional layer having an output at the second frequency band of spatial resolution, the output of the second convolutional layer being pooled at a third frequency band of spatial resolution that is less than the second frequency band of spatial resolution, a third path being coupled to the pooled output of the second convolutional layer and comprising a third convolutional layer having an output at the third frequency band of spatial resolution, an output of the third convolutional layer being upsampled to the second frequency band of spatial resolution and concatenated with the output of the second convolutional layer to form a first intermediate output, the first intermediate output being upsampled to the first frequency band of spatial resolution and concatenated with the output of the first convolutional layer to form a second intermediate output, the second intermediate output being input to a fourth convolutional layer to output the processed image, the convolutional neural network to receive the raw image data from the imaging sensor and to adjust the raw image data to output a processed image in which the raw image data has been adjusted for at least one image-quality attribute, and the image-processing chain to output a final image, the image-processing chain comprising at least one of a color correction, a gamma correction, an edge enhancement and a contrast enhancement of the processed image, wherein the at least one adjusted image-quality attribute comprises an image size, an aspect ratio, a brightness, an intensity, a bit depth, a white value, a dynamic range, a gray level, a contouring, a smoothing, a speckle, a color space values, an interleaving, a gamma correction, a contrast enhancement, a sharpness and a demosaicing. - View Dependent Claims (12, 13)
-
Specification