DEVICE AND A METHOD FOR PROCESSING DATA SEQUENCES USING A CONVOLUTIONAL NEURAL NETWORK
First Claim
1. A device for processing data sequences comprising a convolutional neural network, whereinthe device is configured to receive an input sequence (It) comprising a plurality of data items captured over time, each of said data items comprising a multi-dimensional representation of a scene,the convolutional neural network is configured to generate an output sequence (ht) representing the input sequence processed item-wise by the convolutional neural network,the convolutional neural network comprises a sampling unit configured to generate an intermediate output sequence ({tilde over (h)}t) by sampling from a past portion of the output sequence (ht−
- 1) according to a sampling grid (Gt),the convolutional neural network is configured to generate the sampling grid (Gt) item-wise on the basis of a grid-generation sequence, wherein the grid-generation sequence is based on a combination of the input sequence (It) and an intermediate grid-generation sequence representing a past portion of the output sequence (ht−
1) or the grid-generation sequence (Ct−
1),the convolutional neural network is configured to generate the output sequence (ht) based on a weighted combination of the intermediate output sequence ({tilde over (h)}t) and the input sequence (It).
5 Assignments
0 Petitions
Accused Products
Abstract
A device for processing data sequences by means of a convolutional neural network is configured to carry out the following steps: receiving an input sequence comprising a plurality of data items captured over time using a sensor, each of said data items comprising a multi-dimensional representation of a scene, generating an output sequence representing the input sequence processed item-wise by the convolutional neural network, wherein generating the output sequence comprises: generating a grid-generation sequence based on a combination of the input sequence and an intermediate grid-generation sequence representing a past portion of the output sequence or the grid-generation sequence, generating a sampling grid on the basis of the grid-generation sequence, generating an intermediate output sequence by sampling from the past portion of the output sequence according to the sampling grid, and generating the output sequence based on a weighted combination of the intermediate output sequence and the input sequence.
-
Citations
14 Claims
-
1. A device for processing data sequences comprising a convolutional neural network, wherein
the device is configured to receive an input sequence (It) comprising a plurality of data items captured over time, each of said data items comprising a multi-dimensional representation of a scene, the convolutional neural network is configured to generate an output sequence (ht) representing the input sequence processed item-wise by the convolutional neural network, the convolutional neural network comprises a sampling unit configured to generate an intermediate output sequence ({tilde over (h)}t) by sampling from a past portion of the output sequence (ht− - 1) according to a sampling grid (Gt),
the convolutional neural network is configured to generate the sampling grid (Gt) item-wise on the basis of a grid-generation sequence, wherein the grid-generation sequence is based on a combination of the input sequence (It) and an intermediate grid-generation sequence representing a past portion of the output sequence (ht−
1) or the grid-generation sequence (Ct−
1),the convolutional neural network is configured to generate the output sequence (ht) based on a weighted combination of the intermediate output sequence ({tilde over (h)}t) and the input sequence (It). - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
or wherein the intermediate grid-generation sequence is formed by the past portion of the output sequence (ht−
1) processed with an inner convolutional neural network,or wherein the intermediate grid-generation sequence (Ct−
1) is formed by the past portion of the grid-generation sequence processed with an inner convolutional neural network.
- 1) according to a sampling grid (Gt),
-
4. The device according to claim 1,
wherein the convolutional neural network is configured to generate the sampling grid (Gt) by processing the grid-generation sequence with at least one inner convolutional neural network. -
5. The device according to claim 1, wherein
the convolutional neural network is configured to generate the output sequence (ht) by generating a first weighting sequence (ft) and a second weighting sequence (it) based on one of the input sequence (It), the intermediate output sequence ({tilde over (h)}t), the intermediate grid-generation sequence (ht− - 1, Ct−
1),the grid-generation sequence processed by an inner convolutional network ({tilde over (C)}t, Ct), generating an intermediate input sequence by processing the input sequence (It) with an inner convolutional neural network (18), weighting the intermediate output sequence ({tilde over (h)}t) with the first weighting sequence (ft), weighting the intermediate input sequence with the second weighting sequence (it), superimposing the weighted intermediate output sequence and the weighted intermediate input sequence.
- 1, Ct−
-
6. The device according to claim 5, wherein
generating the first weighting sequence (ft) and/or the second weighting sequence (it) comprises: -
forming a combination of at least two of the input sequence (It), the intermediate output sequence ({tilde over (h)}t), the intermediate grid-generation sequence (ht−
1, Ct−
1),the grid-generation sequence processed by an inner convolutional network ({tilde over (C)}t, Ct), and forming a processed combination (it, ft, zt) by processing the combination with an inner convolutional neural network (22).
-
-
7. The device according to claim 6,
wherein one of the first weighting sequence or the second weighting sequence is formed by the processed combination and wherein the other of the first weighting sequence or the second weighting sequence is formed by the processed combination (zt) subtracted from a constant. -
8. The device according to one of claim 5, wherein the convolutional neural network is configured to generate the first weighting sequence (ft) and the second weighting sequence (it) correspondingly.
-
9. The device according to claim 1,
wherein the sampling grid (Gt) comprises a plurality of sampling locations, each of the sampling locations being defined by a respective pair of an offset and one of a plurality of data points of a data item of the intermediate output sequence ({tilde over (h)}t). -
10. The device according to claim 1,
wherein each data item of the input sequence (It) comprises a plurality of data points, each data point representing a location in the scene and comprising a plurality of parameters, in particular coordinates, of the location. -
11. The device according to claim 1, wherein each data item of the input sequence (It) is formed by an image comprising a plurality of pixels.
-
12. A system for processing data sequences, the system comprising a sensor for capturing a data sequence and a device according to one of the preceding claims.
-
13. The system according to claim 12, wherein the sensor comprises at least one of a radar sensor, a light-detection-and-ranging sensor, an ultrasonic sensor, or a camera.
-
14. A method for processing data sequences by means of a convolutional neural network, the method comprising:
-
receiving an input sequence (It) comprising a plurality of data items captured over time using a sensor, each of said data items comprising a multi-dimensional representation of a scene, generating an output sequence (ht) representing the input sequence processed item-wise by the convolutional neural network, wherein generating the output sequence comprises; generating a grid-generation sequence based on a combination of the input sequence (It) and an intermediate grid-generation sequence representing a past portion of the output sequence (ht−
1) or the grid-generation sequence (Ct−
1),generating a sampling grid (Gt) on the basis of the grid-generation sequence, generating an intermediate output sequence ({tilde over (h)}t) by sampling from the past portion of the output sequence (ht−
1) according to the sampling grid (Gt), andgenerating the output sequence (ht) based on a weighted combination of the intermediate output sequence ({tilde over (h)}t) and the input sequence (It).
-
Specification