DEEP NEURAL NETWORK PARTITIONING ON SERVERS
First Claim
1. A method for implementing a deep neural network on a server component that comprises a host component including a CPU and a hardware acceleration component coupled to the host component, the deep neural network comprising a plurality of layers, the method comprising:
partitioning the deep neural network into a first segment and a second segment, the first segment comprising a first subset of the plurality of layers, the second segment comprising a second subset of the plurality of layers;
configuring the host component to implement the first segment; and
configuring the hardware acceleration component to implement the second segment.
Abstract
A method is provided for implementing a deep neural network on a server component that includes a host component including a CPU and a hardware acceleration component coupled to the host component. The deep neural network includes a plurality of layers. The method includes partitioning the deep neural network into a first segment and a second segment, the first segment including a first subset of the plurality of layers, the second segment including a second subset of the plurality of layers, configuring the host component to implement the first segment, and configuring the hardware acceleration component to implement the second segment.
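The partitioning described in the abstract can be illustrated with a minimal Python sketch. Everything named here (`Layer`, `partition`, `run_segment`, the toy layers, and the `split` index) is hypothetical and not taken from the patent. The sketch also assumes that the two subsets are contiguous and that the host segment runs before the accelerator segment, which the abstract does not require. Both segments execute on the CPU; the comments only mark which component each segment stands in for.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

Vector = List[float]


@dataclass
class Layer:
    """A toy layer: a name plus a function from vector to vector."""
    name: str
    fn: Callable[[Vector], Vector]


def partition(layers: Sequence[Layer], split: int) -> Tuple[List[Layer], List[Layer]]:
    """Split the layer list into a first and a second segment at index `split`.

    Assumption: each segment is a contiguous, non-empty run of layers.
    """
    if not 0 < split < len(layers):
        raise ValueError("split must leave at least one layer in each segment")
    return list(layers[:split]), list(layers[split:])


def run_segment(segment: Sequence[Layer], x: Vector) -> Vector:
    """Apply the layers of one segment in order."""
    for layer in segment:
        x = layer.fn(x)
    return x


layers = [
    Layer("scale", lambda v: [2.0 * a for a in v]),
    Layer("shift", lambda v: [a + 1.0 for a in v]),
    Layer("relu", lambda v: [max(0.0, a) for a in v]),
    Layer("sum", lambda v: [sum(v)]),
]

# First segment -> host component (CPU); second segment -> acceleration component.
host_segment, accel_segment = partition(layers, split=2)

hidden = run_segment(host_segment, [1.0, -2.0])  # stands in for the host
output = run_segment(accel_segment, hidden)      # stands in for the accelerator
print(output)
```

In a real deployment the second call would hand `hidden` to the hardware acceleration component instead of running it locally; the sketch only shows how the layer list is divided.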
20 Claims
1. A method for implementing a deep neural network on a server component that comprises a host component including a CPU and a hardware acceleration component coupled to the host component, the deep neural network comprising a plurality of layers, the method comprising:
partitioning the deep neural network into a first segment and a second segment, the first segment comprising a first subset of the plurality of layers, the second segment comprising a second subset of the plurality of layers;
configuring the host component to implement the first segment; and
configuring the hardware acceleration component to implement the second segment.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A server component configured to implement a deep neural network comprising a plurality of layers, the server component comprising:
a host component comprising a CPU;
a hardware acceleration component coupled to the host component;
a controller component configured to:
partition the deep neural network into a first segment and a second segment, the first segment comprising a first subset of the plurality of layers, the second segment comprising a second subset of the plurality of layers;
configure the host component to implement the first segment; and
configure the hardware acceleration component to implement the second segment.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18
19. A method for implementing a deep neural network on a server component that comprises a host component including a CPU and a hardware acceleration component coupled to the host component, the deep neural network comprising a plurality of linear layers and a plurality of convolutional layers, the method comprising:
configuring the host component to implement the linear layers; and
configuring the hardware acceleration component to implement the convolutional layers.
Dependent claims: 20
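The type-based assignment in claim 19 can be sketched in Python as follows. The function `assign_by_kind`, the `"linear"` and `"conv"` labels, and the example network are hypothetical names chosen for illustration; the claim states only that linear layers go to the host component and convolutional layers to the hardware acceleration component. The sketch records the placement and does not execute any layer.

```python
from typing import Dict, List, Sequence, Tuple

# (layer name, layer kind); the kind labels are an assumption of this sketch.
LayerSpec = Tuple[str, str]


def assign_by_kind(layers: Sequence[LayerSpec]) -> Dict[str, List[str]]:
    """Place linear layers on the host and convolutional layers on the accelerator."""
    placement: Dict[str, List[str]] = {"host": [], "accelerator": []}
    for name, kind in layers:
        if kind == "linear":
            placement["host"].append(name)
        elif kind == "conv":
            placement["accelerator"].append(name)
        else:
            # Claim 19 covers only these two layer kinds, so anything else is rejected.
            raise ValueError(f"unsupported layer kind: {kind!r}")
    return placement


network = [
    ("conv1", "conv"),
    ("conv2", "conv"),
    ("fc1", "linear"),
    ("fc2", "linear"),
]

placement = assign_by_kind(network)
print(placement)
```

Unlike a split at a single index, this placement depends only on each layer's kind, so it still applies when linear and convolutional layers are interleaved.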
Specification