Mobile speech recognition hardware accelerator
First Claim
1. A mobile computing device, comprising:
a processor configured to execute a speech recognition application that uses a multi-layered neural network as an acoustic model; and
a hardware accelerator comprising:
circuitry configured to receive matrix data representing one or more frames of an audio signal as input data for a first layer of the neural network;
a multiplier-accumulator (MAC) unit configured to:
multiply the received matrix data representing one or more frames of the audio signal with a weight matrix;
add a bias matrix to the multiplication results; and
accumulate the addition results;
circuitry configured to pass the accumulated results through an activation function to generate an output matrix representing an output of the first layer of the neural network for the frame; and
a data transceiver configured to receive and decode weights and bias terms data, the data transceiver including:
a decompression unit configured to:
decompress compressed weight and bias terms data, and
double buffer decompressed weights and bias terms data to allow for parallel decompression and MAC unit operations.
2 Assignments
0 Petitions
Abstract
A method for executing a mobile speech recognition software application based on a multi-layer neural network model includes using a hardware accelerator in the mobile device to classify one or more frames of an audio signal. The hardware accelerator includes a multiplier-accumulator (MAC) unit to perform matrix multiplication operations involved in computing the neural network output.
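As an illustration only, the layer computation that the abstract assigns to the MAC unit (and that claim 1 recites as multiply, add bias, accumulate, then apply an activation function) can be sketched in software. This is not the patented circuit. The names layer_forward and relu are hypothetical, the ReLU activation is an assumption because the claims do not name a specific function, and the bias is modelled as one value per output that is shared by every frame. The claims also do not say how the bias addition and the accumulation are scheduled; the sketch seeds the accumulator with the bias term, which yields the same sum.

```python
from typing import Callable, List

Matrix = List[List[float]]


def relu(value: float) -> float:
    """Assumed activation function; the claims do not name one."""
    return value if value > 0.0 else 0.0


def layer_forward(frames: Matrix, weights: Matrix, bias: List[float],
                  activation: Callable[[float], float] = relu) -> Matrix:
    """Compute activation(frames x weights + bias) one MAC step at a time.

    frames:  n_frames x n_in, one row of features per audio frame
    weights: n_in x n_out
    bias:    n_out values, applied to every frame
    """
    n_in, n_out = len(weights), len(bias)
    output: Matrix = []
    for frame in frames:
        row = []
        for j in range(n_out):
            acc = bias[j]  # seed the accumulator with the bias term
            for k in range(n_in):
                acc += frame[k] * weights[k][j]  # multiply, then accumulate
            row.append(activation(acc))
        output.append(row)
    return output


if __name__ == "__main__":
    frames = [[1.0, 2.0]]                # one frame with two features
    weights = [[1.0, -1.0], [0.5, 0.0]]  # two inputs, two outputs
    bias = [0.5, 0.0]
    print(layer_forward(frames, weights, bias))
```

Passing a different callable as activation (for example a sigmoid) changes only the final step; the multiply-accumulate loop is unaffected.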
50 Citations
31 Claims
1. A mobile computing device, comprising:
a processor configured to execute a speech recognition application that uses a multi-layered neural network as an acoustic model; and
a hardware accelerator comprising:
circuitry configured to receive matrix data representing one or more frames of an audio signal as input data for a first layer of the neural network;
a multiplier-accumulator (MAC) unit configured to:
multiply the received matrix data representing one or more frames of the audio signal with a weight matrix;
add a bias matrix to the multiplication results; and
accumulate the addition results;
circuitry configured to pass the accumulated results through an activation function to generate an output matrix representing an output of the first layer of the neural network for the frame; and
a data transceiver configured to receive and decode weights and bias terms data, the data transceiver including:
a decompression unit configured to:
decompress compressed weight and bias terms data, and
double buffer decompressed weights and bias terms data to allow for parallel decompression and MAC unit operations.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A method for executing a speech recognition software application on a mobile device, the method comprising:
utilizing a hardware accelerator in the mobile device to perform neural network calculations to classify an audio signal, wherein utilizing the hardware accelerator includes:
sending matrix data representing one or more frames of an audio signal as input data for a first layer of a neural network to the hardware accelerator;
using a multiplier-accumulator (MAC) unit in the hardware accelerator to:
multiply the received matrix data representing one or more frames of the audio signal with a weight matrix;
add a bias matrix to the multiplication results; and
accumulate the addition results; and
using circuitry in the hardware accelerator to pass the accumulated results through an activation function to generate an output matrix representing an output of the first layer of the neural network;
receive and decode weights and bias terms data;
decompress compressed weight and bias terms data; and
double buffer decompressed weights and bias terms data to allow for parallel decompression and MAC unit operations.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19, 20
21. A hardware accelerator configured to compute a multi-layered neural network of a mobile speech recognition application, the hardware accelerator comprising:
circuitry configured to receive matrix data representing one or more frames of an audio signal as input data for a first layer of the neural network;
a multiplier-accumulator (MAC) unit comprising:
circuitry configured to multiply the matrix data representing one or more frames of the audio signal with a weight matrix;
circuitry configured to add a bias matrix to the multiplication results;
circuitry configured to accumulate the addition results;
circuitry to pass the accumulated results through an activation function to generate an output matrix representing an output of the first layer of the neural network for the frame; and
a data transceiver configured to receive and decode weights and bias terms data, the data transceiver including:
a decompression unit configured to:
decompress compressed weight and bias terms data, and
double buffer decompressed weights and bias terms data to allow for parallel decompression and MAC unit operations.
Dependent claims: 22, 23, 24, 25, 26, 27, 28, 29, 30, 31
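The double buffering recited at the end of each independent claim can be illustrated with a software analogy: while the multiply-accumulate step consumes weights from one buffer, the next block of compressed weights is decompressed into the other. The sketch below rests on several assumptions that are not in the claims: zlib stands in for the unspecified compression scheme, each compressed block holds one weight vector, a Python thread stands in for the decompression unit, and the names pack_weights, unpack_weights, and run_double_buffered are hypothetical. It shows the scheduling only and makes no claim about hardware timing.

```python
import struct
import threading
import zlib
from typing import List, Optional, Sequence


def pack_weights(values: Sequence[float]) -> bytes:
    """Serialize weights as little-endian doubles and compress them (zlib assumed)."""
    return zlib.compress(struct.pack(f"<{len(values)}d", *values))


def unpack_weights(blob: bytes) -> List[float]:
    """Decompress one block back into a list of weights."""
    raw = zlib.decompress(blob)
    return list(struct.unpack(f"<{len(raw) // 8}d", raw))


def run_double_buffered(blocks: Sequence[bytes], x: Sequence[float]) -> List[float]:
    """Return the dot product of x with each weight block.

    Block i + 1 is decompressed into the idle buffer while the MAC loop
    is still reading block i from the active buffer.
    """
    if not blocks:
        return []
    buffers: List[Optional[List[float]]] = [None, None]
    buffers[0] = unpack_weights(blocks[0])  # prime the first buffer
    results = []
    for i in range(len(blocks)):
        active, idle = i % 2, (i + 1) % 2
        worker = None
        if i + 1 < len(blocks):
            def fill(slot: int = idle, blob: bytes = blocks[i + 1]) -> None:
                buffers[slot] = unpack_weights(blob)
            worker = threading.Thread(target=fill)
            worker.start()  # decompression proceeds in parallel
        acc = 0.0
        for x_k, w_k in zip(x, buffers[active]):
            acc += x_k * w_k  # MAC on the active buffer
        results.append(acc)
        if worker is not None:
            worker.join()  # the idle buffer is ready before the swap
    return results


if __name__ == "__main__":
    blocks = [pack_weights([1.0, 2.0]),
              pack_weights([0.5, -1.0]),
              pack_weights([0.0, 3.0])]
    print(run_double_buffered(blocks, [2.0, 4.0]))
```

Two buffers suffice because the decompressor only ever writes to the buffer the MAC loop is not reading; joining the worker before the next iteration is what makes the swap safe.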
Specification