FIXED POINT NEURAL NETWORK BASED ON FLOATING POINT NEURAL NETWORK QUANTIZATION
First Claim
1. A method of quantizing a floating point machine learning network to obtain a fixed point machine learning network using a quantizer, comprising:
- determining quantizer parameters for quantizing values of the floating point machine learning network based at least in part on at least one moment of an input distribution of the floating point machine learning network to obtain corresponding values of the fixed point machine learning network.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of quantizing a floating point machine learning network to obtain a fixed point machine learning network using a quantizer may include selecting at least one moment of an input distribution of the floating point machine learning network. The method may also include determining quantizer parameters for quantizing values of the floating point machine learning network based at least in part on the at least one selected moment of the input distribution of the floating point machine learning network to obtain corresponding values of the fixed point machine learning network.
-
Citations
30 Claims
-
1. A method of quantizing a floating point machine learning network to obtain a fixed point machine learning network using a quantizer, comprising:
determining quantizer parameters for quantizing values of the floating point machine learning network based at least in part on at least one moment of an input distribution of the floating point machine learning network to obtain corresponding values of the fixed point machine learning network. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
11. A method of converting a first machine learning network to a second machine learning network, comprising:
incorporating a mean value of a distribution of activation values of the first machine learning network into a network bias of the second machine learning network. - View Dependent Claims (12, 13, 14, 15)
-
16. An apparatus for quantizing a floating point machine learning network to obtain a fixed point machine learning network using a quantizer, comprising:
-
means for selecting at least one moment of an input distribution of the floating point machine learning network; and means for determining quantizer parameters for quantizing values of the floating point machine learning network based at least in part on the at least one selected moment of the input distribution of the floating point machine learning network to obtain corresponding values of the fixed point machine learning network. - View Dependent Claims (17, 18, 19, 20)
-
-
21. An apparatus for quantizing a floating point machine learning network to obtain a fixed point machine learning network using a quantizer, comprising:
-
a memory unit; and at least one processor coupled to the memory unit, the at least one processor configured; to select at least one moment of an input distribution of the floating point machine learning network; and to determine quantizer parameters for quantizing values of the floating point machine learning network based at least in part on the at least one selected moment of the input distribution of the floating point machine learning network to obtain corresponding values of the fixed point machine learning network. - View Dependent Claims (22, 23, 24, 25)
-
-
26. A non-transitory computer-readable medium having program code recorded thereon for quantizing a floating point machine learning network to obtain a fixed point machine learning network using a quantizer, the program code being executed by a processor and comprising:
-
program code to select at least one moment of an input distribution of the floating point machine learning network; and program code to determine quantizer parameters for quantizing values of the floating point machine learning network based at least in part on the at least one selected moment of the input distribution of the floating point machine learning network to obtain corresponding values of the fixed point machine learning network. - View Dependent Claims (27, 28, 29, 30)
-
Specification