Quantized neural networks (QNNs) are widely used to achieve computationally efficient solutions recognition problems. Overall, eight-bit QNNs have almost the same accuracy as full-precision networks, but working several times faster. However, with lower quantization levels demonstrate inferior in comparison their classical analogs. To solve this issue, a number of quantization-aware training (Q...