Speech enhancement employing deep neural networks (DNNs) for denoising is called noise suppression (DNS). The DNS trained with mean squared error (MSE) losses cannot guarantee good perceptual quality. Perceptual evaluation of speech quality (PESQ) a widely used metric evaluating However, the original PESQ algorithm non-differentiable, therefore, directly be as optimization criterion gradient-ba...