Deep learning has achieved substantial improvement on single-channel speech enhancement tasks. However, the performance of multi-layer perceptions (MLPs)-based methods is limited by ability to capture long-term effective history information. The recurrent neural networks (RNNs), e.g., long short-term memory (LSTM) model, are able temporal dependencies, but come with issues high latency and comp...