Since the fully convolutional network has achieved great success in semantic segmentation, lots of works have been proposed to extract discriminative pixel representations. However, authors observe that existing methods still suffer from two typical challenges: (i) The intra-class feature variation between different scenes may be large, leading difficulty maintaining consistency same-class pixe...