Modern machine learning models may be susceptible to spurious correlations that hold on average but not for the atypical group of samples. To address problem, previous approaches minimize empirical worst-group risk. Despite promise, they often assume each sample belongs one and only group, which does allow expressing uncertainty in labeling. In this paper, we propose a novel framework PG-DRO, e...