Large pre-trained language models are successfully being used in a variety of tasks, across many languages. With this ever-increasing usage, the risk harmful side effects also rises, for example by reproducing and reinforcing stereotypes. However, detecting mitigating these harms is difficult to do general becomes computationally expensive when tackling multiple languages or considering differe...