Several recent works have been dedicated to unsupervised reinforcement learning in a single environment, which policy is first pre-trained with interactions, and then fine-tuned towards the optimal for several downstream supervised tasks defined over same environment. Along this line, we address problem of class multiple environments, interactions from whole class, any environment class. Notabl...