Human–Object Interaction (HOI) detection is important to human-centric scene understanding tasks. Existing works tend assume that the same verb has similar visual characteristics in different HOI categories, an approach ignores diverse semantic meanings of verb. To address this issue, paper, we propose a novel Polysemy Deciphering Network (PD-Net) decodes polysemy verbs for three distinct ways....