Robotic task instructions often involve a referred object that the robot must locate (ground) within environment. While intent understanding is an essential part of natural language understanding, less effort made to resolve ambiguity may arise while grounding task. Existing works use vision-based and detection, suitable for fixed view static robot. However, problem magnifies mobile robot, wher...