Common video-based object detectors exploit temporal contextual information to improve the performance of detection. However, detecting objects under challenging conditions has not been thoroughly studied yet. In this paper, we focus on improving detection for events such as aspect ratio change, occlusion, or large motion. To end, propose a video network using event-aware ConvLSTM and relation ...