Video object detection still faces several challenges. For example, the imbalance between positive and negative samples leads to low information processing efficiency, and detection performance declines in abnormal video situations. This paper examines video object detection based on local attention to address such challenges. We propose a sequence model with optimized parameter calculation based on ConvGRU. It can process spatial and temporal vid...