Video summarisation can relieve the pressure on video storage, transmission, archiving, and retrieval caused by explosive growth of online videos in recent years. Most existing supervised methods use convolutional neural network (CNN) or recurrent (RNN) to model temporal dependencies between frames shots. CNN mainly focuses local information, RNN loses long-term information when input sequence ...