Many music generation research works have achieved effective performance, while rarely combining with given videos. We propose a model two linear Transformers to generate background according video. To enhance the melodic quality of generated music, we firstly input note-related and rhythm-related features separately into each Transformer network. In particular, pay attention connection indepen...