A transformer generative adversarial network for multi-track music generation

被引:35
|
作者
Jin, Cong [1 ]
Wang, Tao [2 ]
Li, Xiaobing [3 ]
Tie, Chu Jie Jiessie [4 ]
Tie, Yun [2 ,3 ]
Liu, Shan [5 ]
Yan, Ming [1 ]
Li, Yongzhi [6 ]
Wang, Junxian [7 ]
Huang, Shenze [7 ]
机构
[1] Commun Univ China, Sch Informat & Commun Engn, Beijing, Peoples R China
[2] Zhengzhou Univ, Sch Informat & Engn, Zhengzhou, Peoples R China
[3] Cent Conservatory Mus, Beijing, Peoples R China
[4] Univ Toronto, Dept Comp Sci, Toronto, ON, Canada
[5] Commun Univ China, Sch Data Sci & Media Intelligence, Beijing, Peoples R China
[6] South China Univ Technol, Sch Light Ind & Engn, Guangzhou, Peoples R China
[7] Commun Univ China, Sch Animat & Digital Arts, Beijing, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
artificial intelligence; deep learning; multimedia;
D O I
10.1049/cit2.12065
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study proposes a new generation network based on transformers and guided by the music theory to produce high-quality music work. In this study, the decoding block of the transformer is used to learn the internal information of single-track music, and cross-track transformers are used to learn the information amongst the tracks of different musical instruments. A reward network based on the music theory is proposed, which optimizes the global and local loss objective functions while training and discriminating the network so that the reward network can provide a reliable adjustment method for the generation of the network. The method of combining the reward network and cross entropy loss is used to guide the training of the generator and produce high-quality music work. Compared with other multi-track music generation models, the experimental results verify the validity of the model.
引用
收藏
页码:369 / 380
页数:12
相关论文
共 50 条
  • [1] MuseGAN: Multi-Track Sequential Generative Adversarial Networks for Symbolic Music Generation and Accompaniment
    Dong, Hao-Wen
    Hsiao, Wen-Yi
    Yang, Li-Chia
    Yang, Yi-Hsuan
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 34 - 41
  • [2] A Transformer-Based Model for Multi-Track Music Generation
    Jin, Cong
    Wang, Tao
    Liu, Shouxun
    Tie, Yun
    Li, Jianguang
    Li, Xiaobing
    Lui, Simon
    INTERNATIONAL JOURNAL OF MULTIMEDIA DATA ENGINEERING & MANAGEMENT, 2020, 11 (03): : 36 - 54
  • [3] Literature survey of multi-track music generation model based on generative confrontation network in intelligent composition
    Liu, Weiming
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (06): : 6560 - 6582
  • [4] Literature survey of multi-track music generation model based on generative confrontation network in intelligent composition
    Weiming Liu
    The Journal of Supercomputing, 2023, 79 : 6560 - 6582
  • [5] Multi-Track Music Generation Based on the AC Algorithm and Global Value Return Network
    Guo, Wei
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (03) : 456 - 465
  • [6] Multi-category MIDI music generation based on LSTM Generative adversarial network
    Wang, Yutian
    Yu, Guochen
    Cai, Juanjuan
    Wang, Hui
    2019 2ND INTERNATIONAL CONFERENCE ON MECHANICAL, ELECTRONIC AND ENGINEERING TECHNOLOGY (MEET 2019), 2019, : 20 - 25
  • [7] The Analysis of Multi-Track Music Generation With Deep Learning Models in Music Production Process
    Jiang, Rong
    Mou, Xiaofei
    IEEE ACCESS, 2024, 12 : 110322 - 110330
  • [8] Calliope: A Co-creative Interface for Multi-Track Music Generation
    Tchemeube, Renaud Bougueng
    Ens, J.
    Pasquier, P.
    PROCEEDINGS OF THE 14TH CREATIVITY AND COGNITION, C&C 2022, 2022, : 608 - 611
  • [9] DiffuseRoll: multi-track multi-attribute music generation based on diffusion model
    Hongfei Wang
    Yi Zou
    Haonan Cheng
    Long Ye
    Multimedia Systems, 2024, 30
  • [10] DiffuseRoll: multi-track multi-attribute music generation based on diffusion model
    Wang, Hongfei
    Zou, Yi
    Cheng, Haonan
    Ye, Long
    MULTIMEDIA SYSTEMS, 2024, 30 (01)