Real-Time Emotion-Based Piano Music Generation Using Generative Adversarial Network (GAN)

Cited by: 0
Authors
Zheng, Lijun [1 ]
Li, Chenglong [2 ]
Affiliations
[1] Ewha Womans Univ, Sch Mus, Seoul 03760, South Korea
[2] Qiannan Normal Coll Nationalities, Conservatory Mus & Dance, Duyun 558000, Guizhou, Peoples R China
Source
IEEE Access | 2024 / Vol. 12
Keywords
Generative adversarial networks; Learning automata; Deep learning; Music; Instruments; Complexity theory; Computational modeling; Reinforcement learning; Real-time music generation; generative adversarial network; self-attention mechanism; reinforcement learning; learning automata; emotion-based music;
DOI
10.1109/ACCESS.2024.3414673
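Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract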
Automatic creation of real-time, emotion-based piano music pieces remains a challenge for deep learning models. While Generative Adversarial Networks (GANs) have shown promise, existing methods can struggle to generate musically coherent pieces and often require complex manual configuration. This paper proposes a novel model called Learning Automata-based Self-Attention Generative Adversarial Network (LA-SAGAN) to address these limitations. The proposed model combines a GAN with a Self-Attention (SA) mechanism. The benefits of using SA modules in the GAN architecture are twofold. First, the SA mechanism yields music pieces with a homogeneous structure, meaning that long-distance dependencies in the generated outputs are taken into account. Second, the SA mechanism uses the emotional features of the input to produce output pieces, so that the generated music follows the desired genre or theme. To control the complexity of the proposed model and optimize its structure, a set of Learning Automata (LA) models is used to determine the activity state of each SA module. To this end, an iterative algorithm based on cooperation among the LAs is introduced, which optimizes the model by deactivating unnecessary SA modules. The efficiency of the proposed model in generating piano music has been evaluated. The evaluations demonstrate LA-SAGAN's effectiveness: an improvement of at least 14.47% in entropy (diversity), along with improvements in precision (at least 2.47%) and recall (at least 2.13%). Moreover, human evaluation confirms superior musical coherence and adherence to emotional cues.
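The abstract describes a team of learning automata that cooperatively decide which SA modules stay active, but it does not state the update rule. The sketch below is only an illustration of that idea under loud assumptions: it uses the standard linear reward-inaction (L_R-I) scheme as a stand-in, and it replaces GAN training with a hypothetical toy score (per-module usefulness minus a fixed cost per active module). The names `TwoActionAutomaton`, `optimise_mask`, `usefulness`, and `cost` are invented for this example and do not come from the paper.

```python
import random


class TwoActionAutomaton:
    """Two-action learning automaton (active / inactive) with an L_R-I update."""

    def __init__(self, learning_rate=0.1, rng=None):
        self.p_active = 0.5                 # probability of choosing "active"
        self.learning_rate = learning_rate
        self.rng = rng or random.Random()

    def choose(self):
        """Sample an action: True means the module is kept active."""
        return self.rng.random() < self.p_active

    def reward(self, chose_active):
        """Shift probability toward the action that was just rewarded.

        Under reward-inaction, a penalty leaves the probabilities unchanged,
        so this method is simply not called in that case.
        """
        a = self.learning_rate
        if chose_active:
            self.p_active += a * (1.0 - self.p_active)
        else:
            self.p_active -= a * self.p_active


def optimise_mask(usefulness, cost=0.2, steps=500, seed=0):
    """Let one automaton per module learn an on/off state cooperatively.

    ASSUMPTION: the score is a toy proxy (sum of usefulness minus cost for
    each active module), not the paper's training objective. All automata
    share one reinforcement signal: they are rewarded together whenever the
    sampled configuration scores at least as well as the best seen so far.
    """
    rng = random.Random(seed)
    team = [TwoActionAutomaton(0.05, rng) for _ in usefulness]
    best = float("-inf")
    for _ in range(steps):
        mask = [la.choose() for la in team]
        score = sum(u - cost for u, on in zip(usefulness, mask) if on)
        if score >= best:
            best = score
            for la, on in zip(team, mask):
                la.reward(on)
    return [la.p_active for la in team]


if __name__ == "__main__":
    # Hypothetical usefulness values: modules 1 and 3 contribute nothing.
    print(optimise_mask([1.0, 0.0, 0.8, 0.0]))
```

In this toy setting, a module whose usefulness is below its cost lowers the shared score when active, so configurations that switch it off are the ones that tend to be rewarded. Whether the team settles on the best configuration depends on the learning rate and the random seed, so the sketch makes no claim about the specific probabilities it returns.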
Pages: 87489-87500
Number of pages: 12