Real-Time Emotion-Based Piano Music Generation Using Generative Adversarial Network (GAN)

被引:0
|
作者
Zheng, Lijun [1 ]
Li, Chenglong [2 ]
机构
[1] Ewha Womans Univ, Sch Mus, Seoul 03760, South Korea
[2] Qiannan Normal Coll Nationalities, Conservatory Mus & Dance, Duyun 558000, Guizhou, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Generative adversarial networks; Learning automata; Deep learning; Music; Instruments; Complexity theory; Computational modeling; Reinforcement learning; Real-time music generation; generative adversarial network; self-attention mechanism; reinforcement learning; learning automata; emotion-based music;
D O I
10.1109/ACCESS.2024.3414673
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic creation of real-time, emotion-based piano music pieces remains a challenge for deep learning models. While Generative Adversarial Networks (GANs) have shown promise, existing methods can struggle with generating musically coherent pieces and often require complex manual configuration. This paper proposes a novel model called Learning Automata-based Self-Attention Generative Adversarial Network (LA-SAGAN) to address these limitations. The proposed model uses a Generative Adversarial Network (GAN), combined with Self-Attention (SA) mechanism to reach this goal. The benefits of using SA modules in GAN architecture is twofold: First, SA mechanism results in generating music pieces with homogenous structure, which means long-distance dependencies in generated outputs are considered. Second, the SA mechanism utilizes the emotional features of the input to produce output pieces. This results in generating music pieces with desired genre or theme. In order to control the complexity of the proposed model, and optimize its structure, a set of Learning Automata (LA) models have been used to determine the activity state of each SA module. To do this, an iterative algorithm based on cooperation of LAs is introduced which optimizes the model by deactivating unnecessary SA modules. The efficiency of the proposed model in generating piano music has been evaluated. Evaluations demonstrate LA-SAGAN's effectiveness: at least 14.47% improvement in entropy (diversity) and improvements in precision (at least 2.47%) and recall (at least 2.13%). Moreover, human evaluation confirms superior musical coherence and adherence to emotional cues.
引用
收藏
页码:87489 / 87500
页数:12
相关论文
共 50 条
  • [31] Music Creation Technology Based on Generative Adversarial Network
    Liu, Feng
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (09) : 626 - 632
  • [32] Generative Adversarial Shaders for Real-Time Realism Enhancement
    Salmi, A.
    Csefalvay, Sz
    Imber, J.
    COMPUTER GRAPHICS FORUM, 2023, 42 (08)
  • [33] Multi-category MIDI music generation based on LSTM Generative adversarial network
    Wang, Yutian
    Yu, Guochen
    Cai, Juanjuan
    Wang, Hui
    2019 2ND INTERNATIONAL CONFERENCE ON MECHANICAL, ELECTRONIC AND ENGINEERING TECHNOLOGY (MEET 2019), 2019, : 20 - 25
  • [34] A Wasserstein generative adversarial network-based approach for real-time track irregularity estimation using vehicle dynamic responses
    Yuan, Zhandong
    Luo, Jun
    Zhu, Shengyang
    Zhai, Wanming
    VEHICLE SYSTEM DYNAMICS, 2022, 60 (12) : 4186 - 4205
  • [35] Generative Adversarial Network for Real-Time Flash Drought Monitoring: A Deep Learning Study
    Foroumandi, Ehsan
    Gavahi, Keyhan
    Moradkhani, Hamid
    WATER RESOURCES RESEARCH, 2024, 60 (05)
  • [36] EmoKbGAN: Emotion controlled response generation using Generative Adversarial Network for knowledge grounded conversation
    Varshney, Deeksha
    Ekbal, Asif
    Tiwari, Mrigank
    Nagaraja, Ganesh Prasad
    PLOS ONE, 2023, 18 (02):
  • [37] Emotion Classification Using a Tensorflow Generative Adversarial Network Implementation
    Caramihale, Traian
    Popescu, Dan
    Ichim, Loretta
    SYMMETRY-BASEL, 2018, 10 (09):
  • [38] SANTIAGO - A Real-Time Biological Neural Network Environment for Generative Music Creation
    Kerllenevich, Hernan
    Ernesto Riera, Pablo
    Camilo Eguia, Manuel
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, PT II, 2011, 6625 : 344 - 353
  • [39] I-sounds - Emotion-based music generation for virtual environments
    Cruz, Ricardo
    Brisson, Antonio
    Paiva, Ana
    Lopes, Eduardo
    AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 2007, 4738 : 766 - +
  • [40] Synthetic Energy Data Generation Using Time Variant Generative Adversarial Network
    Asre, Shashank
    Anwar, Adnan
    ELECTRONICS, 2022, 11 (03)