Real-Time Emotion-Based Piano Music Generation Using Generative Adversarial Network (GAN)

被引:0
|
作者
Zheng, Lijun [1 ]
Li, Chenglong [2 ]
机构
[1] Ewha Womans Univ, Sch Mus, Seoul 03760, South Korea
[2] Qiannan Normal Coll Nationalities, Conservatory Mus & Dance, Duyun 558000, Guizhou, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Generative adversarial networks; Learning automata; Deep learning; Music; Instruments; Complexity theory; Computational modeling; Reinforcement learning; Real-time music generation; generative adversarial network; self-attention mechanism; reinforcement learning; learning automata; emotion-based music;
D O I
10.1109/ACCESS.2024.3414673
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic creation of real-time, emotion-based piano music pieces remains a challenge for deep learning models. While Generative Adversarial Networks (GANs) have shown promise, existing methods can struggle with generating musically coherent pieces and often require complex manual configuration. This paper proposes a novel model called Learning Automata-based Self-Attention Generative Adversarial Network (LA-SAGAN) to address these limitations. The proposed model uses a Generative Adversarial Network (GAN), combined with Self-Attention (SA) mechanism to reach this goal. The benefits of using SA modules in GAN architecture is twofold: First, SA mechanism results in generating music pieces with homogenous structure, which means long-distance dependencies in generated outputs are considered. Second, the SA mechanism utilizes the emotional features of the input to produce output pieces. This results in generating music pieces with desired genre or theme. In order to control the complexity of the proposed model, and optimize its structure, a set of Learning Automata (LA) models have been used to determine the activity state of each SA module. To do this, an iterative algorithm based on cooperation of LAs is introduced which optimizes the model by deactivating unnecessary SA modules. The efficiency of the proposed model in generating piano music has been evaluated. Evaluations demonstrate LA-SAGAN's effectiveness: at least 14.47% improvement in entropy (diversity) and improvements in precision (at least 2.47%) and recall (at least 2.13%). Moreover, human evaluation confirms superior musical coherence and adherence to emotional cues.
引用
收藏
页码:87489 / 87500
页数:12
相关论文
共 50 条
  • [41] Architectural layout generation using a graph-constrained conditional Generative Adversarial Network (GAN)
    Aalaei, Mohammadreza
    Saadi, Melika
    Rahbar, Morteza
    Ekhlassi, Ahmad
    AUTOMATION IN CONSTRUCTION, 2023, 155
  • [42] Realistic real-time processing of anime portraits based on generative adversarial networks
    Zhu, Gaofeng
    Qu, Zhiguo
    Sun, Le
    Liu, Yuming
    Yang, Jianfeng
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (04)
  • [43] Polyphonic music generation generative adversarial network with Markov decision process
    Huang, Wenkai
    Xue, Yihao
    Xu, Zefeng
    Peng, Guanglong
    Wu, Yu
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (21) : 29865 - 29885
  • [44] A Generative Adversarial Network Model Based on Intelligent Data Analytics for Music Emotion Recognition under IoT
    Huang, I-Sheng
    Lu, Yu-Hsuan
    Shafiq, Muhammad
    Laghari, Asif Ali
    Yadav, Rahul
    MOBILE INFORMATION SYSTEMS, 2021, 2021
  • [45] Polyphonic music generation generative adversarial network with Markov decision process
    Wenkai Huang
    Yihao Xue
    Zefeng Xu
    Guanglong Peng
    Yu Wu
    Multimedia Tools and Applications, 2022, 81 : 29865 - 29885
  • [46] A transformer generative adversarial network for multi-track music generation
    Jin, Cong
    Wang, Tao
    Li, Xiaobing
    Tie, Chu Jie Jiessie
    Tie, Yun
    Liu, Shan
    Yan, Ming
    Li, Yongzhi
    Wang, Junxian
    Huang, Shenze
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2022, 7 (03) : 369 - 380
  • [47] Enhancement of Alaryngeal Speech using Generative Adversarial Network (GAN)
    Huq, Mahmudul
    2021 IEEE/ACS 18TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2021,
  • [48] Emotion Recognition Based on EEG Using Generative Adversarial Nets and Convolutional Neural Network
    Pan, Bo
    Zheng, Wei
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2021, 2021
  • [49] A novel AI-based model for real-time flooding image recognition using super-resolution generative adversarial network
    Zeng, Yuan -Fu
    Chang, Ming-Jui
    Lin, Gwo-Fong
    JOURNAL OF HYDROLOGY, 2024, 638
  • [50] SCAN-GAN: Generative Adversarial Network Based Synthetic Data Generation Technique for Controller Area Network
    Chougule A.
    Agrawal K.
    Chamola V.
    IEEE Internet of Things Magazine, 2023, 6 (03): : 126 - 130