Dance2Music-Diffusion: leveraging latent diffusion models for music generation from dance videos

被引:0
|
作者
Zhang, Chaoyang [1 ]
Hua, Yan [1 ]
机构
[1] Commun Univ China, Sch Informat & Commun Engn, 1 Dingfuzhuang East St, Beijing 100024, Peoples R China
来源
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING | 2024年 / 2024卷 / 01期
基金
中国国家自然科学基金;
关键词
Diffusion; Cross-modality; Dance to music; Transformer;
D O I
10.1186/s13636-024-00370-6
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
With the rapid development of social networks, short videos have become a popular form of content, especially dance videos. In this context, research on automatically generating music for dance videos shows significant practical value. However, existing studies face challenges such as limited richness in music timbre and lack of synchronization with dance movements. In this paper, we propose Dance2Music-Diffusion, a novel framework for music generation from dance videos using latent diffusion models. Our approach includes a motion encoder module for extracting motion features and a music diffusion generation module for generating latent music representations. By integrating dance type monitoring and latent diffusion techniques, our framework outperforms existing methods in generating complex and rich dance music. We conducted objective and subjective evaluations of the results produced by various existing models on the AIST++ dataset. Our framework shows outstanding performance in terms of beat recall rate, consistency with GT beats, and coordination with dance movements. This work represents the state of the art in automatic music generation from dance videos, is easy to train, and has implications for enhancing entertainment experiences and inspiring innovative dance productions. Sample videos of our generated music and dance can be viewed at https://youtu.be/eCvLdLdkX-Y. The code is available at https://github.com/hellto/dance2music-diffusion.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] From movement to dance in music education
    Martin Escobar, Maria Jesus
    EDUCATIO SIGLO XXI, 2005, 23 : 125 - 139
  • [22] Dance House: European Models of Folk Music and Dance Revival in Urban Settings
    Pettan, Svanibor
    JOURNAL OF URBAN CULTURE RESEARCH, 2010, 1 : 128 - 135
  • [23] Dance with You: The Diversity Controllable Dancer Generation via Diffusion Models
    Yao, Siyue
    Sun, Mingjie
    Li, Bingliang
    Yang, Fengyu
    Wang, Junle
    Zhang, Ruimao
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 8504 - 8514
  • [24] InstructME: An Instruction Guided Music Edit Framework with Latent Diffusion Models
    Han, Bing
    Dai, Junyu
    Hao, Weituo
    He, Xinyan
    Guo, Dong
    Chen, Jitong
    Wang, Yuxuan
    Qian, Yanmin
    Song, Xuchen
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 5835 - 5843
  • [25] DanceComposer: Dance-to-Music Generation Using a Progressive Conditional Music Generator
    Liang, Xiao
    Li, Wensheng
    Huang, Lifeng
    Gao, Chengying
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 10237 - 10250
  • [26] Bidirectional Autoregressive Diffusion Model for Dance Generation
    Zhang, Canyu
    Tang, Youbao
    Zhang, Ning
    Lin, Ruei-Sung
    Han, Mei
    Xiao, Jing
    Wang, Song
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 687 - 696
  • [27] Creative Chord Sequence Generation for Electronic Dance Music
    Conklin, Darrell
    Gasser, Martin
    Oertl, Stefan
    APPLIED SCIENCES-BASEL, 2018, 8 (09):
  • [28] From the Dance Floor to the Brain: The Effects of Music and Dance on Movement Disorders
    Kojovic, Maja
    MOVEMENT DISORDERS CLINICAL PRACTICE, 2024,
  • [29] Robot Dance Generation with Music Based Trajectory Optimization
    Boukheddimi, Melya
    Harnack, Daniel
    Kumar, Shivesh
    Kumar, Rohit
    Vyas, Shubham
    Arriaga, Octavio
    Kirchner, Frank
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 3069 - 3076
  • [30] Soul Train: The Music, Dance, and Style of a Generation.
    Salois, Kendra
    JOURNAL OF POPULAR MUSIC STUDIES, 2014, 26 (2-3) : 414 - 418