Audio Features for Music Emotion Recognition: A Survey

被引:41
|
作者
Panda, Renato [1 ,2 ]
Malheiro, Ricardo [1 ,3 ]
Paiva, Rui Pedro [1 ]
机构
[1] Univ Coimbra, Ctr Informat & Syst, Dept Informat Engn, P-3030290 Coimbra, Portugal
[2] Polytech Inst Tomar, Ci2, P-2300313 Tomar, Portugal
[3] Miguel Torga Higher Inst, P-3000132 Coimbra, Portugal
关键词
Rhythm; Feature extraction; Emotion recognition; Psychology; Indexes; Machine learning; Affective computing; music emotion recognition; audio feature design; music information retrieval; PERCEPTION; EXPRESSION; PITCH; EXTRACTION; SPEECH; TIMBRE; REPRESENTATIONS; CLASSIFICATION; REGRESSION; RESPONSES;
D O I
10.1109/TAFFC.2020.3032373
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The design of meaningful audio features is a key need to advance the state-of-the-art in music emotion recognition (MER). This article presents a survey on the existing emotionally-relevant computational audio features, supported by the music psychology literature on the relations between eight musical dimensions (melody, harmony, rhythm, dynamics, tone color, expressivity, texture and form) and specific emotions. Based on this review, current gaps and needs are identified and strategies for future research on feature engineering for MER are proposed, namely ideas for computational audio features that capture elements of musical form, texture and expressivity that should be further researched. Previous MER surveys offered broad reviews, covering topics such as emotion paradigms, approaches for the collection of ground-truth data, types of MER problems and overviewing different MER systems. On the contrary, our approach is to offer a deep and specific review on one key MER problem: the design of emotionally-relevant audio features.
引用
收藏
页码:68 / 88
页数:21
相关论文
共 50 条
  • [41] EXTRACTING AUDIO-VISUAL FEATURES FOR EMOTION RECOGNITION THROUGH ACTIVE FEATURE SELECTION
    Haider, Fasih
    Pollak, Senja
    Albert, Pierre
    Luz, Saturnino
    2019 7TH IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (IEEE GLOBALSIP), 2019,
  • [42] Personalized Music Emotion Recognition
    Yang, Yi-Hsuan
    Lin, Yu-Ching
    Chen, Homer
    PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 748 - 749
  • [43] A Survey on Automatic Emotion Recognition Using Audio Big Data and Deep Learning Architectures
    Zhao, Huijuan
    Ye, Ning
    Wang, Ruchuan
    2018 IEEE 4TH INTERNATIONAL CONFERENCE ON BIG DATA SECURITY ON CLOUD (BIGDATASECURITY), 4THIEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE AND SMART COMPUTING, (HPSC) AND 3RD IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA AND SECURITY (IDS), 2018, : 139 - 142
  • [44] Multi-view Neural Networks for Raw Audio-based Music Emotion Recognition
    He, Na
    Ferguson, Sam
    2020 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2020), 2020, : 168 - 172
  • [45] Multi-label Emotion Classification in Music Videos Using Ensembles of Audio and Video Features
    Kostiuk, Bruno
    Costa, Yandre M. G.
    Britto Jr, Alceu S.
    Hu, Xiao
    Silla Jr, Carlos N.
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 517 - 523
  • [46] Survey on audiovisual emotion recognition: databases, features, and data fusion strategies
    Wu, Chung-Hsien
    Lin, Jen-Chun
    Wei, Wen-Li
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2014, 3 (03)
  • [47] EXTRACTING AND RECOGNISING MUSIC FEATURES THROUGH MULTI-MODAL EMOTION RECOGNITION
    Xu, Chi
    MECHATRONIC SYSTEMS AND CONTROL, 2024, 52 (03): : 140 - 146
  • [48] Music classification system through emotion recognition based on regression model of music signal and electroencephalogram features
    Lee, Ju-Hwan
    Kim, Jin-Young
    Jeong, Dong-Ki
    Kim, Hyoung-Gook
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2022, 41 (02): : 115 - 121
  • [49] Fast recognition of remixed music audio
    Casey, Michael
    Slaney, Malcolm
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1425 - +
  • [50] Audio Features in Music Information Retrieval
    Grzywczak, Daniel
    Gwardys, Grzegorz
    ACTIVE MEDIA TECHNOLOGY, AMT 2014, 2014, 8610 : 187 - 199