Audio Features for Music Emotion Recognition: A Survey

被引:41
|
作者
Panda, Renato [1 ,2 ]
Malheiro, Ricardo [1 ,3 ]
Paiva, Rui Pedro [1 ]
机构
[1] Univ Coimbra, Ctr Informat & Syst, Dept Informat Engn, P-3030290 Coimbra, Portugal
[2] Polytech Inst Tomar, Ci2, P-2300313 Tomar, Portugal
[3] Miguel Torga Higher Inst, P-3000132 Coimbra, Portugal
关键词
Rhythm; Feature extraction; Emotion recognition; Psychology; Indexes; Machine learning; Affective computing; music emotion recognition; audio feature design; music information retrieval; PERCEPTION; EXPRESSION; PITCH; EXTRACTION; SPEECH; TIMBRE; REPRESENTATIONS; CLASSIFICATION; REGRESSION; RESPONSES;
D O I
10.1109/TAFFC.2020.3032373
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The design of meaningful audio features is a key need to advance the state-of-the-art in music emotion recognition (MER). This article presents a survey on the existing emotionally-relevant computational audio features, supported by the music psychology literature on the relations between eight musical dimensions (melody, harmony, rhythm, dynamics, tone color, expressivity, texture and form) and specific emotions. Based on this review, current gaps and needs are identified and strategies for future research on feature engineering for MER are proposed, namely ideas for computational audio features that capture elements of musical form, texture and expressivity that should be further researched. Previous MER surveys offered broad reviews, covering topics such as emotion paradigms, approaches for the collection of ground-truth data, types of MER problems and overviewing different MER systems. On the contrary, our approach is to offer a deep and specific review on one key MER problem: the design of emotionally-relevant audio features.
引用
收藏
页码:68 / 88
页数:21
相关论文
共 50 条
  • [21] Emotion Recognition in Audio Records
    Pavaloi, Ioan
    Musca, Elena
    Rotaru, Florin
    2013 INTERNATIONAL SYMPOSIUM ON SIGNALS, CIRCUITS AND SYSTEMS (ISSCS), 2013,
  • [22] The CASIA Audio Emotion Recognition Method for Audio/Visual Emotion Challenge 2011
    Pan, Shifeng
    Tao, Jianhua
    Li, Ya
    AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PT II, 2011, 6975 : 388 - 395
  • [23] Survey on speech emotion recognition: Features, classification schemes, and databases
    El Ayadi, Moataz
    Kamel, Mohamed S.
    Karray, Fakhri
    PATTERN RECOGNITION, 2011, 44 (03) : 572 - 587
  • [24] Review of data features-based music emotion recognition methods
    Yang, Xinyu
    Dong, Yizhuo
    Li, Juan
    MULTIMEDIA SYSTEMS, 2018, 24 (04) : 365 - 389
  • [25] XGBoost-based Music Emotion Recognition with Emobase Emotional Features
    Kyaw, Pyi Bhone
    Cho, Li
    2024 11TH INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN, ICCE-TAIWAN 2024, 2024, : 271 - 272
  • [26] Music Emotion Recognition by Using Chroma Spectrogram and Deep Visual Features
    Mehmet Bilal Er
    Ibrahim Berkan Aydilek
    International Journal of Computational Intelligence Systems, 2019, 12 : 1622 - 1634
  • [27] Music Emotion Recognition by Using Chroma Spectrogram and Deep Visual Features
    Er, Mehmet Bilal
    Aydilek, Ibrahim Berkan
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2019, 12 (02) : 1622 - 1634
  • [28] Multimodal Fusion of EEG and Musical Features in Music-Emotion Recognition
    Thammasan, Nattapong
    Fukui, Ken-ichi
    Numao, Masayuki
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4991 - 4992
  • [29] Review of data features-based music emotion recognition methods
    Xinyu Yang
    Yizhuo Dong
    Juan Li
    Multimedia Systems, 2018, 24 : 365 - 389
  • [30] A new approach of audio emotion recognition
    Ooi, Chien Shing
    Seng, Kah Phooi
    Ang, Li-Minn
    Chew, Li Wern
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (13) : 5858 - 5869