A Multi-Stage Visual Perception Approach for Image Emotion Analysis

被引:0
|
作者
Pan, Jicai [1 ]
Lu, Jinqiao [1 ]
Wang, Shangfei [1 ]
机构
[1] Univ Sci & Technol China, Sch Comp Sci & Technol, Key Lab Comp & Commun Software Anhui Prov, Hefei 230027, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Task analysis; Visualization; Emotion recognition; Convolutional neural networks; Classification algorithms; Adaptation models; Affective gap; attribute; entity; image emotion analysis; multi-stage perception;
D O I
10.1109/TAFFC.2024.3372090
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most current methods for image emotion analysis suffer from the affective gap, in which features directly extracted from images are supervised by a single emotional label, which may not align with users' perceived emotions. To effectively address this limitation, this article introduces a novel multi-stage perception approach inspired by the human staged emotion perception process. The proposed approach comprises three perception modules: entity perception, attribute perception, and emotion perception. The entity perception module identifies entities in images, while the attribute perception module captures the attribute content associated with each entity. Finally, the emotion perception module combines entity and attribute information to extract emotion features. Pseudo-labels of entities and attributes are generated through image segmentation and vision-language models to provide auxiliary guidance for network learning. A progressive understanding of entities and attributes allows the network to hierarchically extract semantic-level features for emotion analysis. Comprehensive experiments on image emotion classification, regression, and distribution learning demonstrate the superior performance of our multi-stage perception network.
引用
收藏
页码:1786 / 1799
页数:14
相关论文
共 50 条
  • [21] A multi-stage dynamical fusion network for multimodal emotion recognition
    Sihan Chen
    Jiajia Tang
    Li Zhu
    Wanzeng Kong
    Cognitive Neurodynamics, 2023, 17 : 671 - 680
  • [22] Toward Multi-Stage Decoupled Visual SLAM System
    Merzban, Mohamed H.
    Abdellatif, Mohamed
    Abbas, Hossam
    Sessa, Salvatore
    2013 IEEE INTERNATIONAL SYMPOSIUM ON ROBOTIC AND SENSORS ENVIRONMENTS (ROSE 2013), 2013,
  • [23] Adaptation reveals multi-stage coding of visual duration
    James Heron
    Corinne Fulcher
    Howard Collins
    David Whitaker
    Neil W. Roach
    Scientific Reports, 9
  • [24] Multi-stage Attention based Visual Question Answering
    Mishra, Aakansha
    Anand, Ashish
    Guha, Prithwijit
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9407 - 9414
  • [25] Adaptation reveals multi-stage coding of visual duration
    Heron, James
    Fulcher, Corinne
    Collins, Howard
    Whitaker, David
    Roach, Neil W.
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [26] MSHT: Multi-Stage Hybrid Transformer for the ROSE Image Analysis of Pancreatic Cancer
    Zhang, Tianyi
    Feng, Yunlu
    Zhao, Yu
    Fan, Guangda
    Yang, Aiming
    Lyu, Shangqing
    Zhang, Peng
    Song, Fan
    Ma, Chenbin
    Sun, Yangyang
    Feng, Youdan
    Zhang, Guanglei
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (04) : 1946 - 1957
  • [27] Technology Enablers for Big Data, Multi-Stage Analysis in Medical Image Processing
    Bao, Shunxing
    Parvarthaneni, Prasanna
    Huo, Yuankai
    Barve, Yogesh
    Plassard, Andrew J.
    Yao, Yuang
    Sun, Hongyang
    Lyu, Ilwoo
    Zald, David H.
    Landman, Bennett A.
    Gokhale, Aniruddha
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 1337 - 1346
  • [28] MHRNet: A Multi-stage Image Deblurring Approach with High-Resolution Representation Learning
    Liu, Wenfu
    Peng, Junjie
    Yuan, Haochen
    Zhang, Luming
    Cai, Zesu
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [29] PMDNet: A multi-stage approach to single image dehazing with contextual and spatial feature preservation
    Pushpalatha, D.
    Prithvi, P.
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2025, 107
  • [30] Optimal Multi-Stage Arrhythmia Classification Approach
    Zheng, Jianwei
    Chu, Huimin
    Struppa, Daniele
    Zhang, Jianming
    Yacoub, Sir Magdi
    El-Askary, Hesham
    Chang, Anthony
    Ehwerhemuepha, Louis
    Abudayyeh, Islam
    Barrett, Alexander
    Fu, Guohua
    Yao, Hai
    Li, Dongbo
    Guo, Hangyuan
    Rakovski, Cyril
    SCIENTIFIC REPORTS, 2020, 10 (01)