A weakly supervised spatial group attention network for fine-grained visual recognition

被引:8
|
作者
Xie, Jiangjian [1 ,2 ,3 ]
Zhong, Yujie [1 ]
Zhang, Junguo [1 ,2 ]
Zhang, Changchun [1 ,2 ]
Schuller, Bjoern W. [3 ,4 ,5 ]
机构
[1] Beijing Forestry Univ, Sch Technol, Beijing 100083, Peoples R China
[2] Beijing Forestry Univ, Res Ctr Biodivers Intelligent Monitoring, Beijing 100083, Peoples R China
[3] Univ Augsburg, Chair Embedded Intelligence Hlth Care & Wellbeing, D-86159 Augsburg, Germany
[4] Imperial Coll London, GLAM Grp Language Audio & Mus, London SW7 2AZ, England
[5] Univ Augsburg, Ctr Interdisciplinary Hlth Res, D-86159 Augsburg, Germany
关键词
Classification; Fine-grained image; Bird recognition; Weakly supervised network; Moment exchange; Spatial group attention;
D O I
10.1007/s10489-023-04627-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The fine-grained visual recognition is to classify several sub-categories affiliated to the same basic-level category, which is highly challenging because the same sub-category with large variance and different sub-categories with small variance. Previously approaches generally localize the targets or parts first, then determine which sub-category the image is attached to. They depend on target or part annotations, which are labor-intensive and a barrier to moving towards practical use. Other methods indirectly extract recognizable areas from the high-level feature maps, ignoring the spatial relationships between the target and its parts, which may cause inaccurate recognition. In this paper, we propose a weakly supervised spatial group attention network (WSSGA-Net) for fine-grained bird recognition. According to the spatial relationships between the target and its parts, we embed the spatial group attention (SGA) module into the WSSGA-Net to highlight the correct semantic feature regions by establishing a semantic feature space enhancement mechanism. In addition, we apply moment exchange (MoEx) to generate new feature maps by exchanging two input image feature moments for data augmentation. Comprehensive experiments indicate that our approach significantly has a better performance than the state-of-the-art approaches on the standard bird image datasets Bird-65, CUB200-2011 and fine-grained dataset Stanford Cars.
引用
收藏
页码:23301 / 23315
页数:15
相关论文
共 50 条
  • [31] Weakly Supervised Learning of Object-Part Attention Model for Fine-Grained Image Classification
    Lei, Chenxi
    Jiang, Linfeng
    Ji, Jingshen
    Zhong, Weilin
    Xiong, Huilin
    2018 IEEE 18TH INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT), 2018, : 1222 - 1226
  • [32] Context Sensitive Network for weakly-supervised fine-grained temporal action localization
    Dong, Cerui
    Liu, Qinying
    Wang, Zilei
    Zhang, Yixin
    Zhao, Feng
    NEURAL NETWORKS, 2025, 185
  • [33] A Saliency-based Weakly-supervised Network for Fine-Grained Image Categorization
    Han, Yawen
    Meng, Fang
    2020 13TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2020), 2020, : 270 - 274
  • [34] Fine-Grained Background Representation for Weakly Supervised Semantic Segmentation
    Yin, Xu
    Im, Woobin
    Min, Dongbo
    Huo, Yuchi
    Pan, Fei
    Yoon, Sung-Eui
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (11) : 11739 - 11750
  • [35] Robust Fine-Grained Visual Recognition With Neighbor-Attention Label Correction
    Mao, Shunan
    Zhang, Shiliang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 2614 - 2626
  • [36] DEEP MULTI-CONTEXT NETWORK FOR FINE-GRAINED VISUAL RECOGNITION
    Ou, Xinyu
    Wei, Zhen
    Ling, Hefei
    Liu, Si
    Cao, Xiaochun
    2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2016,
  • [37] A Novel Smart Lightweight Visual Attention Model for Fine-Grained Vehicle Recognition
    Boukerche, Azzedine
    Ma, Xiren
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) : 13846 - 13862
  • [38] Weakly-Supervised Fine-Grained Event Recognition on Social Media Texts for Disaster Management
    Yao, Wenlin
    Zhang, Cheng
    Saravanan, Shiva
    Huang, Ruihong
    Mostafavi, Ali
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 532 - 539
  • [39] Weakly Supervised Fine-grained Recognition Based on Combined Learning for Small Data and Coarse Label
    Hu, Anqi
    Sun, Zhengxing
    Li, Qian
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2022, 2022, : 194 - 201
  • [40] Foreground-Background Partitioning and Feature Fusion for Weakly Supervised Fine-Grained Image Recognition
    Liu, Chenglin
    Li, Jiuliang
    Chen, Yanmin
    Luo, Jun
    Zhou, Mengyao
    Yang, Jian
    Li, Zhenfei
    PATTERN RECOGNITION AND COMPUTER VISION, PT III, PRCV 2024, 2025, 15033 : 17 - 30