Hierarchical complementary learning for weakly supervised object localization

被引:0
|
作者
Benassou, Sabrina Narimene [1 ]
Shi, Wuzhen [2 ]
Jiang, Feng [1 ]
Benzine, Abdallah [3 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, 92 Xidazhi St, Harbin, Peoples R China
[2] Shenzhen Univ, Coll Elect & Informat Engn, 3688 Nanhai Ave, Shenzhen, Peoples R China
[3] Digeiz, AI Lab, 47 Rue Marcel Dassault, F-92100 Boulogne, France
基金
美国国家科学基金会;
关键词
Weakly supervised object localization; Class activation map; Complementary map; Fusion strategy;
D O I
10.1016/j.image.2021.116520
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Weakly supervised object localization (WSOL) is a challenging problem that aims to localize objects without ground-truth bounding boxes. A common approach is to train the model that generates a class activation map (CAM) to localize the discriminative features of the object. Unfortunately, the limitation of this method is that they detect just a part of the object and not the whole object. To solve this problem, previous works have removed some parts of the image (Zhang et al., 2018; Zhang et al., 2018; Singh and Lee, 2017; Choe and Shim, 2019) to force the model to detect the full object extent. However, these methods require one or many hyper-parameters to erase the appropriate pixels on the image, which could involve a loss of information. In this paper, we propose a Hierarchical Complementary Learning Network method (HCLNet) that helps the CNN to perform better on classification and localization. HCLNet uses a complementary CAM to generate multiple maps that detect different parts of the object. Unlike previous works, this method does not need any extra hyper-parameters, as well as does not introduce a big loss of information. In order to fuse these different maps, two different fusion strategies known as the addition strategy and the I-1-norm strategy have been used. These strategies allow to detect the whole object while excluding the background. Extensive experiments show that HCLNet obtains better performance than state-of-the-art methods.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Foreground Activation Maps for Weakly Supervised Object Localization
    Meng, Meng
    Zhang, Tianzhu
    Tian, Qi
    Zhang, Yongdong
    Wu, Feng
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3365 - 3375
  • [42] Token Masking Transformer for Weakly Supervised Object Localization
    Xu, Wenhao
    Wang, Changwei
    Xu, Rongtao
    Xu, Shibiao
    Meng, Weiliang
    Zhang, Man
    Zhang, Xiaopeng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 2059 - 2069
  • [43] Rethinking erasing strategy on weakly supervised object localization
    Fan, Yuming
    Wei, Shikui
    Tan, Chuangchuang
    Chen, Xiaotong
    Yang, Dongming
    Zhao, Yao
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2025, 135
  • [44] Aggregation of attention and erasing for weakly supervised object localization
    Koo, Bongyeong
    Choi, Han-Soo
    Kang, Myungjoo
    IMAGE AND VISION COMPUTING, 2023, 129
  • [45] Progressive Representation Adaptation for Weakly Supervised Object Localization
    Li, Dong
    Huang, Jia-Bin
    Li, Yali
    Wang, Shengjin
    Yang, Ming-Hsuan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (06) : 1424 - 1438
  • [46] Evaluating Weakly Supervised Object Localization Methods Right
    Choe, Junsuk
    Oh, Seong Joon
    Lee, Seungho
    Chun, Sanghyuk
    Akata, Zeynep
    Shim, Hyunjung
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3130 - 3139
  • [47] DANet: Divergent Activation for Weakly Supervised Object Localization
    Xue, Haolan
    Liu, Chang
    Wan, Fang
    Jiao, Jianbin
    Ji, Xiangyang
    Ye, Qixiang
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6588 - 6597
  • [48] Shallow Feature Matters for Weakly Supervised Object Localization
    Wei, Jun
    Wang, Qin
    Li, Zhen
    Wang, Sheng
    Zhou, S. Kevin
    Cui, Shuguang
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5989 - 5997
  • [49] ViTOL: Vision Transformer for Weakly Supervised Object Localization
    Gupta, Saurav
    Lakhotia, Sourav
    Rawat, Abhay
    Tallamraju, Rahul
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4100 - 4109
  • [50] Weakly Supervised Object Localization Using Size Estimates
    Shi, Miaojing
    Ferrari, Vittorio
    COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 : 105 - 121