Hierarchical complementary learning for weakly supervised object localization

被引：0

作者：

Benassou, Sabrina Narimene ^{[1
]}

Shi, Wuzhen ^{[2
]}

Jiang, Feng ^{[1
]}

Benzine, Abdallah ^{[3
]}

机构：

[1] Harbin Inst Technol, Sch Comp Sci & Technol, 92 Xidazhi St, Harbin, Peoples R China

[2] Shenzhen Univ, Coll Elect & Informat Engn, 3688 Nanhai Ave, Shenzhen, Peoples R China

[3] Digeiz, AI Lab, 47 Rue Marcel Dassault, F-92100 Boulogne, France

来源：

SIGNAL PROCESSING-IMAGE COMMUNICATION | 2022年 / 100卷

基金：

美国国家科学基金会;

关键词：

Weakly supervised object localization; Class activation map; Complementary map; Fusion strategy;

D O I：

10.1016/j.image.2021.116520

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Weakly supervised object localization (WSOL) is a challenging problem that aims to localize objects without ground-truth bounding boxes. A common approach is to train the model that generates a class activation map (CAM) to localize the discriminative features of the object. Unfortunately, the limitation of this method is that they detect just a part of the object and not the whole object. To solve this problem, previous works have removed some parts of the image (Zhang et al., 2018; Zhang et al., 2018; Singh and Lee, 2017; Choe and Shim, 2019) to force the model to detect the full object extent. However, these methods require one or many hyper-parameters to erase the appropriate pixels on the image, which could involve a loss of information. In this paper, we propose a Hierarchical Complementary Learning Network method (HCLNet) that helps the CNN to perform better on classification and localization. HCLNet uses a complementary CAM to generate multiple maps that detect different parts of the object. Unlike previous works, this method does not need any extra hyper-parameters, as well as does not introduce a big loss of information. In order to fuse these different maps, two different fusion strategies known as the addition strategy and the I-1-norm strategy have been used. These strategies allow to detect the whole object while excluding the background. Extensive experiments show that HCLNet obtains better performance than state-of-the-art methods.

引用

页数：7

共 50 条

[41] Foreground Activation Maps for Weakly Supervised Object Localization
Meng, Meng
Zhang, Tianzhu
Tian, Qi
Zhang, Yongdong
Wu, Feng
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3365 - 3375
[42] Token Masking Transformer for Weakly Supervised Object Localization
Xu, Wenhao
Wang, Changwei
Xu, Rongtao
Xu, Shibiao
Meng, Weiliang
Zhang, Man
Zhang, Xiaopeng
IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 2059 - 2069
[43] Rethinking erasing strategy on weakly supervised object localization
Fan, Yuming
Wei, Shikui
Tan, Chuangchuang
Chen, Xiaotong
Yang, Dongming
Zhao, Yao
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2025, 135
[44] Aggregation of attention and erasing for weakly supervised object localization
Koo, Bongyeong
Choi, Han-Soo
Kang, Myungjoo
IMAGE AND VISION COMPUTING, 2023, 129
[45] Progressive Representation Adaptation for Weakly Supervised Object Localization
Li, Dong
Huang, Jia-Bin
Li, Yali
Wang, Shengjin
Yang, Ming-Hsuan
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (06) : 1424 - 1438
[46] Evaluating Weakly Supervised Object Localization Methods Right
Choe, Junsuk
Oh, Seong Joon
Lee, Seungho
Chun, Sanghyuk
Akata, Zeynep
Shim, Hyunjung
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3130 - 3139
[47] DANet: Divergent Activation for Weakly Supervised Object Localization
Xue, Haolan
Liu, Chang
Wan, Fang
Jiao, Jianbin
Ji, Xiangyang
Ye, Qixiang
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6588 - 6597
[48] Shallow Feature Matters for Weakly Supervised Object Localization
Wei, Jun
Wang, Qin
Li, Zhen
Wang, Sheng
Zhou, S. Kevin
Cui, Shuguang
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5989 - 5997
[49] ViTOL: Vision Transformer for Weakly Supervised Object Localization
Gupta, Saurav
Lakhotia, Sourav
Rawat, Abhay
Tallamraju, Rahul
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4100 - 4109
[50] Weakly Supervised Object Localization Using Size Estimates
Shi, Miaojing
Ferrari, Vittorio
COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 : 105 - 121

← 1 2 3 4 5 →