Adversarial Transformers for Weakly Supervised Object Localization

被引:5
|
作者
Meng, Meng [1 ]
Zhang, Tianzhu [1 ]
Zhang, Zhe [2 ,3 ]
Zhang, Yongdong [4 ]
Wu, Feng [4 ,5 ]
机构
[1] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Automat, Hefei, Peoples R China
[2] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Automat, Hefei, Peoples R China
[3] Lunar Explorat & Space Engn Ctr CNSA, Beijing, Peoples R China
[4] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Elect Engn & Informat Sci, Hefei, Peoples R China
[5] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Elect Engn & Informat Sci, Hefei, Peoples R China
关键词
Perturbation methods; Adversarial training; transformers; weakly supervised object localization; SEMANTIC SEGMENTATION;
D O I
10.1109/TIP.2022.3220055
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weakly supervised object localization (WSOL) aims at localizing objects with only image-level labels, which has better scalability and practicability than fully supervised methods. However, without pixel-level supervision, existing methods tend to generate rough localization maps, which hinders localization performance. To alleviate this problem, we propose an adversarial transformer network (ATNet), which aims to obtain a well-learned localization model with pixel-level pseudo labels. The proposed ATNet enjoys several merits. First, we design an object transformer ( $G$ ) that can generate localization maps and pseudo labels effectively and dynamically, and a part transformer ( $D$ ) to accurately discriminate detailed local differences between localization maps and pseudo labels. Second, we propose to train $G$ and $D$ via an adversarial process, where $G$ can generate more accurate localization maps approaching pseudo labels to fool $D$ . To the best of our knowledge, this is the first work to explore transformers with adversarial training to obtain a well-learned localization model for WSOL. Extensive experiments with four backbones on two standard benchmarks demonstrate that our ATNet achieves favorable performance against state-of-the-art WSOL methods. Besides, our adversarial training can provide higher robustness against adversarial attacks.
引用
收藏
页码:7130 / 7143
页数:14
相关论文
共 50 条
  • [41] Weakly Supervised Object Localization with Progressive Domain Adaptation
    Su, Shuochen
    Heide, Felix
    Swanson, Robin
    Klein, Jonathan
    Callenberg, Clara
    Hullin, Matthias
    Heidrich, Wolfgang
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : CP40 - CP40
  • [42] Generative Prompt Model for Weakly Supervised Object Localization
    Zhao, Yuzhong
    Ye, Qixiang
    Wu, Weijia
    Shen, Chunhua
    Wan, Fang
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6328 - 6338
  • [43] Adaptive Zone Learning for Weakly Supervised Object Localization
    Chen, Zhiwei
    Wang, Siwei
    Cao, Liujuan
    Shen, Yunhang
    Ji, Rongrong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 14
  • [44] Soft Proposal Networks for Weakly Supervised Object Localization
    Zhu, Yi
    Zhou, Yanzhao
    Ye, Qixiang
    Qiu, Qiang
    Jiao, Jianbin
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1859 - 1868
  • [45] Video-based Object Recognition with Weakly Supervised Object Localization
    Liu, Yang
    Kouskouridas, Rigas
    Kim, Tae-Kyun
    Proceedings 3rd IAPR Asian Conference on Pattern Recognition ACPR 2015, 2015, : 46 - 50
  • [46] Dual-Gradients Localization Framework for Weakly Supervised Object Localization
    Tan, Chuangchuang
    Gu, Guanghua
    Ruan, Tao
    Wei, Shikui
    Zhao, Yao
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1976 - 1984
  • [47] Bridging the Gap between Classification and Localization for Weakly Supervised Object Localization
    Kim, Eunji
    Kim, Siwon
    Lee, Jungbeom
    Kim, Hyunwoo
    Yoon, Sungroh
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 14238 - 14247
  • [48] MOST: Multiple Object localization with Self-supervised Transformers for object discovery
    Rambhatla, Sai Saketh
    Misra, Ishan
    Chellappa, Rama
    Shrivastava, Abhinav
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 15777 - 15788
  • [49] CONTEXT-AWARE TRANSFORMERS FOR WEAKLY SUPERVISED BAGGAGE THREAT LOCALIZATION
    Velayudhan, Divya
    Ahmed, Abdelfatah
    Hassan, Taimur
    Bennamoun, Mohammed
    Damiani, Ernesto
    Werghi, Naoufel
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 3538 - 3542
  • [50] Complementary adversarial mechanisms for weakly-supervised temporal action localization
    Wang, Chuanxu
    Wang, Jing
    Liu, Peng
    PATTERN RECOGNITION, 2023, 139