One-stage detection for unsupervised domain adaptation with efficient multi-scale attention and confidence-augmented combination

被引:0
|
作者
Xiang, Nan [1 ,2 ,3 ]
Liu, Qianxi [4 ]
Jiang, Yaoyao [4 ]
机构
[1] Chongqing Univ Technol, Liangjiang Int Coll, Chongqing, Peoples R China
[2] Chongqing Univ, Coll Comp Sci, Chongqing, Peoples R China
[3] Chongqing Jialing Special Equipment Co Ltd, Chongqing, Peoples R China
[4] Chongqing Univ Technol, Coll Comp Sci & Engn, Chongqing, Peoples R China
基金
中国博士后科学基金;
关键词
unsupervised domain adaptation; object detection; unsupervised domain adaptation for object detection;
D O I
10.1117/1.JEI.33.6.063025
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Unsupervised domain adaptation for object detection leverages a labeled domain to learn an object detector generalizing to a different domain free of annotations. We propose efficient multi-scale attention, confidence mixing, augmentation, and combination (ECAC), an adaptive object detector learning method based on a region-level confidence sample mixing strategy. Compared with the current methods, our approach crops high-confidence detection regions from both the source and target domains, augments them, and combines them to generate composite samples. In addition, consistency loss is utilized to solve the domain adaptation problem. Furthermore, we introduce the efficient multi-scale attention (EMA) into the detector. To retain channel information and reduce computational overhead, EMA attention restructures part of the channels into the batch dimension and groups the channel dimension into multiple sub-features, ensuring spatial semantic features are evenly distributed within each feature group. EMA employs a shared 1 x 1 convolution branch from the CA attention module, along with a parallel 3 x 3 convolution kernel to aggregate multi-scale spatial structure information. This approach effectively enhances the model's focus on region-level features by integrating local and global information with multi-scale parallel sub-networks and cross-spatial learning. For pseudo-label filtering, we progressively transition from a loose to a stricter confidence threshold. Initially, this allows more pseudo-labels, facilitating the detector's learning of target domain representations. As training progresses, stricter thresholds are applied to select more reliable pseudo-labels, gradually filtering out inaccurate pseudo-detections. Our extensive experiments on three datasets demonstrate that ECAC achieves state-of-the-art performance on two of them. On the third dataset, our method improves the mean average precision by over 2% compared with the latest methods. (c) 2024 SPIE and IS&T
引用
收藏
页数:19
相关论文
共 50 条
  • [1] An improved one-stage pedestrian detection method based on multi-scale attention feature extraction
    Ma, Jun
    Wan, Honglin
    Wang, Junxia
    Xia, Hao
    Bai, Chengjie
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2021, 18 (06) : 1965 - 1978
  • [2] An improved one-stage pedestrian detection method based on multi-scale attention feature extraction
    Jun Ma
    Honglin Wan
    Junxia Wang
    Hao Xia
    Chengjie Bai
    Journal of Real-Time Image Processing, 2021, 18 : 1965 - 1978
  • [3] Shadow detection via multi-scale feature fusion and unsupervised domain adaptation
    Zhou, Kai
    Wu, Wen
    Shao, Yan-Li
    Fang, Jing-Long
    Wang, Xing-Qi
    Wei, Dan
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 88
  • [4] Category-related attention domain adaptation for one-stage cross-domain object detection
    Guan, Shengxian
    Dong, Shuai
    Gao, Yuefang
    Zou, Kun
    IET IMAGE PROCESSING, 2024, 18 (02) : 362 - 378
  • [5] DANNet: A One-Stage Domain Adaptation Network for Unsupervised Nighttime Semantic Segmentation
    Wu, Xinyi
    Wu, Zhenyao
    Guo, Hao
    Ju, Lili
    Wang, Song
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15764 - 15773
  • [6] One stage multi-scale efficient network for underwater target detection
    Zhang, Huaqiang
    Dai, Chenggang
    Chen, Chengjun
    Zhao, Zhengxu
    Lin, Mingxing
    REVIEW OF SCIENTIFIC INSTRUMENTS, 2024, 95 (06):
  • [7] MSDAN: Multi-Scale Self-Attention Unsupervised Domain Adaptation Network for Thyroid Ultrasound Images
    Ying, Xiang
    Zhang, Yulin
    Wei, Xi
    Yu, Mei
    Zhu, Jialin
    Gao, Jie
    Liu, Zhiqiang
    Li, Xuewei
    Yu, Ruiguo
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 871 - 876
  • [8] Unsupervised scene adaptation for faster multi-scale pedestrian detection
    Karaman, Svebor
    Lisanti, Giuseppe
    Karaman, Svebor
    Bagdanov, Andrew D.
    Del Bimbo, Alberto
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 3534 - 3539
  • [9] A One-Stage Domain Adaptation Network With Image Alignment for Unsupervised Nighttime Semantic Segmentation
    Wu, Xinyi
    Wu, Zhenyao
    Ju, Lili
    Wang, Song
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) : 58 - 72
  • [10] OSAN: A One-Stage Alignment Network to Unify Multimodal Alignment and Unsupervised Domain Adaptation
    Liu, Ye
    Qiao, Lingfeng
    Lu, Changchong
    Yin, Di
    Lin, Chen
    Peng, Haoyuan
    Ren, Bo
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 3551 - 3560