Domain adaptation with temporal ensembling to local attention region search for object detection

被引:0
|
作者
Shi, Haobin [1 ]
He, Ziming [1 ]
Hwang, Kao-Shing [1 ,2 ,3 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China
[2] Natl Sun Yat sen Univ, Dept Elect Engn, Kaohsiung 80424, Taiwan
[3] Kaohsiung Med Univ, Dept Healthcare Adm & Med Informat, Kaohsiung 80708, Taiwan
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Reinforcement learning; Object detection; Domain adaptation; Temporal ensembling; Attention mechanism; Medical imaging;
D O I
10.1016/j.knosys.2024.112846
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object detection relies heavily on supervised learning, which requires labeled data for training. However, manual labeling often cannot keep pace with the speed of data collection, and models trained on one dataset may not generalize well to new datasets with different characteristics, leading to domain shift issues. Domain adaptation addresses this problem by leveraging labeled data from a source domain and unlabeled data from a target domain to improve performance on the target domain. Limited by the existing domain adaption architecture, the object detection accuracy in the target domain has much room for improvement. In addition, the global search of feature maps costs too much computation. All these problems make it difficult for domain adaptive object detection to be directly applied to tasks such as medical imaging. To this end, this article proposes two architectures: Region-based Object Detection with Domain Adaptation and Temporal Ensembling (DATE) and Local Attention Region Search Algorithm (LARSA). DATE combines domain adaptation and temporal ensembling to enhance feature alignment between domains. At the same time, LARSA employs an attention mechanism to efficiently search for regions of interest and decide when to terminate the search early. Experiments on various datasets demonstrate the effectiveness of the proposed approaches in improving object detection performance under domain shift and reducing computational cost. The proposed framework has the potential to further promote the application of object detection in the field of medical imaging.
引用
收藏
页数:12
相关论文
共 50 条
  • [11] Mixed local channel attention for object detection
    Wan, Dahang
    Lu, Rongsheng
    Shen, Siyuan
    Xu, Ting
    Lang, Xianli
    Ren, Zhijie
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [12] Assessing domain gap for continual domain adaptation in object detection
    Doan, Anh-Dzung
    Nguyen, Bach Long
    Gupta, Surabhi
    Reid, Ian
    Wagner, Markus
    Chin, Tat-Jun
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 238
  • [13] Confidence-Driven Region Mixing for Optical Remote Sensing Domain Adaptation Object Detection
    Liu, Chang
    Dong, Yanni
    Zhang, Yuxiang
    Li, Xue
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [14] Disentangled Discriminator for Unsupervised Domain Adaptation on Object Detection
    Zhu, Yangguang
    Guo, Ping
    Wei, Haoran
    Zhao, Xin
    Wu, Xiangbin
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 5685 - 5691
  • [15] Enhanced soft domain adaptation for object detection in the dark
    Bai, Yunfei
    Liu, Chang
    Yang, Rui
    Li, Xiaomao
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2025, 106
  • [16] Multi-Source Domain Adaptation for Object Detection
    Yao, Xingxu
    Zhao, Sicheng
    Xu, Pengfei
    Yang, Jufeng
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3253 - 3262
  • [17] Unsupervised Domain Adaptation for Object Detection in Cultural Sites
    Pasqualino, Giovanni
    Furnari, Antonino
    Farinella, Giovanni Maria
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 983 - 990
  • [18] Progressive Sparse Local Attention for Video Object Detection
    Guo, Chaoxu
    Fan, Bin
    Gu, Jie
    Zhang, Qian
    Xiang, Shiming
    Prinet, Veronique
    Pan, Chunhong
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3908 - 3917
  • [19] Selected and refined local attention module for object detection
    Luo, Xiaofan
    Hu, Haifeng
    ELECTRONICS LETTERS, 2020, 56 (14) : 712 - +
  • [20] Local Attention Sequence Model for Video Object Detection
    Li, Zhenhui
    Zhuang, Xiaoping
    Wang, Haibo
    Nie, Yong
    Tang, Jianzhong
    APPLIED SCIENCES-BASEL, 2021, 11 (10):