Semi-supervised Open-World Object Detection

被引:1
|
作者
Mullappilly, Sahal Shaji [1 ]
Gehlot, Abhishek Singh [1 ]
Anwer, Rao Muhammad [1 ]
Khan, Fahad Shahbaz [1 ,2 ]
Cholakkal, Hisham [1 ]
机构
[1] Mohamed bin Zayed Univ Artificial Intelligence, Abu Dhabi, U Arab Emirates
[2] Linkoping Univ, Linkoping, Sweden
基金
瑞典研究理事会;
关键词
D O I
10.1609/aaai.v38i5.28227
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Conventional open-world object detection (OWOD) problem setting first distinguishes known and unknown classes and then later incrementally learns the unknown objects when introduced with labels in the subsequent tasks. However, the current OWOD formulation heavily relies on the external human oracle for knowledge input during the incremental learning stages. Such reliance on run-time makes this formulation less realistic in a real-world deployment. To address this, we introduce a more realistic formulation, named semi-supervised open-world detection (SS-OWOD), that reduces the annotation cost by casting the incremental learning stages of OWOD in a semi-supervised manner. We demonstrate that the performance of the state-of-the-art OWOD detector dramatically deteriorates in the proposed SS-OWOD setting. Therefore, we introduce a novel SS-OWOD detector, named SS-OWFormer, that utilizes a feature-alignment scheme to better align the object query representations between the original and augmented images to leverage the large unlabeled and few labeled data. We further introduce a pseudo-labeling scheme for unknown detection that exploits the inherent capability of decoder object queries to capture object-specific information. On the COCO dataset, our SS-OWFormer using only 50% of the labeled data achieves detection performance that is on par with the state-of-the-art (SOTA) OWOD detector using all the 100% of labeled data. Further, our SS-OWFormer achieves an absolute gain of 4.8% in unknown recall over the SOTA OWOD detector. Lastly, we demonstrate the effectiveness of our SS-OWOD problem setting and approach for remote sensing object detection, proposing carefully curated splits and baseline performance evaluations. Our experiments on 4 datasets including MS COCO, PASCAL, Objects365 and DOTA demonstrate the effectiveness of our approach. Our source code, models and splits are available here https://github.com/sahalshajim/SS-OWFormer.
引用
收藏
页码:4305 / 4314
页数:10
相关论文
共 50 条
  • [1] Open-world semi-supervised relation extraction
    Zhou, Diange
    Duan, Yilin
    Li, Shengwen
    Yao, Hong
    NEURAL NETWORKS, 2025, 186
  • [2] Open-world Semi-supervised Novel Class Discovery
    Liu, Jiaming
    Wang, Yangqiming
    Zhang, Tongze
    Fan, Yulu
    Yang, Qinli
    Shao, Junming
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 4002 - 4010
  • [3] LogOW: A semi-supervised log anomaly detection model in open-world setting
    Ye, Jingwei
    Liu, Chunbo
    Gu, Zhaojun
    Zhang, Zhikai
    Meng, Xuying
    Zhang, Weiyao
    Zhang, Yujun
    JOURNAL OF SYSTEMS AND SOFTWARE, 2025, 222
  • [4] Confidence-Guided Open-World Semi-supervised Learning
    Li, Jibang
    Yang, Meng
    Feng, Mao
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IV, 2024, 14428 : 87 - 99
  • [5] Promote knowledge mining towards open-world semi-supervised learning
    Zhao, Tianhao
    Lin, Yutian
    Wu, Yu
    Du, Bo
    PATTERN RECOGNITION, 2024, 149
  • [6] Open-Set Semi-Supervised Object Detection
    Liu, Yen-Cheng
    Ma, Chih-Yao
    Dai, Xiaoliang
    Tian, Junjiao
    Vajda, Peter
    He, Zijian
    Kira, Zsolt
    COMPUTER VISION - ECCV 2022, PT XXX, 2022, 13690 : 143 - 159
  • [7] Open-World Semi-Supervised Learning for fMRI Analysis to Diagnose Psychiatric Disease
    Hu, Chang
    Dong, Yihong
    Peng, Shoubo
    Wu, Yuehan
    Information (Switzerland), 2025, 16 (03)
  • [8] CowSSL: contrastive open-world semi-supervised learning for wafer bin map
    Baek, Insung
    Hwang, Sung Jin
    Kim, Seoung Bum
    JOURNAL OF INTELLIGENT MANUFACTURING, 2025, 36 (03) : 2163 - 2175
  • [9] Bridging the Gap: Learning Pace Synchronization for Open-World Semi-Supervised Learning
    Ye, Bo
    Gan, Kai
    Wei, Tong
    Zhang, Min-Ling
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 5362 - 5370
  • [10] OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning
    Rizve, Mamshad Nayeem
    Kardan, Navid
    Khan, Salman
    Khan, Fahad Shahbaz
    Shah, Mubarak
    COMPUTER VISION, ECCV 2022, PT XXXI, 2022, 13691 : 382 - 401