HTD: Heterogeneous Task Decoupling for Two-Stage Object Detection

被引:17
|
作者
Li, Wuyang [1 ]
Chen, Zhen [1 ]
Li, Baopu [2 ]
Zhang, Dingwen [3 ]
Yuan, Yixuan [1 ]
机构
[1] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
[2] Baidu Res, Sunnyvale, CA 94089 USA
[3] Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Task analysis; Proposals; Object detection; Feature extraction; Cognition; Location awareness; task-decoupled framework;
D O I
10.1109/TIP.2021.3126423
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Decoupling the sibling head has recently shown great potential in relieving the inherent task-misalignment problem in two-stage object detectors. However, existing works design similar structures for the classification and regression, ignoring task-specific characteristics and feature demands. Besides, the shared knowledge that may benefit the two branches is neglected, leading to potential excessive decoupling and semantic inconsistency. To address these two issues, we propose Heterogeneous task decoupling (HTD) framework for object detection, which utilizes a Progressive Graph (PGraph) module and a Border-aware Adaptation (BA) module for task-decoupling. Specifically, we first devise a Semantic Feature Aggregation (SFA) module to aggregate global semantics with image-level supervision, serving as the shared knowledge for the task-decoupled framework. Then, the PGraph module performs progressive graph reasoning, including local spatial aggregation and global semantic interaction, to enhance semantic representations of region proposals for classification. The proposed BA module integrates multi-level features adaptively, focusing on the low-level border activation to obtain representations with spatial and border perception for regression. Finally, we utilize the aggregated knowledge from SFA to keep the instance-level semantic consistency (ISC) of decoupled frameworks. Extensive experiments demonstrate that HTD outperforms existing detection works by a large margin, and achieves single-model 50.4%AP and 33.2% AP(s) on COCO test-dev set using ResNet-101-DCN backbone, which is the best entry among state-of-the-arts under the same configuration. Our code is available at https://github.com/CityU-AIM-Group/HTD.
引用
收藏
页码:9456 / 9469
页数:14
相关论文
共 50 条
  • [1] Decoupling and Interaction: task coordination in single-stage object detection
    Ma J.-W.
    Tian S.
    Man H.
    Chen S.-L.
    Qin J.
    Yin X.-C.
    Multimedia Tools and Applications, 2025, 84 (10) : 8149 - 8178
  • [2] Two-stage Co-salient Object Detection
    Wang, Zuyi
    Zhang, Lihe
    2017 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION (ICICTA 2017), 2017, : 287 - 290
  • [3] Two-Stage Unattended Object Detection Method with Proposals
    Nam Trung Pham
    Leman, Karianto
    Zhang, Jie
    Pek, Isaac
    2017 IEEE 2ND INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP), 2017, : 1 - 4
  • [4] Two-Stage Approach to Small-Object Detection
    Yu, Mingrui
    Leung, Henry
    SYSTEMS ENGINEERING, 2025,
  • [5] Salient Object Detection via Two-Stage Graphs
    Liu, Yi
    Han, Jungong
    Zhang, Qiang
    Wang, Long
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (04) : 1023 - 1037
  • [6] SaliencyRank: Two-stage manifold ranking for salient object detection
    Qi W.
    Cheng M.-M.
    Borji A.
    Lu H.
    Bai L.-F.
    Computational Visual Media, 2015, 1 (4) : 309 - 320
  • [7] TWO-STAGE ABSORBING MARKOV CHAIN FOR SALIENT OBJECT DETECTION
    Zhang, Qing
    Luo, Desi
    Li, Wenju
    Shi, Yanjiao
    Lin, Jiajun
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 895 - 899
  • [8] FPCB Surface Defect Detection: A Decoupled Two-Stage Object Detection Framework
    Luo, Jiaxiang
    Yang, Zhiyu
    Li, Shipeng
    Wu, Yilin
    IEEE Transactions on Instrumentation and Measurement, 2021, 70
  • [9] FPCB Surface Defect Detection: A Decoupled Two-Stage Object Detection Framework
    Luo, Jiaxiang
    Yang, Zhiyu
    Li, Shipeng
    Wu, Yilin
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70
  • [10] OccTr: A Two-Stage BEV Fusion Network for Temporal Object Detection
    Fu, Qifang
    Yu, Xinyi
    Ou, Linlin
    ELECTRONICS, 2024, 13 (13)