HTD: Heterogeneous Task Decoupling for Two-Stage Object Detection

被引:17
|
作者
Li, Wuyang [1 ]
Chen, Zhen [1 ]
Li, Baopu [2 ]
Zhang, Dingwen [3 ]
Yuan, Yixuan [1 ]
机构
[1] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
[2] Baidu Res, Sunnyvale, CA 94089 USA
[3] Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Task analysis; Proposals; Object detection; Feature extraction; Cognition; Location awareness; task-decoupled framework;
D O I
10.1109/TIP.2021.3126423
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Decoupling the sibling head has recently shown great potential in relieving the inherent task-misalignment problem in two-stage object detectors. However, existing works design similar structures for the classification and regression, ignoring task-specific characteristics and feature demands. Besides, the shared knowledge that may benefit the two branches is neglected, leading to potential excessive decoupling and semantic inconsistency. To address these two issues, we propose Heterogeneous task decoupling (HTD) framework for object detection, which utilizes a Progressive Graph (PGraph) module and a Border-aware Adaptation (BA) module for task-decoupling. Specifically, we first devise a Semantic Feature Aggregation (SFA) module to aggregate global semantics with image-level supervision, serving as the shared knowledge for the task-decoupled framework. Then, the PGraph module performs progressive graph reasoning, including local spatial aggregation and global semantic interaction, to enhance semantic representations of region proposals for classification. The proposed BA module integrates multi-level features adaptively, focusing on the low-level border activation to obtain representations with spatial and border perception for regression. Finally, we utilize the aggregated knowledge from SFA to keep the instance-level semantic consistency (ISC) of decoupled frameworks. Extensive experiments demonstrate that HTD outperforms existing detection works by a large margin, and achieves single-model 50.4%AP and 33.2% AP(s) on COCO test-dev set using ResNet-101-DCN backbone, which is the best entry among state-of-the-arts under the same configuration. Our code is available at https://github.com/CityU-AIM-Group/HTD.
引用
收藏
页码:9456 / 9469
页数:14
相关论文
共 50 条
  • [41] Task switching as a two-stage decision process
    Sinha, N
    Brown, JTG
    Carpenter, RHS
    JOURNAL OF NEUROPHYSIOLOGY, 2006, 95 (05) : 3146 - 3153
  • [42] Analysis of two-stage heterogeneous separation processes
    Sridhar, LN
    AICHE JOURNAL, 1996, 42 (10) : 2761 - 2764
  • [43] Design and decoupling optimization of two-stage vibration isolation system
    Sun, Yuhua
    Dong, Dawei
    Yan, Bing
    Pan, Yanliang
    Wu, Junda
    Zhendong Ceshi Yu Zhenduan/Journal of Vibration, Measurement and Diagnosis, 2014, 34 (02): : 361 - 365
  • [44] TSFNet: Two-Stage Fusion Network for RGB-T Salient Object Detection
    Guo, Qinling
    Zhou, Wujie
    Lei, Jingsheng
    Yu, Lu
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1655 - 1659
  • [45] Lightweight two-stage transformer for low-light image enhancement and object detection
    Kou, Kangkang
    Yin, Xiangchen
    Gao, Xin
    Nie, Fuhui
    Liu, Jing
    Zhang, Guoying
    DIGITAL SIGNAL PROCESSING, 2024, 150
  • [46] Hierarchical Two-stage modal fusion for Triple-modality salient object detection
    Wen, Hongwei
    Song, Kechen
    Huang, Liming
    Wang, Han
    Wang, Junyi
    Yan, Yunhui
    MEASUREMENT, 2023, 218
  • [47] TS-BiT: Two-Stage Binary Transformer for ORSI Salient Object Detection
    Zhang, Jinfeng
    Liu, Tianpeng
    Zhang, Jiehua
    Liu, Li
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2025, 22
  • [48] Two-stage 3D object detection guided by position encoding q
    Xu, Wanpeng
    Zou, Ling
    Fu, Zhipeng
    Wu, Lingda
    Qi, Yue
    NEUROCOMPUTING, 2022, 501 : 811 - 821
  • [49] Two-stage local attention network for salient object detection in remote sensing images
    Lin, Qihui
    Xia, Lurui
    Li, Sen
    Chen, Wanfeng
    IET IMAGE PROCESSING, 2023, 17 (03) : 849 - 861
  • [50] Two-stage multiple object detection using CNN and correlative filter for accuracy improvement
    Kinasih, Fabiola Maria Teresa Retno
    Machbub, Carmadi
    Yulianti, Lenni
    Rohman, Arief Syaichu
    HELIYON, 2023, 9 (01)