HTD: Heterogeneous Task Decoupling for Two-Stage Object Detection

被引:17
|
作者
Li, Wuyang [1 ]
Chen, Zhen [1 ]
Li, Baopu [2 ]
Zhang, Dingwen [3 ]
Yuan, Yixuan [1 ]
机构
[1] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
[2] Baidu Res, Sunnyvale, CA 94089 USA
[3] Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Task analysis; Proposals; Object detection; Feature extraction; Cognition; Location awareness; task-decoupled framework;
D O I
10.1109/TIP.2021.3126423
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Decoupling the sibling head has recently shown great potential in relieving the inherent task-misalignment problem in two-stage object detectors. However, existing works design similar structures for the classification and regression, ignoring task-specific characteristics and feature demands. Besides, the shared knowledge that may benefit the two branches is neglected, leading to potential excessive decoupling and semantic inconsistency. To address these two issues, we propose Heterogeneous task decoupling (HTD) framework for object detection, which utilizes a Progressive Graph (PGraph) module and a Border-aware Adaptation (BA) module for task-decoupling. Specifically, we first devise a Semantic Feature Aggregation (SFA) module to aggregate global semantics with image-level supervision, serving as the shared knowledge for the task-decoupled framework. Then, the PGraph module performs progressive graph reasoning, including local spatial aggregation and global semantic interaction, to enhance semantic representations of region proposals for classification. The proposed BA module integrates multi-level features adaptively, focusing on the low-level border activation to obtain representations with spatial and border perception for regression. Finally, we utilize the aggregated knowledge from SFA to keep the instance-level semantic consistency (ISC) of decoupled frameworks. Extensive experiments demonstrate that HTD outperforms existing detection works by a large margin, and achieves single-model 50.4%AP and 33.2% AP(s) on COCO test-dev set using ResNet-101-DCN backbone, which is the best entry among state-of-the-arts under the same configuration. Our code is available at https://github.com/CityU-AIM-Group/HTD.
引用
收藏
页码:9456 / 9469
页数:14
相关论文
共 50 条
  • [31] A Two-Stage Bayesian Integration Framework for Salient Object Detection on Light Field
    Anzhi Wang
    Minghui Wang
    Xiaoyan Li
    Zetian Mi
    Huan Zhou
    Neural Processing Letters, 2017, 46 : 1083 - 1094
  • [32] Fast Two-Stage 3D Object Detection with Semantic Guidance
    Huang Mang
    Hui Bin
    Liu Zhaoji
    Jin Tianming
    LASER & OPTOELECTRONICS PROGRESS, 2024, 61 (12)
  • [33] A Progressive Approach to Generic Object Detection: A Two-Stage Framework for Image Recognition
    Aamir, Muhammad
    Rahman, Ziaur
    Abro, Waheed Ahmed
    Bhatti, Uzair Aslam
    Dayo, Zaheer Ahmed
    Ishfaq, Muhammad
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (03): : 6351 - 6373
  • [34] TSF: Two-Stage Sequential Fusion for 3D Object Detection
    Qi, Heng
    Shi, Peicheng
    Liu, Zhiqiang
    Yang, Aixi
    IEEE SENSORS JOURNAL, 2022, 22 (12) : 12163 - 12172
  • [35] PETDet: Proposal Enhancement for Two-Stage Fine-Grained Object Detection
    Li, Wentao
    Zhao, Danpei
    Yuan, Bo
    Gao, Yue
    Shi, Zhenwei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 14
  • [36] A New Two-Stage Object Detection Network without RoI-Pooling
    Yang, Chao
    Chen, Weihai
    Chen, Peter C. Y.
    Amezquita, Kendrick S.
    Wu, Xingming
    PROCEEDINGS OF THE 30TH CHINESE CONTROL AND DECISION CONFERENCE (2018 CCDC), 2018, : 1680 - 1685
  • [37] TSFF: a two-stage fusion framework for 3D object detection
    Jiang, Guoqing
    Li, Saiya
    Huang, Ziyu
    Cai, Guorong
    Su, Jinhe
    PeerJ Computer Science, 2024, 10
  • [38] Rethinking Label Assignment and Sampling Strategies for Two-Stage Oriented Object Detection
    Qian, Jinjin
    Liu, Chengxu
    Ai, Yubin
    Zhao, Guoshuai
    Qian, Xueming
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2025, 22
  • [39] HETEROGENEOUS IMAGE CHANGE DETECTION BASED ON TWO-STAGE JOINT FEATURE LEARNING
    Han, Te
    Tang, Yuqi
    Chen, Yuzeng
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 3215 - 3218
  • [40] Two-Stage Saliency Fusion for Object Segmentation
    Sun, Guangling
    Ren, Jingru
    Liu, Zhi
    Shan, Wei
    IMAGE AND GRAPHICS (ICIG 2017), PT I, 2017, 10666 : 139 - 148