HTD: Heterogeneous Task Decoupling for Two-Stage Object Detection

被引:17
|
作者
Li, Wuyang [1 ]
Chen, Zhen [1 ]
Li, Baopu [2 ]
Zhang, Dingwen [3 ]
Yuan, Yixuan [1 ]
机构
[1] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
[2] Baidu Res, Sunnyvale, CA 94089 USA
[3] Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Task analysis; Proposals; Object detection; Feature extraction; Cognition; Location awareness; task-decoupled framework;
D O I
10.1109/TIP.2021.3126423
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Decoupling the sibling head has recently shown great potential in relieving the inherent task-misalignment problem in two-stage object detectors. However, existing works design similar structures for the classification and regression, ignoring task-specific characteristics and feature demands. Besides, the shared knowledge that may benefit the two branches is neglected, leading to potential excessive decoupling and semantic inconsistency. To address these two issues, we propose Heterogeneous task decoupling (HTD) framework for object detection, which utilizes a Progressive Graph (PGraph) module and a Border-aware Adaptation (BA) module for task-decoupling. Specifically, we first devise a Semantic Feature Aggregation (SFA) module to aggregate global semantics with image-level supervision, serving as the shared knowledge for the task-decoupled framework. Then, the PGraph module performs progressive graph reasoning, including local spatial aggregation and global semantic interaction, to enhance semantic representations of region proposals for classification. The proposed BA module integrates multi-level features adaptively, focusing on the low-level border activation to obtain representations with spatial and border perception for regression. Finally, we utilize the aggregated knowledge from SFA to keep the instance-level semantic consistency (ISC) of decoupled frameworks. Extensive experiments demonstrate that HTD outperforms existing detection works by a large margin, and achieves single-model 50.4%AP and 33.2% AP(s) on COCO test-dev set using ResNet-101-DCN backbone, which is the best entry among state-of-the-arts under the same configuration. Our code is available at https://github.com/CityU-AIM-Group/HTD.
引用
收藏
页码:9456 / 9469
页数:14
相关论文
共 50 条
  • [21] An Approximation Scheme for Heterogeneous Parallel Task Scheduling in a Two-Stage Hybrid Flow Shop
    Sun, Jinghao
    Meng, Yakun
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2015, 31 (04) : 1291 - 1308
  • [22] Two-stage controller design with integral action and decoupling
    Gundes, AN
    Kabuli, MG
    PROCEEDINGS OF THE 35TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, 1996, : 4637 - 4642
  • [23] Federated two-stage decoupling with adaptive personalization layers
    Hangyu Zhu
    Yuxiang Fan
    Zhenping Xie
    Complex & Intelligent Systems, 2024, 10 : 3657 - 3671
  • [24] Federated two-stage decoupling with adaptive personalization layers
    Zhu, Hangyu
    Fan, Yuxiang
    Xie, Zhenping
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (03) : 3657 - 3671
  • [25] Ghostformer: A GhostNet-Based Two-Stage Transformer for Small Object Detection
    Li, Sijia
    Sultonov, Furkat
    Tursunboev, Jamshid
    Park, Jun-Hyun
    Yun, Sangseok
    Kang, Jae-Mo
    SENSORS, 2022, 22 (18)
  • [26] Two-Stage Object Detection Based on Deep Pruning for Remote Sensing Image
    Wang, Shengsheng
    Wang, Meng
    Zhao, Xin
    Liu, Dong
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2018), PT I, 2018, 11061 : 137 - 147
  • [27] A Two-Stage Bayesian Integration Framework for Salient Object Detection on Light Field
    Wang, Anzhi
    Wang, Minghui
    Li, Xiaoyan
    Mi, Zetian
    Zhou, Huan
    NEURAL PROCESSING LETTERS, 2017, 46 (03) : 1083 - 1094
  • [28] A Two-Stage Foreground Propagation for Moving Object Detection in a Non-Stationary
    Chung, WonTaek
    Kim, YongHyun
    Kim, Yong-Joong
    Kim, DaiJin
    2016 13TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), 2016, : 187 - 193
  • [29] A Two-Stage Model Compression Framework for Object Detection in Autonomous Driving Scenarios
    He, Qiyi
    Xu, Ao
    Ye, Zhiwei
    Zhou, Wen
    Zhang, Yifan
    Xi, Ruijie
    IEEE SENSORS JOURNAL, 2025, 25 (02) : 3735 - 3749
  • [30] TSFF: a two-stage fusion framework for 3D object detection
    Jiang, Guoqing
    Li, Saiya
    Huang, Ziyu
    Cai, Guorong
    Su, Jinhe
    PEERJ COMPUTER SCIENCE, 2024, 10