Dynamic multi-scale loss optimization for object detection

Cited by: 0
Authors
Yihao Luo
Xiang Cao
Juntao Zhang
Peng Cheng
Tianjiang Wang
Qi Feng
Affiliations
[1] Huazhong University of Science and Technology, School of Computer Science and Technology
[2] Coolanyp Limited Liability Company
Keywords
Object detection; Multi-scale imbalance; Reinforcement learning; Multi-task
DOI
Not available
Abstract
As deep object detectors continue to improve through advanced model architectures, imbalance problems in the training process have received more attention. Multi-scale detection is a common paradigm in object detection frameworks, yet each scale is treated equally during training. In this paper, we carefully study the objective imbalance of multi-scale detector training. We argue that the losses at different scale levels are neither equally important nor independent. Unlike existing solutions for setting multi-task weights, we dynamically optimize the loss weight of each scale level during training. Specifically, we propose Adaptive Variance Weighting (AVW) to balance the multi-scale loss according to the statistical variance. We then develop a novel Reinforcement Learning Optimization (RLO) to decide the weighting scheme probabilistically during training. The method makes better use of the multi-scale training loss without extra computational complexity or learnable parameters for backpropagation. Without bells and whistles, the proposed method improves ATSS by 0.9 AP on the MS COCO benchmark. It also achieves 82.1 mAP on the Pascal VOC 2007 test set, outperforming other reinforcement-learning-based methods.
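The abstract gives no formula for AVW, so the following is only a rough sketch of the general idea of weighting per-scale losses by their statistical variance. It is not the authors' method: the class name `VarianceLossWeighter`, the sliding `window`, the choice to give higher-variance levels a larger weight, and the normalization to a mean weight of 1 are all assumptions made for illustration. The RLO step is not covered.

```python
from collections import deque
from statistics import pvariance


class VarianceLossWeighter:
    """Hypothetical helper: weights per-level losses by their recent variance."""

    def __init__(self, num_levels, window=50, eps=1e-8):
        # One bounded history of recent loss values per scale level.
        self.histories = [deque(maxlen=window) for _ in range(num_levels)]
        self.eps = eps

    def update(self, level_losses):
        """Record the latest scalar loss of every scale level."""
        for history, loss in zip(self.histories, level_losses):
            history.append(float(loss))

    def weights(self):
        """Return one weight per level; uniform until variance is usable."""
        num_levels = len(self.histories)
        if any(len(h) < 2 for h in self.histories):
            return [1.0] * num_levels
        variances = [pvariance(h) for h in self.histories]
        total = sum(variances)
        if total < self.eps:
            return [1.0] * num_levels
        # Assumption: a level whose loss fluctuates more gets a larger weight.
        # Weights are scaled so that they average to 1 across levels.
        return [num_levels * v / total for v in variances]

    def combine(self, level_losses):
        """Weighted sum of the per-level losses using the current weights."""
        return sum(w * l for w, l in zip(self.weights(), level_losses))
```

In a training loop, one would call `update` with the per-level losses after each iteration and use `combine` to form the total loss. The weights are computed from loss statistics alone, so the sketch adds no learnable parameters, which is in the spirit of the claim in the abstract.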
Pages: 2349-2367
Page count: 18