A dual-balanced network for long-tail distribution object detection

被引:0
|
作者
Gong, Huiyun [1 ]
Li, Yeguang [2 ]
Dong, Jian [1 ,3 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
[2] Management Changchun Univ Technol, Sch Econ, Jilin, Peoples R China
[3] China Elect Standardizat Inst, Beijing, Peoples R China
关键词
computer vision; learning (artificial intelligence); object detection; SMOTE;
D O I
10.1049/cvi2.12182
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object detection on datasets with imbalanced distributions (i.e. long-tail distributions) dataset is a significantly challenging task. Some re-balancing solutions, such as re-weighting and re-sampling have two main disadvantages. First, re-balancing strategies only utilise a coarse-grained global threshold to suppress some of the most influential categories, while overlooking locally influential categories. Second, very few studies have specifically designed algorithms for object detection tasks under long-tail distribution. To address these two issues, a dual-balanced network for fine-grained re-balancing object detection is proposed. Our re-balancing strategies are both in proposal and classification logic, corresponding to two sub-networks, the Balance Region Proposal Network (BRPN) and the Balance Classification Network (BCN). The BRPN sub-network equalises the number of proposals in the background and foreground by reducing the sampling probability of simple backgrounds, and the BCN sub-network equalises the logic between head and tail categories by globally suppressing negative gradients and locally fixing the over-suppressed negative gradients. In addition, the authors advise a balance binary cross entropy loss to jointly re-balance the entire network. This design can be generalised to different two-stage object detection frameworks. The experimental mAP result of 26.40% on this LVIS-v0.5 dataset outperforms most SOTA methods.
引用
收藏
页码:565 / 575
页数:11
相关论文
共 50 条
  • [21] Long-tail Detection with Effective Class-Margins
    Hyun Cho, Jang
    Krähenbühl, Philipp
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2022, 13668 LNCS : 698 - 714
  • [22] A unified and costless approach for improving small and long-tail object detection in aerial images of traffic scenarios
    Zhongxia Xiong
    Tao Song
    Shan He
    Ziying Yao
    Xinkai Wu
    Applied Intelligence, 2023, 53 : 14426 - 14447
  • [23] Complementary expert balanced learning for long-tail cross-modal retrieval
    Liu, Peifang
    Liu, Xueliang
    MULTIMEDIA SYSTEMS, 2024, 30 (02)
  • [24] Complementary expert balanced learning for long-tail cross-modal retrieval
    Peifang Liu
    Xueliang Liu
    Multimedia Systems, 2024, 30
  • [25] C2AM Loss: Chasing a Better Decision Boundary for Long-Tail Object Detection
    Wang, Tong
    Zhu, Yousong
    Chen, Yingying
    Zhao, Chaoyang
    Yu, Bin
    Wang, Jinqiao
    Tang, Ming
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6970 - 6979
  • [26] A unified and costless approach for improving small and long-tail object detection in aerial images of traffic scenarios
    Xiong, Zhongxia
    Song, Tao
    He, Shan
    Yao, Ziying
    Wu, Xinkai
    APPLIED INTELLIGENCE, 2023, 53 (11) : 14426 - 14447
  • [27] ON THE LONG-TAIL SOLAR-WIND ELECTRON VELOCITY DISTRIBUTION
    SHLESINGER, MF
    COPLAN, MA
    JOURNAL OF STATISTICAL PHYSICS, 1988, 52 (5-6) : 1423 - 1428
  • [28] Distribution Alignment: A Unified Framework for Long-tail Visual Recognition
    Zhang, Songyang
    Li, Zeming
    Yan, Shipeng
    He, Xuming
    Sun, Jian
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2361 - 2370
  • [29] A pest image recognition method for long-tail distribution problem
    Chen, Shengbo
    Gao, Quan
    He, Yun
    FRONTIERS IN ENVIRONMENTAL SCIENCE, 2024, 12
  • [30] ADDRESSING DATA ACCESS NEEDS OF THE LONG-TAIL DISTRIBUTION OF GEOSCIENTISTS
    Malik, Tanu
    Foster, Ian
    2012 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2012, : 5348 - 5351