A dual-balanced network for long-tail distribution object detection

被引:0
|
作者
Gong, Huiyun [1 ]
Li, Yeguang [2 ]
Dong, Jian [1 ,3 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
[2] Management Changchun Univ Technol, Sch Econ, Jilin, Peoples R China
[3] China Elect Standardizat Inst, Beijing, Peoples R China
关键词
computer vision; learning (artificial intelligence); object detection; SMOTE;
D O I
10.1049/cvi2.12182
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object detection on datasets with imbalanced distributions (i.e. long-tail distributions) dataset is a significantly challenging task. Some re-balancing solutions, such as re-weighting and re-sampling have two main disadvantages. First, re-balancing strategies only utilise a coarse-grained global threshold to suppress some of the most influential categories, while overlooking locally influential categories. Second, very few studies have specifically designed algorithms for object detection tasks under long-tail distribution. To address these two issues, a dual-balanced network for fine-grained re-balancing object detection is proposed. Our re-balancing strategies are both in proposal and classification logic, corresponding to two sub-networks, the Balance Region Proposal Network (BRPN) and the Balance Classification Network (BCN). The BRPN sub-network equalises the number of proposals in the background and foreground by reducing the sampling probability of simple backgrounds, and the BCN sub-network equalises the logic between head and tail categories by globally suppressing negative gradients and locally fixing the over-suppressed negative gradients. In addition, the authors advise a balance binary cross entropy loss to jointly re-balance the entire network. This design can be generalised to different two-stage object detection frameworks. The experimental mAP result of 26.40% on this LVIS-v0.5 dataset outperforms most SOTA methods.
引用
收藏
页码:565 / 575
页数:11
相关论文
共 50 条
  • [31] Long-tail image captioning with dynamic semantic memory network
    Liu, Hao
    Yang, Xiaoshan
    Xu, Changsheng
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2022, 48 (08): : 1399 - 1408
  • [32] FLRF: Federated recommendation optimization for long-tail data distribution
    Gong, Zaigang
    Chen, Siyu
    Dai, Qiangsheng
    Feng, Ying
    Zhang, Jinghui
    ARRAY, 2024, 24
  • [33] Fiber-Optic Four-Channel Dual-Balanced Heterodyne Phase Detection
    Shi Jianbo
    Zhang Juan
    Liu Dean
    CHINESE JOURNAL OF LASERS-ZHONGGUO JIGUANG, 2019, 46 (09):
  • [34] Adaptive Embedding and Distribution Re-margin for Long-Tail Recognition
    Su, Yulin
    Chen, Boan
    Feng, Ziming
    Yan, Junchi
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VIII, 2023, 14261 : 38 - 50
  • [35] One-Shot Learning for Long-Tail Visual Relation Detection
    Wang, Weitao
    Wang, Meng
    Wang, Sen
    Long, Guodong
    Yao, Lina
    Qi, Guilin
    Chen, Yang
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12225 - 12232
  • [36] A Dual Heterogeneous Graph Attention Network to Improve Long-Tail Performance for Shop Search in E-Commerce
    Niu, Xichuan
    Li, Bofang
    Li, Chenliang
    Xiao, Rong
    Sun, Haochuan
    Deng, Hongbo
    Chen, Zhenzhong
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 3405 - 3415
  • [37] Empowering Long-tail Item Recommendation through Cross Decoupling Network (CDN)
    Zhang, Yin
    Wang, Ruoxi
    Cheng, Derek Zhiyuan
    Yao, Tiansheng
    Yi, Xinyang
    Hong, Lichan
    Caverlee, James
    Chi, Ed H.
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 5608 - 5617
  • [38] Fitting mixtures of exponentials to long-tail distributions to analyze network performance models
    Feldmann, A
    Whitt, W
    PERFORMANCE EVALUATION, 1998, 31 (3-4) : 245 - 279
  • [39] Fitting mixtures of exponentials to long-tail distributions to analyze network performance models
    Feldmann, A
    Whitt, W
    IEEE INFOCOM '97 - THE CONFERENCE ON COMPUTER COMMUNICATIONS, PROCEEDINGS, VOLS 1-3: SIXTEENTH ANNUAL JOINT CONFERENCE OF THE IEEE COMPUTER AND COMMUNICATIONS SOCIETIES - DRIVING THE INFORMATION REVOLUTION, 1997, : 1096 - 1104
  • [40] Long-tail Hashtag Recommendation for Micro-videos with Graph Convolutional Network
    Li, Mengmeng
    Gan, Tian
    Liu, Meng
    Cheng, Zhiyong
    Yin, Jianhua
    Nie, Liqiang
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 509 - 518