Scale-aware token-matching for transformer-based object detector

Cited by: 1
Authors
Jung, Aecheon [1]
Hong, Sungeun [1]
Hyun, Yoonsuk [2]
Affiliations
[1] Sungkyunkwan Univ, Dept Immers Media Engn, Seoul, South Korea
[2] Inha Univ, Dept Math, Incheon, South Korea
Funding
National Research Foundation of Singapore;
Keywords
Vision transformer; Object detection; Small object detection;
DOI
10.1016/j.patrec.2024.08.006
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Owing to advances in deep learning, object detection has made significant progress in estimating the positions and classes of multiple objects within an image. However, detecting objects of various scales within a single image remains a challenging problem. In this study, we propose scale-aware token matching for predicting the positions and classes of objects in transformer-based object detection. Unlike previous methods, which match detection tokens with ground truth without considering scale during training, we train the model by matching detection tokens with ground-truth objects according to object size. We divide a single set of detection tokens into multiple sets based on scale and match each token set differently with the ground truth, which trains the model without additional computational cost. The experimental results demonstrate that scale information can be assigned to tokens. Using a novel loss function, scale-aware tokens can independently learn scale-specific information, which improves detection performance on small objects.
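The idea of splitting the detection tokens by scale and matching each subset separately can be illustrated with a small sketch. This record does not give the partition rule, the matching cost, or the loss, so everything below is an assumption made for illustration: the COCO-style area thresholds, the plain L1 box cost, the brute-force assignment (standing in for the Hungarian matching commonly used in DETR-style detectors), and the hypothetical names `scale_bucket`, `match_group`, and `scale_aware_match`.

```python
from itertools import permutations

# Assumed area thresholds (COCO-style small / medium / large); not from the paper.
AREA_THRESHOLDS = (32 ** 2, 96 ** 2)


def box_area(box):
    """Area of an (x0, y0, x1, y1) box."""
    x0, y0, x1, y1 = box
    return max(0.0, x1 - x0) * max(0.0, y1 - y0)


def scale_bucket(box, thresholds=AREA_THRESHOLDS):
    """Scale bucket of a box: 0 = small, 1 = medium, 2 = large."""
    area = box_area(box)
    return sum(area >= t for t in thresholds)


def l1_cost(pred, gt):
    """Assumed matching cost: L1 distance between box coordinates."""
    return sum(abs(p - g) for p, g in zip(pred, gt))


def match_group(pred_boxes, token_ids, gt_boxes, gt_ids):
    """Minimum-cost one-to-one matching inside one scale group.

    Brute force over token choices; only practical for tiny examples.
    """
    if len(gt_ids) > len(token_ids):
        raise ValueError("more ground-truth boxes than tokens in this group")
    best_pairs, best_cost = [], float("inf")
    for chosen in permutations(token_ids, len(gt_ids)):
        cost = sum(l1_cost(pred_boxes[t], gt_boxes[g])
                   for t, g in zip(chosen, gt_ids))
        if cost < best_cost:
            best_cost, best_pairs = cost, list(zip(chosen, gt_ids))
    return best_pairs


def scale_aware_match(pred_boxes, gt_boxes, token_groups):
    """Match each token group only against ground truth of its own scale."""
    pairs = []
    for bucket, token_ids in enumerate(token_groups):
        gt_ids = [g for g, box in enumerate(gt_boxes)
                  if scale_bucket(box) == bucket]
        pairs.extend(match_group(pred_boxes, token_ids, gt_boxes, gt_ids))
    return sorted(pairs)


# Six hypothetical detection tokens, two per scale group.
token_groups = [[0, 1], [2, 3], [4, 5]]
pred_boxes = [
    (0, 0, 50, 50), (0, 0, 12, 12),      # small-object tokens
    (0, 0, 10, 10), (0, 0, 60, 60),      # medium-object tokens
    (0, 0, 190, 190), (0, 0, 100, 100),  # large-object tokens
]
gt_boxes = [(0, 0, 10, 10), (0, 0, 200, 200)]  # one small, one large object

matches = scale_aware_match(pred_boxes, gt_boxes, token_groups)
print(matches)  # (token index, ground-truth index) pairs
```

In this toy input, token 2 predicts the small box exactly but belongs to the medium group, so it is never considered for that object; the small object goes to the closest token in the small group instead. That restriction is the point of the sketch: the matching itself is the same per-group assignment, applied to disjoint token subsets, so no extra tokens or forward passes are needed.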
Pages: 197-202
Number of pages: 6