A Multi-Scale Learnable Feature Alignment Network for Video Object Detection

Cited by: 0
Authors
Wang, Rui [1]
Affiliations
[1] Beijing Univ Technol, Comp Coll, Beijing, Peoples R China
Keywords
Object detection; Deep convolutional neural network (DCNN); Feature propagation; Feature fusion
DOI
10.1109/MASS62177.2024.00078
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Object detection is a core computer vision task: detecting instances of visual objects of a given class in digital images. Video object detection aims to locate one or more objects in a sequence of images and assign a category label to each. Because the two tasks are similar, image object detection methods are often applied to video. However, motion blur, occlusion, morphological diversity, and illumination changes make video object detection more demanding. Within the framework of video object detection based on feature reuse and recursive fusion, we propose a multi-scale learnable feature alignment (MLFA) network. MLFA divides video frames into key frames and non-key frames and propagates a memory feature containing historical key-frame information along the time dimension; this memory feature compensates the current frame's feature through feature fusion. During alignment, a feature pyramid is built first, and the alignment features at each level are then learned. Features from the different levels are then fused to exploit multi-scale information. MLFA maintains efficiency while further improving detection accuracy.
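The data flow described in the abstract can be illustrated with a toy NumPy sketch. This is not the authors' implementation: the function names (`build_pyramid`, `align_multiscale`, `propagate`), the 2x2 average pooling, the fixed integer offsets standing in for learned alignment parameters, the averaging across levels, and the values of `key_interval` and `alpha` are all assumptions made for illustration. Feature maps are single-channel 2D arrays whose sides are divisible by 2^(levels-1).

```python
import numpy as np

def build_pyramid(feat, levels=3):
    """Build a feature pyramid by repeated 2x2 average pooling (assumed scheme)."""
    pyramid = [feat]
    for _ in range(levels - 1):
        f = pyramid[-1]
        h, w = f.shape  # both sides are assumed to be even
        pyramid.append(f.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3)))
    return pyramid

def align_multiscale(memory, offsets):
    """Align the memory feature at every pyramid level, then fuse the levels.

    offsets: one (dy, dx) integer shift per level. In a trained network these
    would be learned; here they are fixed, hypothetical stand-ins.
    """
    pyramid = build_pyramid(memory, levels=len(offsets))
    aligned = []
    for level, (feat, (dy, dx)) in enumerate(zip(pyramid, offsets)):
        shifted = np.roll(feat, shift=(dy, dx), axis=(0, 1))
        scale = 2 ** level
        # Nearest-neighbour upsampling back to the full resolution.
        aligned.append(np.repeat(np.repeat(shifted, scale, axis=0), scale, axis=1))
    # Fuse the levels by simple averaging (an assumed fusion rule).
    return np.mean(aligned, axis=0)

def propagate(frames, offsets=((0, 0), (0, 0), (0, 0)), key_interval=4, alpha=0.5):
    """Propagate a memory feature through time and fuse it with each frame."""
    memory = None
    outputs = []
    for t, feat in enumerate(frames):
        if memory is None:
            fused = feat.copy()  # the first frame initialises the memory
        else:
            fused = alpha * align_multiscale(memory, offsets) + (1 - alpha) * feat
        if t % key_interval == 0:
            memory = fused  # the memory is updated only at key frames
        outputs.append(fused)
    return outputs
```

In this sketch every frame receives the aligned memory, but only key frames write back to it, which is one simple way to realise recursive fusion with feature reuse.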
Pages: 496-501
Page count: 6
Related papers
50 in total
  • [1] Multi-Scale Feature Selective Matching Network for Object Detection
    Pei, Yuanhua
    Dong, Yongsheng
    Zheng, Lintao
    Ma, Jinwen
    MATHEMATICS, 2023, 11 (12)
  • [2] A multi-scale feature representation and interaction network for underwater object detection
    Yuan, Jiaojiao
    Hu, Yongli
    Sun, Yanfeng
    Yin, Baocai
    IET COMPUTER VISION, 2023, 17 (03) : 265 - 281
  • [3] Pyramid attention object detection network with multi-scale feature fusion
    Chen, Xiu
    Li, Yujie
    Nakatoh, Yoshihisa
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 104
  • [4] Multi-Scale Residual Aggregation Feature Pyramid Network for Object Detection
    Wang, Hongyang
    Wang, Tiejun
    ELECTRONICS, 2023, 12 (01)
  • [5] Multi-Scale Object Detection Using Feature Fusion Recalibration Network
    Guo, Ziyuan
    Zhang, Weimin
    Liang, Zhenshuo
    Shi, Yongliang
    Huang, Qiang
    IEEE ACCESS, 2020, 8 : 51664 - 51673
  • [6] MDFN: Multi-scale deep feature learning network for object detection
    Ma, Wenchi
    Wu, Yuanwei
    Cen, Feng
    Wang, Guanghui
    PATTERN RECOGNITION, 2020, 100
  • [7] MULTI-SCALE OBJECT DETECTION WITH FEATURE FUSION AND REGION OBJECTNESS NETWORK
    Guan, Wenjie
    Zou, YueXian
    Zhou, Xiaoqun
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2596 - 2600
  • [8] Feature Enhancement for Multi-scale Object Detection
    Huicheng Zheng
    Jiajie Chen
    Lvran Chen
    Ye Li
    Zhiwei Yan
    Neural Processing Letters, 2020, 51 : 1907 - 1919
  • [9] Feature Enhancement for Multi-scale Object Detection
    Zheng, Huicheng
    Chen, Jiajie
    Chen, Lvran
    Li, Ye
    Yan, Zhiwei
    NEURAL PROCESSING LETTERS, 2020, 51 (02) : 1907 - 1919
  • [10] Multi-scale feature aggregation and boundary awareness network for salient object detection
    Wu, Qin
    Wang, Jianzhe
    Chai, Zhilei
    Guo, Guodong
    IMAGE AND VISION COMPUTING, 2022, 122