Nearshore optical video object detector based on temporal branch and spatial feature enhancement

被引:0
|
作者
Zhao, Yuanlin [1 ]
Li, Wei [2 ]
Ding, Jiangang [1 ]
Wang, Yansong [1 ]
Pei, Lili [2 ]
Tian, Aojia [1 ]
机构
[1] Changan Univ, Sch Informat Engn, Xian 710064, Shaanxi, Peoples R China
[2] Changan Univ, Sch Data Sci & Artificial Intelligence, Xian 710064, Shaanxi, Peoples R China
关键词
Optical video object detection; Temporal branch; Fast re-parameterization network; Spatial feature enhancement; Intelligent nearshore transportation;
D O I
10.1016/j.engappai.2024.109387
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The computing power of nearshore and ship-borne devices is limited, posing significant challenges for accurately detecting objects in real-time on such devices. We propose a nearshore video object detector (NVID) to tackle these challenges. Considering the abundance of dynamic entities in the nearshore environment, we have developed you can look more (YCLM) to perceive the temporal characteristics of these objects. Furthermore, to improve the ability to detect objects of different sizes of networks, we designed parallel deformable attention (PDA) based on the spatial features of objects. More importantly, we developed fast reparameterization convolution (FREConv) and faster conv (FConv). Building on these innovations, we proposed a fast re-parameterization network (FRENet) specifically tailored to produce low-parameter, multi-scale feature outputs. With end-to-end training, our pipeline outperforms other state-of-the-art (SOTA) methods on the nearshore objects (NearshoreObjects) dataset (90.4 average precision (AP) 50 (+4.7), parameters (Params) (-1.0M), 24.8 frames per second (FPS) (Jetson Nano) (+0.6). In addition, NVID also achieved excellent results in the on board (OnBoard) dataset (90.3 AP50 (+2.8), 9.3 params (-1.0M), 26.5 FPS (Jetson Nano) (+0.8)). The source code can be accessed at https://github.com/Yuanlin-Zhao/NVID.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Low Light Video Enhancement Based on Temporal-Spatial Complementary Feature
    Zhang, Gengchen
    Zeng, Yuhang
    Fu, Ying
    ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 368 - 379
  • [2] Video object segmentation based on temporal frame context information fusion and feature enhancement
    Hou, Zhiqiang
    Li, Fucheng
    Wang, Shuiyuan
    Dai, Nan
    Ma, Sugang
    Fan, Jiulun
    APPLIED INTELLIGENCE, 2023, 53 (06) : 6496 - 6510
  • [3] Video object segmentation based on temporal frame context information fusion and feature enhancement
    Zhiqiang Hou
    Fucheng Li
    Shuiyuan Wang
    Nan Dai
    Sugang Ma
    Jiulun Fan
    Applied Intelligence, 2023, 53 : 6496 - 6510
  • [4] Weakly supervised video anomaly detection based on spatial–temporal feature fusion enhancement
    Weijie Liang
    Jianming Zhang
    Yongzhao Zhan
    Signal, Image and Video Processing, 2024, 18 : 1111 - 1118
  • [5] Multilevel Spatial-Temporal Feature Aggregation for Video Object Detection
    Xu, Chao
    Zhang, Jiangning
    Wang, Mengmeng
    Tian, Guanzhong
    Liu, Yong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 7809 - 7820
  • [6] SPATIAL-TEMPORAL FEATURE AGGREGATION NETWORK FOR VIDEO OBJECT DETECTION
    Chen, Zhu
    Li, Weihai
    Fei, Chi
    Liu, Bin
    Yu, Nenghai
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1858 - 1862
  • [7] Self-supervised spatial-temporal feature enhancement for one-shot video object detection
    Yao, Xudong
    Yang, Xiaoshan
    NEUROCOMPUTING, 2024, 601
  • [8] Joint Spatial and Temporal Feature Enhancement Network for Disturbed Object Detection
    Zhang, Fan
    Ji, Hongbing
    Zhang, Yongquan
    Zhu, Zhigang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (12) : 12258 - 12273
  • [9] Digital Video Steganalysis Based on a Spatial Temporal Detector
    Su, Yuting
    Yu, Fan
    Zhang, Chengqian
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2017, 11 (01): : 360 - 373
  • [10] Temporal Feature Enhancement Network with External Memory for Object Detection in Surveillance Video
    Fujitake, Masato
    Sugimoto, Akihiro
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7684 - 7691