Nearshore optical video object detector based on temporal branch and spatial feature enhancement

Cited by: 0

Authors:

Zhao, Yuanlin ^{[1]}

Li, Wei ^{[2]}

Ding, Jiangang ^{[1]}

Wang, Yansong ^{[1]}

Pei, Lili ^{[2]}

Tian, Aojia ^{[1]}

Affiliations:

[1] Changan Univ, Sch Informat Engn, Xian 710064, Shaanxi, Peoples R China

[2] Changan Univ, Sch Data Sci & Artificial Intelligence, Xian 710064, Shaanxi, Peoples R China

Source:

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2024 / Vol. 138

Keywords:

Optical video object detection; Temporal branch; Fast re-parameterization network; Spatial feature enhancement; Intelligent nearshore transportation;

DOI:

10.1016/j.engappai.2024.109387

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

The computing power of nearshore and ship-borne devices is limited, which makes accurate real-time object detection on such devices challenging. We propose a nearshore video object detector (NVID) to tackle this challenge. Because the nearshore environment contains many dynamic entities, we developed You Can Look More (YCLM) to perceive the temporal characteristics of these objects. Furthermore, to improve the network's ability to detect objects of different sizes, we designed parallel deformable attention (PDA) based on the spatial features of objects. More importantly, we developed fast re-parameterization convolution (FREConv) and faster conv (FConv). Building on these components, we propose a fast re-parameterization network (FRENet) tailored to produce low-parameter, multi-scale feature outputs. With end-to-end training, our pipeline outperforms other state-of-the-art (SOTA) methods on the nearshore objects (NearshoreObjects) dataset (90.4 average precision (AP50) (+4.7), parameters (Params) (-1.0M), 24.8 frames per second (FPS) on Jetson Nano (+0.6)). NVID also achieves excellent results on the on board (OnBoard) dataset (90.3 AP50 (+2.8), 9.3 Params (-1.0M), 26.5 FPS on Jetson Nano (+0.8)). The source code can be accessed at https://github.com/Yuanlin-Zhao/NVID.


Pages: 19

Related items: 50 in total

[1] Low Light Video Enhancement Based on Temporal-Spatial Complementary Feature
Zhang, Gengchen
Zeng, Yuhang
Fu, Ying
ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 368 - 379
[2] Video object segmentation based on temporal frame context information fusion and feature enhancement
Hou, Zhiqiang
Li, Fucheng
Wang, Shuiyuan
Dai, Nan
Ma, Sugang
Fan, Jiulun
APPLIED INTELLIGENCE, 2023, 53 (06) : 6496 - 6510
[3] Video object segmentation based on temporal frame context information fusion and feature enhancement
Zhiqiang Hou
Fucheng Li
Shuiyuan Wang
Nan Dai
Sugang Ma
Jiulun Fan
Applied Intelligence, 2023, 53 : 6496 - 6510
[4] Weakly supervised video anomaly detection based on spatial–temporal feature fusion enhancement
Weijie Liang
Jianming Zhang
Yongzhao Zhan
Signal, Image and Video Processing, 2024, 18 : 1111 - 1118
[5] Multilevel Spatial-Temporal Feature Aggregation for Video Object Detection
Xu, Chao
Zhang, Jiangning
Wang, Mengmeng
Tian, Guanzhong
Liu, Yong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 7809 - 7820
[6] SPATIAL-TEMPORAL FEATURE AGGREGATION NETWORK FOR VIDEO OBJECT DETECTION
Chen, Zhu
Li, Weihai
Fei, Chi
Liu, Bin
Yu, Nenghai
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020 : 1858 - 1862
[7] Self-supervised spatial-temporal feature enhancement for one-shot video object detection
Yao, Xudong
Yang, Xiaoshan
NEUROCOMPUTING, 2024, 601
[8] Joint Spatial and Temporal Feature Enhancement Network for Disturbed Object Detection
Zhang, Fan
Ji, Hongbing
Zhang, Yongquan
Zhu, Zhigang
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (12) : 12258 - 12273
[9] Digital Video Steganalysis Based on a Spatial Temporal Detector
Su, Yuting
Yu, Fan
Zhang, Chengqian
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2017, 11 (01) : 360 - 373
[10] Temporal Feature Enhancement Network with External Memory for Object Detection in Surveillance Video
Fujitake, Masato
Sugimoto, Akihiro
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021 : 7684 - 7691
