UWV-Yolox: A Deep Learning Model for Underwater Video Object Detection

Cited by: 6
Authors
Pan, Haixia [1]
Lan, Jiahua [1]
Wang, Hongqiang [1]
Li, Yanan [1]
Zhang, Meng [1]
Ma, Mojie [1]
Zhang, Dongdong [1]
Zhao, Xiaoran [1]
Affiliations
[1] Beihang Univ, Sch Software, Beijing 100191, Peoples R China
Keywords
underwater video; object detection; coordinate attention; loss function; frame-level optimization
DOI
10.3390/s23104859
Chinese Library Classification number
O65 [Analytical Chemistry]
Discipline classification code
070302; 081704
Abstract
Underwater video object detection is a challenging task because underwater videos are of poor quality, suffering from blurriness and low contrast. In recent years, Yolo series models have been widely applied to underwater video object detection. However, these models perform poorly on blurry and low-contrast underwater videos. Additionally, they fail to account for the contextual relationships between frame-level results. To address these challenges, we propose a video object detection model named UWV-Yolox. First, the Contrast Limited Adaptive Histogram Equalization method is used to augment the underwater videos. Then, a new CSP_CA module is proposed by adding Coordinate Attention to the backbone of the model to augment the representations of objects of interest. Next, a new loss function is proposed that includes regression loss and jitter loss. Finally, a frame-level optimization module is proposed to optimize the detection results by utilizing the relationship between neighboring frames in videos, improving the video detection performance. To evaluate the performance of our model, we conduct experiments on the UVODD dataset built in the paper and select mAP@0.5 as the evaluation metric. The mAP@0.5 of the UWV-Yolox model reaches 89.0%, which is 3.2% better than the original Yolox model. Furthermore, compared with other object detection models, the UWV-Yolox model produces more stable predictions for objects, and our improvements can be flexibly applied to other models.
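As a reading aid for the mAP@0.5 metric named in the abstract: under the usual convention, a predicted box counts as a correct detection only if its intersection over union (IoU) with a ground-truth box of the same class is at least 0.5. The sketch below illustrates that matching criterion only. The function names `iou` and `is_true_positive` and the `(x1, y1, x2, y2)` box layout are assumptions made for illustration and are not taken from the paper. Computing the full mAP additionally requires ranking detections by confidence and averaging precision over classes.

```python
def iou(box_a, box_b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2).

    The box layout is an assumption for this sketch, with x1 < x2 and y1 < y2.
    """
    # Corners of the overlapping rectangle, if any.
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])

    # Clamp to zero so disjoint boxes yield no intersection area.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter

    return inter / union if union > 0 else 0.0


def is_true_positive(pred_box, gt_box, threshold=0.5):
    """Hypothetical helper: the IoU >= 0.5 matching rule behind mAP@0.5."""
    return iou(pred_box, gt_box) >= threshold


if __name__ == "__main__":
    ground_truth = (0, 0, 10, 10)
    print(is_true_positive((2, 0, 12, 10), ground_truth))  # overlap 80 / union 120
    print(is_true_positive((5, 0, 15, 10), ground_truth))  # overlap 50 / union 150
```

With these example boxes, the first prediction overlaps the ground truth enough to pass the 0.5 threshold and the second does not.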
Pages: 19