UWV-Yolox: A Deep Learning Model for Underwater Video Object Detection

被引:6
|
作者
Pan, Haixia [1 ]
Lan, Jiahua [1 ]
Wang, Hongqiang [1 ]
Li, Yanan [1 ]
Zhang, Meng [1 ]
Ma, Mojie [1 ]
Zhang, Dongdong [1 ]
Zhao, Xiaoran [1 ]
机构
[1] Beihang Univ, Sch Software, Beijing 100191, Peoples R China
关键词
underwater video; object detection; coordinate attention; loss function; frame-level optimization;
D O I
10.3390/s23104859
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Underwater video object detection is a challenging task due to the poor quality of underwater videos, including blurriness and low contrast. In recent years, Yolo series models have been widely applied to underwater video object detection. However, these models perform poorly for blurry and low-contrast underwater videos. Additionally, they fail to account for the contextual relationships between the frame-level results. To address these challenges, we propose a video object detection model named UWV-Yolox. First, the Contrast Limited Adaptive Histogram Equalization method is used to augment the underwater videos. Then, a new CSP_CA module is proposed by adding Coordinate Attention to the backbone of the model to augment the representations of objects of interest. Next, a new loss function is proposed, including regression and jitter loss. Finally, a frame-level optimization module is proposed to optimize the detection results by utilizing the relationship between neighboring frames in videos, improving the video detection performance. To evaluate the performance of our model, We construct experiments on the UVODD dataset built in the paper, and select mAP@0.5 as the evaluation metric. The mAP@0.5 of the UWV-Yolox model reaches 89.0%, which is 3.2% better than the original Yolox model. Furthermore, compared with other object detection models, the UWV-Yolox model has more stable predictions for objects, and our improvements can be flexibly applied to other models.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Deep learning for object detection in video
    Lu, Shengyu
    2018 INTERNATIONAL SEMINAR ON COMPUTER SCIENCE AND ENGINEERING TECHNOLOGY (SCSET 2018), 2019, 1176
  • [2] Study of Underwater Fruit Object Detection Using Deep Learning Model
    Aravind, Jinka Venkata
    Prince, Shanthi
    OPTICAL AND WIRELESS TECHNOLOGIES, OWT 2021, 2023, 892 : 381 - 391
  • [3] YOLOX based Underwater Object Detection for Inshore Aquaculture
    Imada, Atsuki
    Katayama, Takafumi
    Song, Tian
    Shimamoto, Takashi
    2022 OCEANS HAMPTON ROADS, 2022,
  • [4] Deep learning model inspired by lateral line system for underwater object detection
    Jeong, Taekyeong
    Yoo, Janggon
    Kim, Daegyoum
    BIOINSPIRATION & BIOMIMETICS, 2022, 17 (02)
  • [5] Deep Learning on Underwater Marine Object Detection: A Survey
    Moniruzzaman, Md.
    Islam, Syed Mohammed Shamsul
    Bennamoun, Mohammed
    Lavery, Paul
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS (ACIVS 2017), 2017, 10617 : 150 - 160
  • [6] Deep Learning Model for Object Detection
    Hristeva, T.
    Marinova, M.
    Lazarov, V.
    PROCEEDINGS OF THE 45TH INTERNATIONAL CONFERENCE ON APPLICATION OF MATHEMATICS IN ENGINEERING AND ECONOMICS (AMEE'19), 2019, 2172
  • [7] Video Object Detection for Tractability with Deep Learning Method
    Tian, Bing
    Li, Liang
    Qu, Yansheng
    Yan, Li
    2017 FIFTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2017, : 397 - 401
  • [8] Underwater Object Detection and Pose Estimation using Deep Learning
    Jeon, MyungHwan
    Lee, Yeongjun
    Shin, Young-Sik
    Jang, Hyesu
    Kim, Ayoung
    IFAC PAPERSONLINE, 2019, 52 (21): : 78 - 81
  • [9] Review of Underwater Image Object Detection Based on Deep Learning
    Luo Y.
    Liu Q.
    Zhang Y.
    Zhou H.
    Zhang J.
    Cao X.
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2023, 45 (10): : 3468 - 3482
  • [10] New Generation Deep Learning for Video Object Detection: A Survey
    Jiao, Licheng
    Zhang, Ruohan
    Liu, Fang
    Yang, Shuyuan
    Hou, Biao
    Li, Lingling
    Tang, Xu
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (08) : 3195 - 3215