TFMFT: Transformer-based multiple fish tracking

被引:9
|
作者
Li, Weiran [1 ,2 ,3 ,4 ,5 ]
Liu, Yeqiang [1 ,2 ,3 ,4 ,5 ]
Wang, Wenxu [1 ,2 ,3 ,4 ,5 ]
Li, Zhenbo [1 ,2 ,3 ,4 ,5 ]
Yue, Jun [6 ]
机构
[1] China Agr Univ, Coll Informat & Elect Engn, Beijing 100083, Peoples R China
[2] Minist Agr & Rural Affairs, Natl Innovat Ctr Digital Fishery, Beijing 100083, Peoples R China
[3] Minist Agr & Rural Affairs, Key Lab Agr Informat Acquisit Technol, Beijing 100083, Peoples R China
[4] Beijing Engn & Technol Res Ctr Internet Things Agr, Beijing 100083, Peoples R China
[5] Minist Agr & Rural Affairs, Key Lab Smart Farming Aquat Anim & Livestock, Beijing 100083, Peoples R China
[6] LuDong Univ, Coll Informat & Elect Engn, Yantai 264025, Peoples R China
基金
国家重点研发计划;
关键词
Fish Tracking; Multiple Object Tracking; Transformer; Computer Vision; Deep Learning;
D O I
10.1016/j.compag.2023.108600
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Recent advancements in fish tracking methodologies provide valuable solutions for assessing fish growth, marine fisheries, and biological research. In particular, there has been a burgeoning interest in vision-based methods for fish tracking, owing to the enhanced computational capabilities facilitated by deep learning models. However, these methods face several challenges, including poor fish detection performance under complex backgrounds, the potential for identification switches caused by the non-rigid features and occlusions of fish, and the limited fault tolerance of extant approaches. In this paper, a transformer-based multiple fish tracking model (TFMFT) is proposed, specifically designed to address the issue of instance loss of fish targets in aquaculture ponds with complex background disturbance. In particular, we introduce a Multiple Association (MA) method that bolsters fault tolerance in tracking by synthesizing simple Intersection-over-Union matching in the identification (ID) matching module. Through empirical studies across diverse Transformer-based models, we comprehensively assessed the influence of architecture design on data requirements. Furthermore, to evaluate the performance and generalizability of fish tracking models, we present the Multiple_Fish_Tracking_2022 (MFT22) dataset. The results demonstrate that TFMFT achieves 30.6% IDF1 (Identification F-Score) on the MFT22 dataset, outperforming the state-of-the-art by 10.9% and showcasing superior performance over other models. The resources and pre-trained model will be available at: https://github.com/vranlee/TFMFT.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Transformer-based assignment decision network for multiple object tracking
    Psalta, Athena
    Tsironis, Vasileios
    Karantzalos, Konstantinos
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 241
  • [2] A transformer-based lightweight method for multiple-object tracking
    Wan, Qin
    Ge, Zhu
    Yang, Yang
    Shen, Xuejun
    Zhong, Hang
    Zhang, Hui
    Wang, Yaonan
    Wu, Di
    IET IMAGE PROCESSING, 2024, 18 (09) : 2329 - 2345
  • [3] Transformer-Based Maneuvering Target Tracking
    Zhao, Guanghui
    Wang, Zelin
    Huang, Yixiong
    Zhang, Huirong
    Ma, Xiaojing
    SENSORS, 2022, 22 (21)
  • [4] Transformer-based convolutional forgetting knowledge tracking
    Tieyuan Liu
    Meng Zhang
    Chuangying Zhu
    Liang Chang
    Scientific Reports, 13
  • [5] Transformer-based convolutional forgetting knowledge tracking
    Liu, Tieyuan
    Zhang, Meng
    Zhu, Chuangying
    Chang, Liang
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [6] A Transformer-Based Network for Hyperspectral Object Tracking
    Gao, Long
    Chen, Langkun
    Liu, Pan
    Jiang, Yan
    Xie, Weiying
    Li, Yunsong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [7] Video text tracking with transformer-based local search
    Zhou, Xingsheng
    Wang, Cheng
    Wang, Xinggang
    Liu, Wenyu
    NEUROCOMPUTING, 2024, 609
  • [8] A Novel Transformer-Based Model for Dialog State Tracking
    Miao, Yu
    Liu, Kuilong
    Yang, Wenbo
    Yang, Changyuan
    CROSS-CULTURAL DESIGN-APPLICATIONS IN BUSINESS, COMMUNICATION, HEALTH, WELL-BEING, AND INCLUSIVENESS, CCD 2022, PT III, 2022, 13313 : 148 - 156
  • [9] Swin transformer-based traffic video text tracking
    Yu, Jinyao
    Qian, Jiangbo
    Xin, Yu
    Wang, Chong
    Dong, Yihong
    APPLIED INTELLIGENCE, 2024, 54 (21) : 10581 - 10595
  • [10] Hierarchical Transformer-based Query by Multiple Documents
    Huang, Zhiqi
    Naseri, Shahrzad
    Bonab, Hamed
    Sarwar, Sheikh Muhammad
    Allan, James
    PROCEEDINGS OF THE 2023 ACM SIGIR INTERNATIONAL CONFERENCE ON THE THEORY OF INFORMATION RETRIEVAL, ICTIR 2023, 2023, : 105 - 115